Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jie Cheng's picture
16 12

Jie Cheng

jinachris
dark-pen's profile picture
·
https://github.com/CJReinforce
  • CJReinforce

AI & ML interests

Reinforcement learning, LLM

Recent Activity

upvoted a paper 1 day ago
LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth
liked a model 9 days ago
stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S
liked a model 9 days ago
stepfun-ai/Step-3.5-Flash-FP8
View all activity

Organizations

None yet

authored 2 papers 9 months ago

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Paper • 2504.15275 • Published Apr 21, 2025 • 2

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Paper • 2410.00564 • Published Oct 1, 2024 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs