Jie Cheng's picture

16 12

Jie Cheng

jinachris

·

https://github.com/CJReinforce

CJReinforce

AI & ML interests

Reinforcement learning, LLM

Recent Activity

upvoted a paper 1 day ago

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

liked a model 9 days ago

stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_S

liked a model 9 days ago

stepfun-ai/Step-3.5-Flash-FP8

View all activity

Organizations

None yet

authored 2 papers 9 months ago

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Paper • 2504.15275 • Published Apr 21, 2025 • 2

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Paper • 2410.00564 • Published Oct 1, 2024 • 1