Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 4 days ago • 37
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents Paper • 2602.07274 • Published 6 days ago • 23
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 11 days ago • 32
Closing the Loop: Universal Repository Representation with RPG-Encoder Paper • 2602.02084 • Published 10 days ago • 82
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 29 days ago • 126
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-20251217_step_10000 2B • Updated Dec 19, 2025
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.01-zw1_1_1-20251216_step_10000 2B • Updated Dec 19, 2025
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0-zw1_1_1-20251216_step_10000 2B • Updated Dec 19, 2025
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-longest_only-20251218_step_10000 2B • Updated Dec 19, 2025
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-minspan2-20251217_step_10000 2B • Updated Dec 19, 2025
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-minspan2-20251217_step_19000 2B • Updated Dec 19, 2025