submission19025
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning
upvoted an article about 1 month ago
Re-understanding KL Approximation from an RL-for-LLM Lens: Notes on “Approximating KL Divergence”
liked a dataset 4 months ago
ftajwar/deduplicated_dapo_dataset
Organizations
None yet