This collection includes KnowRL-Nemotron-1.5B, train data, test data from the KnowRL project.
Linhao Yu
HasuerYu
AI & ML interests
None yet
Recent Activity
commentedon a paper 26 days ago
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance upvoted a paper about 1 month ago
Co-Evolving Policy DistillationOrganizations
None yet