view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 152
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples Paper • 2502.09650 • Published Feb 11, 2025