Understanding Reward Hacking in Text-to-Image Reinforcement Learning Paper • 2601.03468 • Published Jan 6 • 1
AutoRubric-T2I: Robust Rule-Based Reward Model for Text-to-Image Alignment Paper • 2605.17602 • Published 6 days ago • 16
QG-CoC: Question-Guided Chain-of-Captions for Large Multimodal Models Paper • 2511.03206 • Published Nov 5, 2025
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns? Paper • 2407.05134 • Published Jul 6, 2024 • 1