arxiv:2605.09063
Seungone Kim
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
authored a paper about 5 hours ago
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation
authored a paper about 5 hours ago
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs