arxiv:2604.13602
zwhy
XiaohuaWang
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges
updated a model 3 months ago
XiaohuaWang/math-interactive-rl