zwhy's picture

2 2

zwhy

XiaohuaWang

·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

upvoted a paper 1 day ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

updated a model 3 months ago

XiaohuaWang/math-interactive-rl

View all activity

Organizations

authored a paper 1 day ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published 10 days ago • 20

upvoted a paper 1 day ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published 10 days ago • 20

updated a model 3 months ago

XiaohuaWang/math-interactive-rl

published a model 3 months ago

XiaohuaWang/math-interactive-rl

upvoted a paper 3 months ago

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Paper • 2601.05107 • Published Jan 8 • 24

liked a Space 10 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a dataset over 1 year ago

allenai/WildChat-1M

Viewer • Updated Oct 17, 2024 • 838k • 11.4k • 426

updated a dataset almost 2 years ago

FudanDNN-NLP/Wiki_Med_DB

Updated Jul 2, 2024 • 3

updated a model almost 2 years ago

FudanDNN-NLP/llama3-8b-instruct-ragga-disturb

Text Generation • 8B • Updated Jul 2, 2024 • 3