arxiv:2508.16949
YANG ZHOU
Yang-Zhou
AI & ML interests
RLHF and DPO
Recent Activity
upvoted a paper 13 days ago
The Trinity of Consistency as a Defining Principle for General World Models liked
a dataset about 2 months ago
sojuL/RubricHub_v1 upvoted a paper about 2 months ago
RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation Organizations
None yet