arxiv:2509.02479
Xiaosen Zheng
xszheng2020
AI & ML interests
Code AI and Data-Centric AI.
Recent Activity
upvoted a paper about 14 hours ago
CausalMix: Data Mixture as Causal Inference for Language Model Training
liked a dataset 10 days ago
open-alchemy/code-alchemy
liked a dataset 10 days ago
nvidia/Nemotron-Pretraining-Code-v2