li sheng
bambisheng
AI & ML interests
None yet
Recent Activity
new activity 9 days ago
TsinghuaC3I/ZEDA-Evaluation: Add dataset card and link to paper/code
upvoted a collection 11 days ago
ZEDA
authored a paper 11 days ago
Post-Trained MoE Can Skip Half Experts via Self-Distillation