arxiv:2605.16928
Richard ZHou
zykRichard
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Rethinking Cross-Layer Information Routing in Diffusion Transformers submitted a paper 3 days ago
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps