arxiv:2605.29447
Hao Jiang
Lutalica
AI & ML interests
Multimodal LLMs, LLM Reasoning, Reinforcement Learning, Efficient Inference
Recent Activity
authored a paper 2 days ago
Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents
upvoted a paper 3 days ago
Pyramid Texture Filtering
authored a paper 6 days ago
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use