13 22 1

Jinyang Wu

Jinyang23

https://orcid.org/my-orcid?orcid=0009-0006-0220-616X

jinyangwu

AI & ML interests

large language models, reasoning, agentic rl

Recent Activity

upvoted a paper about 8 hours ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

upvoted a paper 6 days ago

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

upvoted a paper 13 days ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published 19 days ago • 21

upvoted a paper 6 days ago

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

Paper • 2602.14492 • Published 7 days ago • 18

upvoted a paper 13 days ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published 14 days ago • 152

authored 2 papers 14 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 18 days ago • 57

Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

Paper • 2602.01064 • Published 22 days ago • 2

upvoted a paper 14 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 18 days ago • 57

submitted a paper to Daily Papers 15 days ago

Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

Paper • 2602.01064 • Published 22 days ago • 2

upvoted 3 papers 18 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published 25 days ago • 9

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 21 days ago • 34

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Paper • 2602.02419 • Published 21 days ago • 4

upvoted 2 papers 20 days ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 25 days ago • 156

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 21 days ago • 238

upvoted a paper 22 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 25 days ago • 12

submitted a paper to Daily Papers 22 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 25 days ago • 12

New activity in Jinyang23/Spark-1.5B-ScienceWorld 24 days ago

Update README.md

#2 opened 24 days ago by

shuo-yan

New activity in Jinyang23/Spark-1.5B-WebShop 24 days ago

Update README.md

#2 opened 24 days ago by

shuo-yan

New activity in Jinyang23/Spark-1.5B-ALFWorld 24 days ago

Update README.md

#2 opened 24 days ago by

shuo-yan

authored 2 papers 25 days ago

Double: Breaking the Acceleration Limit via Double Retrieval Speculative Parallelism

Paper • 2601.05524 • Published Jan 9 • 1

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 27 days ago • 22

updated a model 26 days ago

Jinyang23/Spark-1.5B-ALFWorld

Reinforcement Learning • 2B • Updated 24 days ago • 10

Jinyang Wu

AI & ML interests

Recent Activity

Organizations

Jinyang23's activity

Update README.md

Update README.md

Update README.md