Aiden Mitchell's picture

Aiden Mitchell

aidenmitchell

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

liked a model 23 days ago

yusufizzetmurat/fomc-rv-qlike-forecaster-dji

liked a model 25 days ago

View all activity

Organizations

None yet

upvoted a paper 7 days ago

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

Paper • 2602.21103 • Published 25 days ago • 8

upvoted 4 papers about 1 month ago

Anticipate and Learn: Unleashing Idle-Time Compute in Proactive Agents

Paper • 2605.25971 • Published May 25 • 16

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

Auditing Multimodal LLM Raters: Central Tendency Bias in Clinical Ordinal Scoring

Paper • 2605.16386 • Published May 11 • 3

Video Models Can Reason with Verifiable Rewards

Paper • 2605.15458 • Published May 14 • 11

upvoted 2 papers about 2 months ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published Apr 30 • 59

X2SAM: Any Segmentation in Images and Videos

Paper • 2605.00891 • Published Apr 27 • 25

upvoted 7 papers 3 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 329

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 638

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

Paper • 2603.06679 • Published Mar 30 • 7

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published Mar 25 • 183

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

upvoted 2 papers 4 months ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 150

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526