AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts Paper ⢠2601.20730 ⢠Published 2 days ago ⢠17
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization Paper ⢠2601.16480 ⢠Published 8 days ago ⢠50
view reply @sysia48 , I think the comment is random (or at least pseudo random š ). Yes I also received this harassment with no reason, really frustrating š
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper ⢠2512.04677 ⢠Published Dec 4, 2025 ⢠170
Patient-Similarity Cohort Reasoning in Clinical Text-to-SQL Paper ⢠2601.09876 ⢠Published 16 days ago ⢠6
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper ⢠2601.15876 ⢠Published 9 days ago ⢠89
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper ⢠2601.14724 ⢠Published 10 days ago ⢠73
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper ⢠2601.14724 ⢠Published 10 days ago ⢠73
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper ⢠2601.14724 ⢠Published 10 days ago ⢠73
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper ⢠2601.13836 ⢠Published 11 days ago ⢠34
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Paper ⢠2601.11077 ⢠Published 15 days ago ⢠64
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper ⢠2601.01554 ⢠Published 26 days ago ⢠57
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper ⢠2512.07525 ⢠Published Dec 8, 2025 ⢠59