WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation Paper • 2605.10912 • Published 6 days ago • 39
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory Paper • 2605.15128 • Published 3 days ago • 53
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published Apr 7 • 120
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models Paper • 2605.14906 • Published 3 days ago • 67
view article Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 2 days ago • 19
Localized Sentiment Models Collection A group of sentiment detection models dedicated for specific languages • 2 items • Updated Jan 10, 2024 • 1
Finance Sentiment Collection A collections of models for detecting financial sentiment. • 8 items • Updated Jan 10, 2024 • 1
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 3 days ago • 38
MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments Paper • 2605.09131 • Published 8 days ago • 38
Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics Paper • 2605.12178 • Published 5 days ago • 59
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards Paper • 2605.10899 • Published 6 days ago • 72
δ-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 5 days ago • 110
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 5 days ago • 170
MemPrivacy: Privacy-Preserving Personalized Memory Management for Edge-Cloud Agents Paper • 2605.09530 • Published 7 days ago • 141
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 9 days ago • 94
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 10 days ago • 183
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published 14 days ago • 106