LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know? Paper • 2605.28721 • Published 7 days ago • 15
VibeSearchBench: Benchmarking Long-horizon Proactive Search in the Wild Paper • 2605.27882 • Published 7 days ago • 13
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 22 days ago • 195
REDSearcher Collection REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents. Project page: https://redsearchagent.github.io/ • 5 items • Updated Feb 28 • 4
HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam Paper • 2602.13964 • Published Feb 15 • 11
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents Paper • 2602.14234 • Published Feb 15 • 28
REDSearcher Collection REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents. Project page: https://redsearchagent.github.io/ • 5 items • Updated Feb 28 • 4
REDSearcher Collection REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents. Project page: https://redsearchagent.github.io/ • 5 items • Updated Feb 28 • 4
REDSearcher Collection REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents. Project page: https://redsearchagent.github.io/ • 5 items • Updated Feb 28 • 4
REDSearcher Collection REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents. Project page: https://redsearchagent.github.io/ • 5 items • Updated Feb 28 • 4