A General Theoretical Paradigm to Understand Learning from Human Preferences Paper • 2310.12036 • Published Oct 18, 2023 • 20
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 88
Model Capability Dominates: Inference-Time Optimization Lessons from AIMO 3 Paper • 2603.27844 • Published Apr 16 • 3
Model Capability Dominates: Inference-Time Optimization Lessons from AIMO 3 Paper • 2603.27844 • Published Apr 16 • 3
natnitaract/exams-basic-and-quantum-cryptography-and-security-latex Viewer • Updated Mar 25 • 104 • 152
pythainlp/thainer-corpus-v2-base-model Token Classification • 0.1B • Updated Mar 23, 2023 • 198k • • 16
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published Mar 12 • 65