Code Llama Family Collection This collection hosts the transformers repos of the Code Llama release • 12 items • Updated Dec 6, 2024 • 68
Llama 3.1 Evals Collection This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.1 models, including the configurations, • 6 items • Updated Dec 6, 2024 • 20
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 709
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI • 7 days ago • 459
MapTrace: Scalable Data Generation for Route Tracing on Maps Paper • 2512.19609 • Published Dec 22, 2025 • 2
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 13 days ago • 31
Audio dataset Collection N datasets showcase how to configure and load audio datasets • 11 items • Updated Aug 2, 2024 • 6
Format: CSV and TSV Collection 6 datasets showcase how to configure and load CSV and TSV files. • 6 items • Updated Nov 23, 2023 • 9
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 604 items • Updated about 19 hours ago • 83
The Pensieve Paradigm: Stateful Language Models Mastering Their Own Context Paper • 2602.12108 • Published 14 days ago • 13
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28, 2025 • 176
Article Community Evals: Because we're done trusting black-box leaderboards over the community • 23 days ago • 80
NoCode-bench: A Benchmark for Evaluating Natural Language-Driven Feature Addition Paper • 2507.18130 • Published Jul 24, 2025 • 1