eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777

upvoted 2 papers 7 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 260

Language Models are Injective and Hence Invertible

Paper • 2510.15511 • Published Oct 17, 2025 • 70

upvoted an article 10 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280

updated 5 models over 1 year ago

published 5 models over 1 year ago

ugonfor/mistral-7b-qlora-arc-mixlora

Text Generation • 7B • Updated Feb 20, 2025 • 1

ugonfor/mistral-7b-4bits

Text Generation • 7B • Updated Feb 20, 2025 • 3

ugonfor/mistral-7b-qlora-arc

Text Generation • 7B • Updated Feb 20, 2025 • 1

ugonfor/mistral-7b-loftQ-arc

Text Generation • 7B • Updated Feb 20, 2025

ugonfor/mistral-7b-qlora-arc-cot

Text Generation • 7B • Updated Feb 20, 2025

Hyogon Ryu

Recent Activity

ugonfor's activity

SmolLM3: smol, multilingual, long-context reasoner
