In a Training Loop 🔄

Subarno Sadat Barno

barnobarno666

AI & ML interests

reinforcement learning

Recent Activity

liked a dataset 6 days ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

liked a dataset 6 days ago

zwhe99/DeepMath-103K

liked a dataset 6 days ago

TeichAI/claude-4.5-opus-high-reasoning-250x

Organizations

liked 3 datasets 6 days ago

#1 opened 9 days ago by gergopool

liked a model 9 days ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Text Generation • 28B • Updated 2 days ago • 15.7k • 307

liked 2 datasets 2 months ago

RUC-AIBOX/OlymMATH-eval

Viewer • Updated May 11, 2025 • 579k • 141 • 4

brando/olympiad-bench-imo-math-boxed-825-v2-21-08-2024

Viewer • Updated Nov 6, 2024 • 1.65k • 60 • 5

liked 2 models 3 months ago

Synthyra/ESM2-8M

Fill-Mask • 7.52M • Updated 4 days ago • 1.23k • 2

biomap-research/proteinglm-100b-int4

50B • Updated Mar 17, 2025 • 81 • 11

liked a model 4 months ago

Adilbai/ppo-LunarLander-v2

Reinforcement Learning • Updated Jun 9, 2025 • 2

updated a model 4 months ago

barnobarno666/Whisper-medium-bangla

Automatic Speech Recognition • 0.8B • Updated Nov 23, 2025 • 6

published a model 4 months ago

barnobarno666/Whisper-medium-bangla

Automatic Speech Recognition • 0.8B • Updated Nov 23, 2025 • 6

upvoted a collection 4 months ago

Gemma 3 Release

Collection

28 items • Updated Aug 11, 2025 • 619

liked a Space 4 months ago

The Smol Training Playbook

📚

3.04k

The secrets to building world-class LLMs

upvoted 2 papers 5 months ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6, 2025 • 23

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 181

liked a model 5 months ago

unsloth/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Jun 2, 2025 • 165k • 88

liked a model 6 months ago

unsloth/Qwen3-1.7B-Base-unsloth-bnb-4bit

Text Generation • Updated May 13, 2025 • 6.53k • 3

upvoted 2 papers 6 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 91

barnobarno666's activity

Claude distillation