Lewis Tunstall
PRO
lewtun
1,337 followers · 131 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
New activity about 11 hours ago in futurehouse/labbench2: Mismatch in SourceQuality rows vs those reported in paper
Updated a model about 14 hours ago: lewtun/olmo3-7b-lora_ds200_ep32
Published a model about 16 hours ago: lewtun/olmo3-7b-lora_ds200_ep32
lewtun's models (292)
lewtun/olmo3-7b-lora_ds200_ep32 • Updated about 13 hours ago
lewtun/data-repetition-replication • Updated 3 days ago
lewtun/wordle-grpo-Qwen3-1.7B • Text Generation • 2B • Updated Jan 14 • 3
lewtun/qwen3-4b-s1k-sft • Text Generation • 4B • Updated Jan 8 • 1
lewtun/Qwen3-32B-SFT-20250908120312 • Updated Sep 8, 2025
lewtun/Qwen3-0.6B-SFT-20250908114642 • Text Generation • 0.6B • Updated Sep 8, 2025 • 6
lewtun/Qwen3-32B-SFT-20250908115917 • Updated Sep 8, 2025
lewtun/SmolLM2-135M-Instruct-SFT-Trackio-Test • Text Generation • 0.1B • Updated Aug 7, 2025 • 1
lewtun/Qwen3-0.6B-SFT-Trackio-Test • Text Generation • 0.6B • Updated Aug 7, 2025 • 6
lewtun/Qwen3-0.6B-SFT-Demo • Text Generation • 0.6B • Updated Aug 7, 2025
lewtun/zephyr-7b-gemma-dpo • Updated Jul 24, 2025
lewtun/zephyr-7b-gemma-sft • Updated Jul 24, 2025
lewtun/smollm2-360M-sft • Updated Jul 24, 2025
lewtun/smollm2-1.7B-sft • Updated Jul 24, 2025
lewtun/smollm-360M-instruct-new • Updated Jul 24, 2025
lewtun/mistral-7b-sft-constitutional-ai • Updated Jul 24, 2025
lewtun/mistral-7b-dpo-constitutional-ai • Updated Jul 24, 2025
lewtun/zephyr-7b-sft-full • Text Generation • 266k • Updated Jul 24, 2025 • 6
lewtun/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated Apr 16, 2025 • 1
lewtun/does-deepspeed-still-work-sft • Text Generation • 2B • Updated Apr 16, 2025 • 1
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-Llama • Text Generation • 1B • Updated Apr 16, 2025 • 1
lewtun/Qwen2.5-1.5B-SFT-Capybara-No-Packing • Text Generation • 2B • Updated Apr 15, 2025 • 8
lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-ChatML • Text Generation • 1B • Updated Apr 15, 2025 • 3
lewtun/Qwen2.5-7B-Instruct-GRPO • Updated Mar 21, 2025
lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO • Updated Mar 6, 2025
lewtun/dummy-config-test • Text Generation • Updated Feb 20, 2025 • 1
lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO • Updated Feb 18, 2025
lewtun/smollm2-distill-default-chat-template • Text Generation • 2B • Updated Feb 17, 2025 • 3
lewtun/qwen2.5-1.5b-distill-default-chat-template • 2B • Updated Feb 17, 2025
lewtun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated Feb 7, 2025