MicrowaveJack (Trevor Miller)

liked 3 Spaces 4 months ago

The Ultra-Scale Playbook

🌌

3.73k

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training

The Smol Training Playbook

📚

3.04k

The secrets to building world-class LLMs

liked a model 5 months ago

microsoft/UserLM-8b

Text Generation • Updated Oct 9, 2025 • 397 • 363

liked a Space over 1 year ago

Qwen2.5 Coder Artifacts

🐢

1.72k

Generate and preview code from your app description

liked a model over 1 year ago

BAAI/bge-small-en-v1.5

Feature Extraction • 33.4M • Updated Feb 22, 2024 • 6.93M • • 419

liked 2 datasets over 1 year ago

gretelai/gretel-math-gsm8k-v1

Viewer • Updated Oct 16, 2024 • 24.9k • 178 • 39

TIGER-Lab/SKGInstruct

Preview • Updated Apr 9, 2024 • 136 • 28

liked 2 models over 1 year ago

google/gemma-scope

Updated Aug 29, 2024 • 194

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 7.36M • • 5.55k

liked 2 models almost 2 years ago

VAGOsolutions/Kraken-LoRA

Updated May 28, 2024 • 3 • 38

failspy/Llama-3-8B-Instruct-MopeyMule

Text Generation • 8B • Updated May 30, 2024 • 31 • 86

liked a dataset almost 2 years ago

TIGER-Lab/MMLU-Pro

Benchmark • Updated Jan 19 • 12.1k • 109k • 447

liked 3 models about 2 years ago

liked 3 models over 2 years ago

thesephist/contra-bottleneck-t5-large-wikipedia

Text Generation • Updated Oct 9, 2023 • 616 • 20

NumbersStation/nsql-llama-2-7B

Text Generation • Updated about 14 hours ago • 86 • 82

stabilityai/stablecode-instruct-alpha-3b

Text Generation • 3B • Updated Aug 8, 2023 • 302

liked a model almost 3 years ago

MrHup/coloring-book

Updated May 21, 2023 • 40

Trevor Miller

AI & ML interests

Organizations

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

microsoft/UserLM-8b

Qwen2.5 Coder Artifacts

BAAI/bge-small-en-v1.5

gretelai/gretel-math-gsm8k-v1

TIGER-Lab/SKGInstruct

google/gemma-scope

meta-llama/Llama-3.1-8B-Instruct

VAGOsolutions/Kraken-LoRA

failspy/Llama-3-8B-Instruct-MopeyMule

TIGER-Lab/MMLU-Pro

TheBloke/CodeLlama-70B-hf-GGUF

mistralai/Mixtral-8x7B-Instruct-v0.1

microsoft/phi-2

thesephist/contra-bottleneck-t5-large-wikipedia

NumbersStation/nsql-llama-2-7B

stabilityai/stablecode-instruct-alpha-3b

MrHup/coloring-book

Trevor Miller

AI & ML interests

Organizations

MicrowaveJack's activity

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

Qwen2.5 Coder Artifacts