Sean McLeish PRO
smcleish
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5
updated a collection 1 day ago
compression
published a model 1 day ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5
Organizations
Diff Datasets
Datasets containing github diffs
- CarperAI/github-diffs-deduped
Viewer • Updated • 10.7M • 176 • 3
- bigcode/github-commits-diff-dedup-pjjs-april
Viewer • Updated • 146k • 3.67k • 3
- ASSERT-KTH/megadiff-single-function
Viewer • Updated • 72.4k • 29 • 3
- ASSERT-KTH/megadiff
Viewer • Updated • 657k • 72 • 2
compression
- Leon-Sean-Dev/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated
- Leon-Sean-Dev/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated
- Leon-Sean-Dev/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated
- Leon-Sean-Dev/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
models 60
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5
Updated
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-2e-5
Updated
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5
Updated
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5
Updated
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-1e-5
Updated
smcleish/tinyllama_4_8_4_last_8_layers_add_adapter
Text Generation • 0.8B • Updated • 42
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-attn-mlp-ov256-stage3-lr-1e-5
Updated
smcleish/deepscaler-1.5b-8k-dapo-random-step400-hf
Text Generation • 2B • Updated • 3
smcleish/deepscaler-1.5b-8k-dapo-random-step200-hf
Text Generation • 2B • Updated • 3
smcleish/deepscaler-1.5b-8k-dapo-hard-step400-hf
Text Generation • 2B • Updated • 3
datasets 6
smcleish/deepscaler_outputs
Updated • 2
smcleish/error_at_k_saved_start_0_end_20000_num_completions_10
Viewer • Updated • 18.9k • 3
smcleish/retrofitting-llama-fineweb-edu-tokenized
Viewer • Updated • 332M • 334
smcleish/scaling-laws-cache
Viewer • Updated • 13 • 215 • 1
smcleish/CLRS-Text-train
Viewer • Updated • 2.15M • 196 • 2
smcleish/CLRS-Text-test
Viewer • Updated • 503k • 113