memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_180B_Harvard_5B Text Generation • 1B • Updated about 5 hours ago
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_180B_Harvard_5B Text Generation • 1B • Updated about 5 hours ago
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_185B Text Generation • 1B • Updated 1 day ago • 19
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative_185B Text Generation • 1B • Updated 1 day ago • 19
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative Text Generation • 1B • Updated 3 days ago • 29
memorizations/HF_Llama_1B_WebOrganizer_Without_Creative Text Generation • 1B • Updated 3 days ago • 29
aflah/llama32_1b_dclm-SL-2048-PGBS-16-GAS-4-NGPU-8-NNODES-1-TW-PERF-step-23999 1B • Updated Mar 17, 2025 • 1
aflah/llama32_1b_dclm-SL-2048-PGBS-16-GAS-4-NGPU-8-NNODES-1-TW-PERF-step-23999 1B • Updated Mar 17, 2025 • 1
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 37 items • Updated Mar 2 • 376