moon4sake/Llama-3.1-8B_s0.25_block • Text Generation • 4B
moon4sake/Llama-3.1-8B_s0.50_block • Text Generation • 8B
moon4sake/DeepSeek-R1-Distill-Qwen-1.5B_s0.75_block • Text Generation • 1.0B • 3
moon4sake/DeepSeek-R1-Distill-Qwen-1.5B_s0.50_block • Text Generation • 1B • 2
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.10_channel • Text Generation • 7B • 3
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.75_channel • Text Generation • 2B • 2
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.50_channel • Text Generation • 4B • 2
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.25_channel • Text Generation • 6B • 2
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.75_block • Text Generation • 4B • 2
moon4sake/Llama-3.1-8B_s0.75_block • Text Generation • 4B • 2
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.50_block • Text Generation • 5B • 2
moon4sake/DeepSeek-R1-Distill-Llama-8B_s0.25_block • Text Generation • 7B • 2
moon4sake/Llama-3.1-8B_s0.75_channel • Text Generation • 2B
moon4sake/Llama-3.1-8B_s0.50_channel • Text Generation • 4B • 2
moon4sake/Llama-3.1-8B_s0.25_channel • Text Generation • 6B
moon4sake/DeepSeek-R1-Distill-Qwen-1.5B_s0.75_channel • Text Generation • 0.4B • 5
moon4sake/DeepSeek-R1-Distill-Qwen-1.5B_s0.50_channel • Text Generation • 0.9B • 2
moon4sake/Qwen2.5-Math-1.5B_s0.75_channel • Text Generation • 0.4B • 2
moon4sake/Qwen2.5-Math-1.5B_s0.50_channel • Text Generation • 0.9B
moon4sake/DeepSeek-R1-Distill-Qwen-1.5B_s0.25_channel • Text Generation • 1B • 2
moon4sake/DeepSeek-R1-Distill-Qwen-1.5B_s0.10_channel • Text Generation • 2B • 2
moon4sake/Qwen2.5-Math-1.5B_s0.25_channel • Text Generation • 1B
moon4sake/Qwen2.5-Math-1.5B_s0.10_channel • Text Generation • 2B • 2
moon4sake/Llama-3.1-8B_s0.10_channel • Text Generation • 7B