RL
Models for RL
virtuoussy/Qwen2.5-7B-Instruct-RLVR 8B • Updated May 4, 2025 • 17 • 18
virtuoussy/Math-RLVR Viewer • Updated Apr 16, 2025 • 782k • 60 • 9
virtuoussy/Multi-subject-RLVR Viewer • Updated Apr 16, 2025 • 579k • 75 • 67
agentica-org/DeepCoder-14B-Preview Text Generation • Updated May 11, 2025 • 369 • • 680
Enhancements
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts Paper • 2406.12034 • Published Jun 17, 2024 • 16
nvidia/Llama-3.1-Nemotron-70B-Instruct Updated Apr 13, 2025 • 30 • 568
microsoft/phi-4 Text Generation • Updated Nov 24, 2025 • 904k • 2.22k