Active filters: RL
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation
• Updated
• 907
• 40
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF
Reinforcement Learning
• 8B • Updated
• 46
• 4
prithivMLmods/Blitzar-Coder-4B-F.1
Text Generation
• 4B • Updated
• 17
• 9
nvidia/Nemotron-Cascade-8B
Text Generation
• Updated
• 31.6k
• 61
bartowski/nvidia_Nemotron-Cascade-8B-GGUF
Text Generation
• 8B • Updated
• 474
• 3
bartowski/nvidia_Nemotron-Cascade-8B-Thinking-GGUF
Text Generation
• 8B • Updated
• 379
• 3
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation
• Updated
• 12
stanfordnlp/SteamSHP-flan-t5-xl
Updated
• 5
• 43
stanfordnlp/SteamSHP-flan-t5-large
Updated
• 85
• 33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
• 2B • Updated
• 7
• 5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
2B • Updated
• 71
Daemontatox/Llama3.3-70B-CogniLink
Text Generation
• 71B • Updated
• 30
• 3
mradermacher/Llama3.3-70B-CogniLink-GGUF
Text Generation
• 71B • Updated
• 57
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF
Text Generation
• 71B • Updated
• 121
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
• Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text Generation
• Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
• 0.5B • Updated
• 9
• 24
Reinforcement Learning
• Updated
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B • Updated
• 153
• 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B • Updated
• 80
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
0.5B • Updated
• 62
Text Generation
• 684B • Updated
• 60
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
0.5B • Updated
• 101
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
• 0.5B • Updated
• 16
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF
0.5B • Updated
• 61
Lyte/QuadConnect2.5-1.5B-v0.1.0b
Text Generation
• 2B • Updated
• 33
• 1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF
2B • Updated
• 69
• 1
mradermacher/Zireal-0-GGUF
mradermacher/Magellanic-Qwen-25B-R999-GGUF
25B • Updated
• 187
• 1
mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF
25B • Updated
• 101
• 1