Inference Providers
Active filters: GRPO
TianheWu/VisualQuality-R1-7B
Reinforcement Learning
• 8B • Updated • 1.77k
• 11
Delta-Vector/Nanuq-R1-14B
Text Generation
• 14B • Updated • 6
• 3
mradermacher/Nanuq-R1-14B-i1-GGUF
14B • Updated • 136
• 1
OpenMOSS-Team/SciJudge-4B
Text Generation
• 4B • Updated • 341
• 6
OpenMOSS-Team/SciJudge-30B
Text Generation
• 31B • Updated • 341
• 10
airev-ae/Qwen-0.8B-AgentJSON
Text Generation
• 0.8B • Updated • 643
• 1
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
• 0.5B • Updated • 120
• 24
prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
• 1B • Updated • 11
• 1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF
1B • Updated • 59
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF
1B • Updated • 261
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF
Text Generation
• 1B • Updated • 6
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF
Text Generation
• 1B • Updated • 2
Reinforcement Learning
• Updated • 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B • Updated • 106
• 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B • Updated • 527
• 1
alpha-ai/Deep-Reason-SMALL-V0-GGUF
3B • Updated • 29
• 1
alpha-ai/Deep-Reason-SMALL-V0
Text Generation
• 3B • Updated • 8
• 2
mradermacher/Deep-Reason-SMALL-V0-GGUF
3B • Updated • 49
• 2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF
3B • Updated • 120
• 1
alpha-ai/qwen2.5-reason-thought-lite-GGUF
3B • Updated • 52
alpha-ai/qwen2.5-reason-thought-lite
Text Generation
• 3B • Updated • 2
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF
3B • Updated • 49
• 2
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite
Text Generation
• 3B • Updated • 5
mradermacher/Cogito-R1-GGUF
33B • Updated • 53
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k
Text Generation
• 1B • Updated • 2
• mradermacher/Cogito-R1-i1-GGUF
33B • Updated • 111
AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV
3B • Updated • 39
• 1
Nitral-AI/Captain-Eris_Violet-GRPO-v0.420
Text Generation
• 12B • Updated • 25
• • 24
prithivMLmods/SmolLM2_135M_Grpo_Gsm8k
Text Generation
• 0.1B • Updated • 13
• 8
prithivMLmods/SmolLM2_135M_Grpo_Checkpoint
Text Generation
• 0.1B • Updated • 6
• 1