-
-
-
-
-
-
Inference Providers
Active filters: gsm8k
August4293/mistral_gsm8k_ssl_it1
Updated
August4293/mistral_gsm8k_ssl_it2
Updated
Text Generation
• Updated
• 16
• mradermacher/Qwen-0.5B-GRPO-GGUF
0.5B • Updated
• 183
mradermacher/prem-1B-grpo-GGUF
Reinforcement Learning
• 1B • Updated
• 125
yeok/DeepScaleR-1.5B-Preview-GSM8K-Demo
2B • Updated
• 2
LahiruWije/Qwen2.5-0.5B-Instruct-GPRO-GSM8K
Question Answering
• 0.5B • Updated
eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small
3B • Updated
• 405
eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-data-enhanced-with-deepseek-v3
3B • Updated
• 152
eagle0504/qwen-2-5-3b-instruct-using-openai-gsm8k-data-enhanced-with-deepseek-v4
3B • Updated
• 140
Text Generation
• Updated
• 1
• 1
koolkarni-Atharva10/Nano_R1
Reinforcement Learning
• Updated
Text Generation
• Updated
• 2
• 3
klei1/bleta-logjike-27b-gguf
27B • Updated
• 15
solarpunkin/OpenELM-450M-gsm8k-LoRA
darshjoshi16/phi2-lora-math
Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl
Reinforcement Learning
• 2B • Updated
• 27
• 3
Text Generation
• 0.6B • Updated
• 4
• 2
shivs28/jee_nujan_mix_v2_base
Text Generation
• 2B • Updated
• 2
tahamajs/Qwen3-4B-GSM8k-GRPO-Unsloth
4B • Updated
• 3
tahamajs/gemma-3-1b-it-finetune-gsmk8
Text Generation
• 1.0B • Updated
• 1
TroglodyteDerivations/smol_lm_3b
Updated
safouaneelg/Apertus-8B-Instruct-2509-GSM8k-SFT
Text Generation
• 8B • Updated
• 1
kotekjedi/qwen3-32b-lora-jailbreak-detection-merged
Text Generation
• 33B • Updated
• 4
yassine-boua/olmo-gsm8k-finetuned
Text Generation
• Updated
kotekjedi/qwen3-32b-lora-jailbreak-detection-merged_v2
Text Generation
• 33B • Updated
mradermacher/qwen3-32b-lora-jailbreak-detection-merged_v2-GGUF
33B • Updated
• 55
karthik/verl-qwen2.5-0.5b-gsm8k-ppo-step360
Text Generation
• 0.5B • Updated
• 1
DeryFerd/Qwen2.5-Math-7B-Instruct-Distill-Phi2-2.5K-MixMath
Text Generation
• 3B • Updated
• 20
• 1
DeryFerd/Qwen2.5-Math-Coder-Distill-Phi-2-4.4K-MixMathCode
Text Generation
• 3B • Updated
• 2
• 5