xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_190 8B • Updated Oct 9, 2025 • 7
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_185 8B • Updated Oct 9, 2025 • 4
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_180 8B • Updated Oct 9, 2025 • 7
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_175 8B • Updated Oct 9, 2025 • 8
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_120 8B • Updated Oct 9, 2025 • 3
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_170 8B • Updated Oct 9, 2025 • 7
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_165 8B • Updated Oct 9, 2025 • 6
xinpeng/big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_15 8B • Updated Oct 9, 2025 • 4
xinpeng/big-math-hard_tiny_instruct_cheat_rm_loophole_v2_mixed_0.5 Viewer • Updated Dec 1, 2025 • 25.8k • 8