kakaocorp/kanana-2-30b-a3b-thinking-2601 Text Generation • 31B • Updated 23 days ago • 1.24k • 55
naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation • 33B • Updated Jan 6 • 3.75k • 395
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 3 days ago • 625k • 616
The Smol Training Playbook 📚 • 2.96k • The secrets to building world-class LLMs
Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 26 items • Updated 11 days ago • 103