APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 23 items • Updated 2 days ago • 44
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 6 days ago • 47
Running Featured 75 Cohere Transcribe WebGPU ⚡ 75 Run Cohere Transcribe locally in your browser on WebGPU.
Running Featured 75 Nemotron 3 Nano WebGPU ⚛ 75 A compact reasoning-capable model running in your browser.