Article: Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding • nvidia • Mar 19 • 47
Collection: Minitron • A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 7 days ago • 64