Running 67 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 67 Building and scaling RL environments for LLM training
Running 17 Defeating the trainer-generator precision mismatch in TRL 🎯 17 Download research PDF (Pro access required)
Running Featured 77 Distilling 100B+ Models 40x Faster with TRL 📝 77 TRL distillation for 100B+ teachers, 40x faster
google/timesfm-2.5-200m-transformers Time Series Forecasting • 0.2B • Updated 26 days ago • 162k • 80