From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation Paper • 2510.07624 • Published Oct 8, 2025 • 7
Solving a Million-Step LLM Task with Zero Errors Paper • 2511.09030 • Published Nov 12, 2025 • 20 • 3
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper • 2509.24372 • Published Sep 29, 2025 • 9
Article 🌁#85: Curiosity, Open Source, and Timing: The Formula Behind DeepSeek’s Phenomenal Success Jan 27, 2025 • 6
Evolution and The Knightian Blindspot of Machine Learning Paper • 2501.13075 • Published Jan 22, 2025 • 6
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models Paper • 2412.02980 • Published Dec 4, 2024 • 15