view article Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day 24 days ago • 46
view article Article Australian-made LLM beats OpenAI and Google at legal retrieval Oct 23, 2025 • 26
The Majority is not always right: RL training for solution aggregation Paper • 2509.06870 • Published Sep 8, 2025 • 16
Large Language Models are Locally Linear Mappings Paper • 2505.24293 • Published May 30, 2025 • 14
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models Paper • 2505.11711 • Published May 16, 2025 • 11
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 247
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18, 2025 • 139