mradermacher/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF • 27B • Updated 18 days ago • 16.4k • 19
In-Context Reinforcement Learning for Tool Use in Large Language Models • Paper • 2603.08068 • Published 17 days ago • 41