Thinking While Listening: Simple Test Time Scaling For Audio Classification • Paper • 2509.19676 • Published Sep 24, 2025
Large Language Models Implicitly Learn to See and Hear Just By Reading • Paper • 2505.17091 • Published May 20, 2025
Whisper-GPT: A Hybrid Representation Audio Large Language Model • Paper • 2412.11449 • Published Dec 16, 2024