VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control Paper • 2601.05138 • Published about 14 hours ago • 4
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published about 13 hours ago • 7
ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing Paper • 2601.03467 • Published 2 days ago • 4
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published 4 days ago • 12
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 5 days ago • 30
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 4 days ago • 22
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper • 2601.01576 • Published 5 days ago • 8
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Paper • 2601.02356 • Published 4 days ago • 12
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 4 days ago • 27