VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 1 day ago • 23 • 2
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 9 days ago • 5 • 3
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience Paper • 2512.17260 • Published 19 days ago • 48 • 3
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published about 1 month ago • 28 • 4
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators Paper • 2512.06963 • Published about 1 month ago • 3 • 2
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published about 1 month ago • 28 • 4
SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling Paper • 2512.05343 • Published Dec 5, 2025 • 24 • 2
ProPhy: Progressive Physical Alignment for Dynamic World Simulation Paper • 2512.05564 • Published Dec 5, 2025 • 5 • 2
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38 • 3
Self-Improving VLM Judges Without Human Annotations Paper • 2512.05145 • Published Dec 2, 2025 • 19 • 2
World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty Paper • 2512.05927 • Published Dec 5, 2025 • 11 • 2
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published Dec 4, 2025 • 24 • 2
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published Dec 4, 2025 • 18 • 2
On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral Paper • 2512.04220 • Published Dec 3, 2025 • 13 • 2
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 150 • 6