Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 3 days ago • 17
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 6 days ago • 119
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 22 days ago • 32
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 28 days ago • 166
VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction Paper • 2601.05966 • Published 27 days ago • 23
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published about 1 month ago • 101
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published Jan 5 • 29
SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published Dec 23, 2025 • 43
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published Dec 17, 2025 • 65
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published Dec 9, 2025 • 119
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published Dec 2, 2025 • 71
Thinking with Programming Vision: Towards a Unified View for Thinking with Images Paper • 2512.03746 • Published Dec 3, 2025 • 17