Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem • Paper • 2512.24873 • Published 3 days ago • 39
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe • Paper • 2509.18154 • Published Sep 16, 2025 • 52
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times • Paper • 2512.16093 • Published 17 days ago • 90
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models • Paper • 2512.24618 • Published 4 days ago • 62
Nested Browser-Use Learning for Agentic Information Seeking • Paper • 2512.23647 • Published 5 days ago • 17
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction • Paper • 2512.25073 • Published 3 days ago • 29
SpotEdit: Selective Region Editing in Diffusion Transformers • Paper • 2512.22323 • Published 8 days ago • 36
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss • Paper • 2512.23447 • Published 5 days ago • 87
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone • Paper • 2512.22615 • Published 7 days ago • 39
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation • Paper • 2512.23705 • Published 5 days ago • 41