SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 12 days ago • 185
Stream-T1: Test-Time Scaling for Streaming Video Generation Paper • 2605.04461 • Published 18 days ago • 103
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published Apr 8 • 72
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published Mar 14 • 6
XToM: Exploring the Multilingual Theory of Mind for Large Language Models Paper • 2506.02461 • Published Jun 3, 2025 • 3
ReasonNavi: Human-Inspired Global Map Reasoning for Zero-Shot Embodied Navigation Paper • 2602.15864 • Published Jan 26
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published Mar 14 • 6
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published Mar 14 • 6