Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper
•
2512.22615
•
Published
•
40
None defined yet.
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models