Factorized Learning for Temporally Grounded Video-Language Models
Paper
•
2512.24097
•
Published
•
6
None defined yet.
Factorized Learning for Temporally Grounded Video-Language Models
SpotEdit: Selective Region Editing in Diffusion Transformers