VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published 4 days ago • 9
VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published 4 days ago • 9
VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published 4 days ago • 9
DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion Paper • 2510.20766 • Published Oct 23, 2025 • 37
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published 28 days ago • 31
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper • 2502.09619 • Published Feb 13, 2025 • 36
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper • 2508.09983 • Published Aug 13, 2025 • 70
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models Paper • 2503.18033 • Published Mar 23, 2025 • 30
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization Paper • 2503.07038 • Published Mar 10, 2025
EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Paper • 2405.18065 • Published May 28, 2024
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention Paper • 2602.01801 • Published 11 days ago • 28
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention Paper • 2602.01801 • Published 11 days ago • 28