Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular • 16 days ago • 91
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 3 days ago • 549
🎭 Avatars Collection The latest AI-powered technologies usher in a new era of realistic avatars! 🚀 • 75 items • Updated Apr 20, 2025 • 92
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 97
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8, 2024 • 83