Qwen2.5 Omni 7B Demo
Chat with text, audio, images, and video, and get spoken replies
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Conversational speech generation
Full working POC demonstrating text to speech and speech
(Unofficial) Gradio demo for Spark-TTS
Create a textured 3D model from a single image
Generate images from text prompts with FLUX.1-schnell