--- library_name: mlx tags: - mlx - music-generation - ace-step - audio - text-to-music base_model: ACE-Step/ACE-Step1.5 --- # ACE-Step 1.5 MLX (4-bit Quantized) 4-bit quantized MLX weights for [ACE-Step/ACE-Step1.5](https://huggingface.co/ACE-Step/ACE-Step1.5). - Decoder and encoder quantized to 4-bit (group_size=64) - VAE, tokenizer, and detokenizer kept in full precision - **2.2GB** main model + 0.7GB VAE + 2.4GB text encoder ## Usage ```python from mlx_audio.tts import load model = load("mlx-community/ACE-Step1.5-MLX-4bit") for result in model.generate( text="upbeat electronic dance music with energetic synthesizers", duration=30.0, ): audio = result.audio # [samples, 2] stereo @ 48kHz sample_rate = result.sample_rate ``` ## With Vocals ```python for result in model.generate( text="English pop song with clear female vocals, catchy melody", lyrics="""[verse] Dance with me tonight Under the neon lights [chorus] We're alive, we're on fire Dancing higher and higher """, duration=60.0, vocal_language="en", ): ... ``` The model uses a 5Hz Language Model planner by default (`use_lm=True`) which generates a song blueprint before running the diffusion transformer.