Name		Name	Last commit message	Last commit date
parent directory ..
audioldm		audioldm
configs		configs
diffusers		diffusers
layers		layers
tools		tools
README.md		README.md
modelling_deberta_v2.py		modelling_deberta_v2.py
models.py		models.py
mustango.jpg		mustango.jpg
mustango.py		mustango.py
requirements.txt		requirements.txt

README.md

Mustango: Toward Controllable Text-to-Music Generation

Demo | Model | Website and Examples | Paper | Dataset

Meet Mustango, an exciting addition to the vibrant landscape of Multimodal Large Language Models designed for controlled music generation. Mustango leverages the Latent Diffusion Model (LDM), Flan-T5 encoder of Tango with musical features to do the magic!

🔥 Live demo available on Replicate and HuggingFace.

Quickstart Guide

Generate music from a text prompt:

import IPython
import soundfile as sf
from mustango import Mustango

model = Mustango("declare-lab/mustango")

prompt = "This is a new age piece. There is a flute playing the main melody with a lot of staccato notes. The rhythmic background consists of a medium tempo electronic drum beat with percussive elements all over the spectrum. There is a playful atmosphere to the piece. This piece can be used in the soundtrack of a children's TV show or an advertisement jingle."

music = model.generate(prompt)
sf.write(f"{prompt}.wav", audio, samplerate=16000)
IPython.display.Audio(data=audio, rate=16000)

Installation

git clone https://github.com/declare-lab/tango
cd tango/mustango
pip install -r requirements.txt
cd diffusers
pip install -e .

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mustango

mustango

README.md

Mustango: Toward Controllable Text-to-Music Generation

Quickstart Guide

Installation

Files

mustango

Directory actions

More options

Directory actions

More options

Latest commit

History

mustango

Folders and files

parent directory

README.md

Mustango: Toward Controllable Text-to-Music Generation

Quickstart Guide

Installation