You're looking at a specific version of this model. Jump to the model overview.
sakemin /musicgen-remixer:cdba0e0d
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
model_version |
string
(enum)
|
stereo-chord
Options: stereo-chord, stereo-chord-large, chord, chord-large |
Model type. Computations take longer when using `large` or `stereo` models.
|
prompt |
string
|
A description of the music you want to generate.
|
|
music_input |
string
|
An audio file input for the remix.
|
|
multi_band_diffusion |
boolean
|
False
|
If `True`, the EnCodec tokens will be decoded with MultiBand Diffusion. Not compatible with `stereo` models.
|
normalization_strategy |
string
(enum)
|
loudness
Options: loudness, clip, peak, rms |
Strategy for normalizing audio.
|
beat_sync_threshold |
number
|
When beat syncing, if the gap between generated downbeat timing and input audio downbeat timing is larger than `beat_sync_threshold`, consider the beats are not corresponding. If `None` or `-1`, `1.1/(bpm/60)` will be used as the value. 0.75 is a good value to set.
|
|
large_chord_voca |
boolean
|
True
|
If `True`, more chords like 7th, diminished and etc are used. If `False` only 12 major and 12 minor chords are used.
|
chroma_coefficient |
number
|
1
Min: 0.5 Max: 2 |
Coefficient value multiplied to multi-hot chord chroma.
|
top_k |
integer
|
250
|
Reduces sampling to the k most likely tokens.
|
top_p |
number
|
0
|
Reduces sampling to tokens with cumulative probability of p. When set to `0` (default), top_k sampling is used.
|
temperature |
number
|
1
|
Controls the 'conservativeness' of the sampling process. Higher temperature means more diversity.
|
classifier_free_guidance |
integer
|
3
|
Increases the influence of inputs on the output. Higher values produce lower-varience outputs that adhere more closely to inputs.
|
output_format |
string
(enum)
|
wav
Options: wav, mp3 |
Output format for generated audio.
|
seed |
integer
|
Seed for random number generator. If `None` or `-1`, a random seed will be used.
|
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{'format': 'uri', 'title': 'Output', 'type': 'string'}