You're looking at a specific version of this model. Jump to the model overview.
zsxkib /realistic-voice-cloning:0a9c7c55
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.
Field | Type | Default value | Description |
---|---|---|---|
song_input |
string
|
Upload your audio file here.
|
|
rvc_model |
string
(enum)
|
Squidward
Options: Squidward, MrKrabs, Plankton, Drake, Vader, Trump, Biden, Obama, Guitar, Voilin, CUSTOM |
RVC model for a specific voice. If using a custom model, this should match the name of the downloaded model. If a 'custom_rvc_model_download_url' is provided, this will be automatically set to the name of the downloaded model.
|
custom_rvc_model_download_url |
string
|
URL to download a custom RVC model. If provided, the model will be downloaded (if it doesn't already exist) and used for prediction, regardless of the 'rvc_model' value.
|
|
pitch_change |
string
(enum)
|
no-change
Options: no-change, male-to-female, female-to-male |
Adjust pitch of AI vocals. Options: `no-change`, `male-to-female`, `female-to-male`.
|
index_rate |
number
|
0.5
Max: 1 |
Control how much of the AI's accent to leave in the vocals.
|
filter_radius |
integer
|
3
Max: 7 |
If >=3: apply median filtering median filtering to the harvested pitch results.
|
rms_mix_rate |
number
|
0.25
Max: 1 |
Control how much to use the original vocal's loudness (0) or a fixed loudness (1).
|
pitch_detection_algorithm |
string
(enum)
|
rmvpe
Options: rmvpe, mangio-crepe |
Best option is rmvpe (clarity in vocals), then mangio-crepe (smoother vocals).
|
crepe_hop_length |
integer
|
128
|
When `pitch_detection_algo` is set to `mangio-crepe`, this controls how often it checks for pitch changes in milliseconds. Lower values lead to longer conversions and higher risk of voice cracks, but better pitch accuracy.
|
protect |
number
|
0.33
Max: 0.5 |
Control how much of the original vocals' breath and voiceless consonants to leave in the AI vocals. Set 0.5 to disable.
|
main_vocals_volume_change |
number
|
0
|
Control volume of main AI vocals. Use -3 to decrease the volume by 3 decibels, or 3 to increase the volume by 3 decibels.
|
backup_vocals_volume_change |
number
|
0
|
Control volume of backup AI vocals.
|
instrumental_volume_change |
number
|
0
|
Control volume of the background music/instrumentals.
|
pitch_change_all |
number
|
0
|
Change pitch/key of background music, backup vocals and AI vocals in semitones. Reduces sound quality slightly.
|
reverb_size |
number
|
0.15
Max: 1 |
The larger the room, the longer the reverb time.
|
reverb_wetness |
number
|
0.2
Max: 1 |
Level of AI vocals with reverb.
|
reverb_dryness |
number
|
0.8
Max: 1 |
Level of AI vocals without reverb.
|
reverb_damping |
number
|
0.7
Max: 1 |
Absorption of high frequencies in the reverb.
|
output_format |
string
(enum)
|
mp3
Options: mp3, wav |
wav for best quality and large file size, mp3 for decent quality and small file size.
|
Output schema
The shape of the response you’ll get when you run this model with an API.
{'format': 'uri', 'title': 'Output', 'type': 'string'}