Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.
Field | Type | Default value | Description |
---|---|---|---|
audio_file | string | | URL or path to the audio file to be pitch-corrected |
scale | string (enum) | auto | Musical key for correction. 'auto' will detect the key, 'closest' uses chromatic correction. Options: auto, closest, A:maj, A:min, Bb:maj, Bb:min, B:maj, B:min, C:maj, C:min, Db:maj, Db:min, D:maj, D:min, Eb:maj, Eb:min, E:maj, E:min, F:maj, F:min, Gb:maj, Gb:min, G:maj, G:min, Ab:maj, Ab:min |
correction_strength | number | 0.7 | Strength of pitch correction (0.0 to 1.0). Higher values = stronger correction. Max: 1 |
adaptive_strength | boolean | True | Adapt correction strength based on voice confidence. |
smooth_transitions | boolean | True | Smooth transitions between voiced/unvoiced segments. |
plot | boolean | False | Generate a pitch correction visualization plot. |
output_format | string (enum) | wav | Output format for generated audio. Options: wav, mp3 |
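The input fields can be assembled and checked locally before a request is sent. The following is a minimal sketch, not the model's own client code: `build_input`, `DEFAULTS`, `SCALES`, and the example URL are hypothetical names introduced here, and the range check on `correction_strength` assumes the 0.0 to 1.0 range given in the field description.

```python
from typing import Any, Dict

# Allowed values copied from the input schema (assumed complete).
KEYS = ["A", "Bb", "B", "C", "Db", "D", "Eb", "E", "F", "Gb", "G", "Ab"]
SCALES = {"auto", "closest"} | {f"{k}:{m}" for k in KEYS for m in ("maj", "min")}
OUTPUT_FORMATS = {"wav", "mp3"}

# Default values as listed in the schema; audio_file has no default.
DEFAULTS: Dict[str, Any] = {
    "scale": "auto",
    "correction_strength": 0.7,
    "adaptive_strength": True,
    "smooth_transitions": True,
    "plot": False,
    "output_format": "wav",
}


def build_input(audio_file: str, **overrides: Any) -> Dict[str, Any]:
    """Hypothetical helper: merge overrides onto the defaults and validate."""
    if not isinstance(audio_file, str) or not audio_file:
        raise ValueError("audio_file must be a non-empty string")
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown field(s): {sorted(unknown)}")

    payload = {"audio_file": audio_file, **DEFAULTS, **overrides}

    if payload["scale"] not in SCALES:
        raise ValueError(f"unsupported scale: {payload['scale']!r}")
    if not 0.0 <= payload["correction_strength"] <= 1.0:
        raise ValueError("correction_strength must be between 0.0 and 1.0")
    if payload["output_format"] not in OUTPUT_FORMATS:
        raise ValueError(f"unsupported output_format: {payload['output_format']!r}")
    return payload


# Example: override two fields and keep the remaining defaults.
payload = build_input(
    "https://example.com/vocals.wav",  # placeholder URL, not a real file
    scale="C:maj",
    correction_strength=0.9,
)
```

The resulting dictionary is the kind of input object an API client would send; which client to use and how to authenticate are outside what this page specifies.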
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{"format": "uri", "title": "Output", "type": "string"}
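Because the output is a single string in URI format, client code only needs to check the value and decide where to save it. A minimal sketch under that assumption follows; `check_output` and `local_name` are hypothetical helpers, and the URLs are placeholders.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def check_output(output: object) -> str:
    """Hypothetical helper: confirm the response is a string with a URI scheme."""
    if not isinstance(output, str):
        raise TypeError(f"expected a string, got {type(output).__name__}")
    if not urlparse(output).scheme:
        raise ValueError(f"not a URI: {output!r}")
    return output


def local_name(output_uri: str, default: str = "corrected.wav") -> str:
    """Derive a local file name from the URI path, falling back to a default."""
    name = PurePosixPath(urlparse(output_uri).path).name
    return name or default


# Example with a placeholder URI of the kind the schema describes.
uri = check_output("https://example.com/outputs/corrected.mp3")
filename = local_name(uri)
```

The file name can then be handed to any download step, for example `urllib.request.urlretrieve(uri, filename)`.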