chenxwh/cosyvoice2-0.5b:669b1cd6
Input schema
The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.
Field | Type | Default value | Description |
---|---|---|---|
source_audio | string | | Source audio |
source_transcript | string | | Transcript of the source audio; you can transcribe it first with a model such as Whisper |
tts_text | string | | Text of the audio to generate |
task | string (enum) | zero-shot voice clone | An enumeration. Options: zero-shot voice clone, cross-lingual voice clone, Instructed Voice Generation |
instruction | string | | Instruction for the Instructed Voice Generation task |
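As a rough illustration of how the fields above fit together, here is a minimal Python sketch that assembles an input payload and checks `task` against the listed options. The helper `build_input` is hypothetical and not part of any client library. Treating `source_audio` and `tts_text` as required is an assumption, since the schema does not say which fields are mandatory. The audio URL in the usage note is a placeholder.

```python
from typing import Dict, Optional

# Task names copied from the "task" enum in the input schema.
TASKS = (
    "zero-shot voice clone",
    "cross-lingual voice clone",
    "Instructed Voice Generation",
)
DEFAULT_TASK = TASKS[0]  # default listed in the schema


def build_input(
    source_audio: str,
    tts_text: str,
    source_transcript: Optional[str] = None,
    task: str = DEFAULT_TASK,
    instruction: str = "",
) -> Dict[str, str]:
    """Assemble an input payload (hypothetical helper, for illustration only)."""
    if task not in TASKS:
        raise ValueError(f"unknown task {task!r}; expected one of {TASKS}")
    payload = {
        "source_audio": source_audio,
        "tts_text": tts_text,
        "task": task,
    }
    # Optional fields are only sent when the caller supplies them.
    if source_transcript is not None:
        payload["source_transcript"] = source_transcript
    if instruction:
        payload["instruction"] = instruction
    return payload
```

For example, `build_input("https://example.com/speaker.wav", "Hello there.")` returns a dictionary with `task` set to the default, which could then be passed as the input of whichever API client you use.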
Output schema
The shape of the response you’ll get when you run this model with an API.
Schema
{"format": "uri", "title": "Output", "type": "string"}
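Since the schema above describes the output as a single string in URI format, a caller may want a quick sanity check before trying to fetch it. The sketch below is a deliberately loose check that uses only the standard library. The function name `is_output_uri` is hypothetical, and requiring only a non-empty scheme is an assumption, not a full URI validation.

```python
from urllib.parse import urlparse


def is_output_uri(value: object) -> bool:
    """Return True if value is a string that parses with a URI scheme."""
    if not isinstance(value, str):
        return False
    # urlparse does not raise on ordinary strings; a missing scheme yields "".
    return bool(urlparse(value).scheme)
```

A value that passes this check can then be downloaded or handed to an audio player; a value that fails most likely indicates an error response, not generated audio.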