bzikst/xtts-v2-fork:35f3f05f | Run with an API on Replicate

These are the input fields you can set when running this model through the API. If you omit a field, its default value is used.

Field	Type	Default value	Description
text	string	Hi there! This is forked Coqui XTTS-2 model with 17 languages supported.	Text to synthesize
speaker	string		Original speaker audio (wav, mp3, m4a, ogg, or flv). Duration should be at least 6 seconds.
language	string (enum)	en	Output language for the synthesised speech. Options: en, es, fr, de, it, pt, pl, tr, ru, nl, cs, ar, zh-cn, hu, ko, ja, hi
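
As a rough illustration of how the fields above fit together, the sketch below assembles an input payload and checks the language against the listed options before anything is sent. The helper name `build_input` is hypothetical, and treating `speaker` as required (because the table lists no default for it) is an assumption. The example URL is a placeholder, not a real audio file.

```python
# Language codes copied from the "language" row of the field table.
SUPPORTED_LANGUAGES = (
    "en", "es", "fr", "de", "it", "pt", "pl", "tr", "ru",
    "nl", "cs", "ar", "zh-cn", "hu", "ko", "ja", "hi",
)

# Default text copied verbatim from the "text" row of the field table.
DEFAULT_TEXT = (
    "Hi there! This is forked Coqui XTTS-2 model with 17 languages supported."
)


def build_input(speaker: str, text: str = DEFAULT_TEXT, language: str = "en") -> dict:
    """Hypothetical helper: build the input dict for this model.

    Assumption: `speaker` is required, since the table gives it no default.
    """
    if not speaker:
        raise ValueError("speaker audio reference must not be empty")
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    return {"text": text, "speaker": speaker, "language": language}


# Placeholder URL for illustration only.
payload = build_input("https://example.com/voice.wav", language="fr")
print(payload["language"])
```

A dict like this would typically be passed as the `input` argument when calling the model through a client library. Validating the enum locally simply surfaces a typo in the language code before a request is made.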

This is the shape of the response you’ll get when you run this model through the API.

Schema

{"format": "uri", "title": "Output", "type": "string"}
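
The schema above describes the output as a single string in URI format. A minimal sketch of checking a returned value against that shape follows. The function name `is_output_uri` is hypothetical, and restricting the check to `http`/`https` URLs is an assumption; the schema itself only says `uri`.

```python
from urllib.parse import urlparse


def is_output_uri(value: object) -> bool:
    """Hypothetical check: is `value` a string that looks like an HTTP(S) URL?

    Assumption: the output points at a downloadable file over http or https.
    """
    if not isinstance(value, str):
        return False
    parsed = urlparse(value)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


# Placeholder values for illustration only.
print(is_output_uri("https://example.com/output.wav"))
print(is_output_uri("not a uri"))
```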