suminhthanh/vixtts:5222190b

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field, its default value is used.

| Field | Type | Default value | Description |
|---|---|---|---|
| text | string | Xin chào các bạn | Text to synthesize |
| speaker | string | | Original speaker audio (wav, mp3, m4a, ogg, or flv). Duration should be at least 6 seconds. |
| language | string (enum) | vi | Output language for the synthesised speech. Options: vi, en, es, fr, de, it, pt, pl, tr, ru, nl, cs, ar, zh, hu, ko, hi |
| cleanup_voice | boolean | True | Whether to apply denoising to the speaker audio (microphone recordings) |
| use_deepfilter | boolean | True | Whether to use deepfilter |
| normalize_text | boolean | True | Whether to normalize the text |
| aws_access_key_id | string | | AWS ACCESS KEY ID |
| aws_secret_access_key | string | | AWS SECRET ACCESS KEY |
| bucket_name | string | | AWS S3 Bucket Name |
| cdn_download_url | string | | CDN Download URL |
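
As an illustration of how the fields above combine with their defaults, here is a minimal Python sketch that assembles an input payload. The helper `build_input`, its validation rules, and the example speaker URL are hypothetical and not part of the model's API. It also assumes that `speaker` must be supplied by the caller because no default is listed, and that the four storage fields are optional.

```python
# Sketch only: field names, defaults, and language options are copied from
# the field list above. build_input and its checks are illustrative helpers.

LANGUAGES = {
    "vi", "en", "es", "fr", "de", "it", "pt", "pl", "tr",
    "ru", "nl", "cs", "ar", "zh", "hu", "ko", "hi",
}

# Fields that have a documented default value.
DEFAULTS = {
    "text": "Xin chào các bạn",
    "language": "vi",
    "cleanup_voice": True,
    "use_deepfilter": True,
    "normalize_text": True,
}

# Fields listed without a default (assumed optional here, except speaker).
NO_DEFAULT = {
    "speaker",
    "aws_access_key_id",
    "aws_secret_access_key",
    "bucket_name",
    "cdn_download_url",
}

KNOWN_FIELDS = set(DEFAULTS) | NO_DEFAULT


def build_input(speaker, **overrides):
    """Merge caller overrides onto the documented defaults and validate them."""
    unknown = set(overrides) - KNOWN_FIELDS
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    payload = {**DEFAULTS, **overrides, "speaker": speaker}
    if payload["language"] not in LANGUAGES:
        raise ValueError(f"unsupported language: {payload['language']!r}")
    return payload


if __name__ == "__main__":
    # Hypothetical speaker reference; any reachable audio file of 6+ seconds.
    print(build_input("https://example.com/speaker.wav", language="en"))
```

With the official Python client, a payload like this would typically be passed as the `input` argument of `replicate.run`, together with the full version identifier, which appears here only in shortened form.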

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'properties': {'path': {'title': 'Path'}}, 'title': 'Output', 'type': 'object'}
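
The schema above describes an object with a single `path` property and does not state that property's type. A minimal sketch of reading it from a response, assuming the response has already been decoded into a Python dict (the helper name `extract_path` and the sample value are hypothetical):

```python
def extract_path(output):
    """Return the 'path' value from an Output-shaped dict.

    Raises ValueError if the response is not an object or lacks 'path'.
    The schema does not constrain the value's type, so it is returned as-is.
    """
    if not isinstance(output, dict):
        raise ValueError("expected an object (dict) as output")
    if "path" not in output:
        raise ValueError("output has no 'path' property")
    return output["path"]


if __name__ == "__main__":
    # Hypothetical response value, for illustration only.
    print(extract_path({"path": "output.wav"}))
```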