Generate speech

Generate natural-sounding speech from text with these powerful models. Clone your own voice or pick from a variety of languages and speaking styles.

Recommended models

zsxkib / dia

Dia 1.6B by Nari Labs. Generates realistic dialogue audio from text, including non-verbal cues and voice cloning

Updated 17 hours ago

6.8K runs

minimax / voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

Updated 2 months, 1 week ago

6.9K runs

lucataco / csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

Updated 3 months, 3 weeks ago

554 runs

lucataco / orpheus-3b-0.1-ft

Orpheus 3B: high-quality, emotive text-to-speech

Updated 3 months, 3 weeks ago

19.2K runs

cjwbw / voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Updated 4 months ago

10.4K runs

fermatresearch / spanish-f5-tts

An F5-TTS model fine-tuned for Spanish

Updated 8 months ago

678 runs

x-lance / f5-tts

F5-TTS, the new state-of-the-art in open source voice cloning

Updated 9 months ago

26.3K runs

platform-kit / mars5-tts

A novel speech model for insane prosody.

Updated 1 year ago

483 runs

chenxwh / openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

Updated 1 year, 1 month ago

65.5K runs

cjwbw / parler-tts

Lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

Updated 1 year, 3 months ago

2.6K runs

camenduru / metavoice

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

Updated 1 year, 5 months ago

12.3K runs

adirik / styletts2

Generates speech from text

Updated 1 year, 5 months ago

131.3K runs

lucataco / pheme

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

Updated 1 year, 6 months ago

532 runs

zsxkib / realistic-voice-cloning

Create song covers with any RVC v2 trained AI voice from audio files.

Updated 1 year, 8 months ago

865.9K runs

cjwbw / seamless_communication

SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

Updated 1 year, 10 months ago

84K runs

awerks / neon-tts

NeonAI Coqui AI TTS Plugin.

Updated 1 year, 11 months ago

136.7K runs

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Updated 2 years, 2 months ago

299.7K runs

afiaka87 / tortoise-tts

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

Updated 2 years, 11 months ago

171.4K runs