Spanish and Italian model: 3b-es_it-ft-research_release
https://huggingface.co/canopylabs/3b-es_it-ft-research_release
Orpheus 3B 0.1 Finetuned
Note on emotional tags:
- Italian supports sigh, laugh, cough, sniffle, groan, yawn, gemito, gasp
- Spanish supports groan, chuckle, gasp, resoplido, laugh, yawn, cough
More info: https://canopylabs.ai/releases/orpheus_can_speak_any_language
Orpheus TTS is a state-of-the-art, Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model has been finetuned to deliver human-level speech synthesis, achieving exceptional clarity, expressiveness, and real-time streaming performances.
Model Details
Model Capabilities
- Human-Like Speech: Natural intonation, emotion, and rhythm that is superior to SOTA closed source models
- Zero-Shot Voice Cloning: Clone voices without prior fine-tuning
- Guided Emotion and Intonation: Control speech and emotion characteristics with simple tags
- Low Latency: ~200ms streaming latency for realtime applications, reducible to ~100ms with input streaming
Model Sources
- GitHub Repo: https://github.com/canopyai/Orpheus-TTS
- Blog Posts: https://canopylabs.ai/releases
Model Misuse
Do not use our models for impersonation without consent, misinformation or deception (including fake news or fraudulent calls), or any illegal or harmful activity. By using this model, you agree to follow all applicable laws and ethical guidelines. We disclaim responsibility for any use.