
nvidia / canary-qwen-2.5b
🎤The best open-source speech-to-text model as of Jul 2025, transcribing audio with record 5.63% WER and enabling AI tasks like summarization directly from speech✨
4.5K runs
Public

nvidia / sana-sprint-1.6b
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
728K runs
Public

nvidia / pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.
460 runs
Public

nvidia / sana
A fast image model with wide artistic range and resolutions up to 4096x4096
181.4K runs
Public

nvidia / parakeet-rnnt-1.1b
🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝
16.1K runs
Public

nvidia / prismer
A Vision-Language Model with An Ensemble of Experts
1.7K runs
Public