nvidia

GitHub
https://github.com/nvidia

nvidia / canary-qwen-2.5b

🎤 The best open-source speech-to-text model as of Jul 2025, transcribing audio with a record 5.63% WER and enabling AI tasks like summarization directly from speech ✨

4.5K runs
Public

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

728K runs
Public

nvidia / pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

460 runs
Public

nvidia / sana

A fast image model with wide artistic range and resolutions up to 4096x4096

181.4K runs
Public

nvidia / parakeet-rnnt-1.1b

🗣️ NVIDIA + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

16.1K runs
Public

nvidia / prismer

A Vision-Language Model with An Ensemble of Experts

1.7K runs
Public