NGC Catalog
Explore
Search
Support
API Catalog
Forum
Search
Containers
DeepSeek-R1
NVIDIA Developer Program
+1
Llama-3.1-Nemotron-70B-Instruct
NVIDIA Developer Program
+1
PyTorch
Collections
Omniverse Kit (FB)
NVIDIA AI Enterprise
+2
DeepStream SDK
Omniverse Kit App Streaming
NVIDIA AI Enterprise
+2
Models
StyleGAN3 pretrained models
PeopleNet
TrafficCamNet
Resources
Kit SDK - Windows (PB25h1)
NVIDIA AI Enterprise
+2
Kit SDK - Linux (PB25h1)
NVIDIA AI Enterprise
+2
Riva Skills Quick Start
Helm Charts
GPU Operator
NVIDIA NIM Operator
Welcome Guest
Setup
Terms of Use
Theme
Use System Settings
Light
Dark
Sign In / Sign Up
Search
Search thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts—ready to accelerate AI, digital twins, and HPC from cloud to edge.
Search
Container (3)
Collection (1)
Model (45)
Resource (1)
Helm Chart (2)
NVIDIA Enterprise
(0)
NVIDIA Enterprise
NVIDIA Enterprise
NVIDIA AI Enterprise Supported
4
NVIDIA AI Enterprise
3
NVIDIA Developer Program
2
NVIDIA NIM
(0)
NVIDIA NIM
NVIDIA NIM
Accelerate custom generative AI app deployment using pre-built containers with optimized AI models.
NVIDIA NIM
3
NIM Container GPUs
(0)
NIM Container GPUs
NIM Container GPUs
Use Case
(0)
Use Case
Use Case
Text to Speech
13
Automatic Speech Recognition
1
Natural Language Processing
1
Speech to Text
1
Translation
1
NVIDIA Platform
(0)
NVIDIA Platform
NVIDIA Platform
NeMo
20
Deep Learning Examples
3
Riva
2
Industry
(0)
Industry
Industry
Automotive / Transportation
2
Solution
(0)
Solution
Solution
Conversational AI
8
DL
7
AI
5
Publisher
(0)
Publisher
Publisher
Nvidia
46
Nvidia deep learning examples
5
Policy
(0)
Policy
Policy
Displaying 52 results
Sort: Most Popular
Sort: Most Popular
Sort: Relevance
Sort: Most Popular
Sort: Last Updated
Sort: Alphabetical (A-Z)
Sort: Alphabetical (Z-A)
Sort: Relevance
Sort: Most Popular
Sort: Last Updated
Sort: Alphabetical (A-Z)
Sort: Alphabetical (Z-A)
Search
tts
label: tts
Clear Filters
NVIDIA
Riva Speech Skills
Riva Speech Skills is a scalable Conversational AI service platform.
Automatic Speech Recognition
Conversational AI
+3
Speech to Text
Text to Speech
Translation
Container
1w
Updated
06/16/2026 UTC
NVIDIA AI Enterprise
NVIDIA
Riva TTS NIM
RIVA TTS NIM provide easy access to state-of-the-art text to speech models, capable of synthesizing English speech from text
Automotive / Transportation
Conversational AI
+2
Riva
Text to Speech
Container
7mo
Updated
11/06/2025 UTC
NVIDIA
Riva Speech Skills
Riva Speech Skills Helm Chart
Helm Chart
14mo
Updated
04/24/2025 UTC
NVIDIA Developer Program
+1
NVIDIA AI Enterprise
NVIDIA
TTS FastPitch HifiGAN Riva
RIVA TTS NIM provide easy access to state-of-the-art text to speech models, capable of synthesizing English speech from text
Automotive / Transportation
Conversational AI
+2
Riva
Text to Speech
Container
7mo
Updated
11/06/2025 UTC
NVIDIA
Riva TTS English US Auxiliary Files
Contains files used in rmir creation
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
Riva Skills Embedded Quick Start
Scripts and utilities for getting started with Riva Speech Skills on Embedded platforms
Resource
1w
Updated
06/16/2026 UTC
NVIDIA
TTS En TalkNet
Speech Synthesis model trained on female English speech
NeMo
Model
>3y
Updated
04/04/2023 UTC
NVIDIA Developer Program
+1
NVIDIA AI Enterprise
NVIDIA
Riva NIM
Riva NIM Helm Chart
Helm Chart
6mo
Updated
12/18/2025 UTC
NVIDIA
WaveGlow LJS 256 Channels
WaveGlow model weights pre-trained on the LJ Speech dataset to be used with https://github.com/NVIDIA/waveglow.
Conversational AI
DL
+1
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA Deep Learning Examples
HiFi-GAN PyT checkpoint (22kHz, AMP)
HiFi-GAN v1 PyTorch checkpoint trained on 8GPU with AMP on LJSpeech-1.1 (22kHz).
Deep Learning Examples
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
TTS Vocoder Hifigan
HiFiGAN Speech Synthesis model
NeMo
Model
18mo
Updated
11/27/2024 UTC
NVIDIA
TTS En FastPitch
FastPitch Speech Synthesis model trained on female English speech.
NeMo
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
WaveGlow
WaveGlow is a flow-based network capable of generating high quality speech from mel-spectrograms.
AI
Conversational AI
+2
DL
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA Deep Learning Examples
HiFi-GAN PyT checkpoint (FastPitch ftune, 22kHz, AMP)
HiFi-GAN v1 PyTorch checkpoint trained on 8GPU with AMP on LJSpeech-1.1 (22kHz), fine-tuned on FastPitch outputs.
Deep Learning Examples
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
Tacotron2 LJSpeech
Model checkpoints for the Tacotron 2 model trained with NeMo.
Conversational AI
NeMo
+1
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
Speech Synthesis English FastPitch
Mel-Spectrogram prediction conditioned on input text with LJSpeech voice.
Model
>2y
Updated
10/06/2023 UTC
NVIDIA
TTS En E2E FastPitch Hifigan
FastPitch+HiFiGAN End-to-End Speech Synthesis model trained on female English speech
NeMo
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
Speech Synthesis HiFi-GAN
GAN-based waveform generator from mel-spectrograms.
Model
>2y
Updated
10/06/2023 UTC
NVIDIA
WaveGlow LJSpeech
Model checkpoints for the WaveGlow model trained with NeMo.
Conversational AI
NeMo
+1
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
Riva TTS Spanish US FastPitch
Spanish US FastPitch model
Model
23mo
Updated
07/02/2024 UTC
NVIDIA
TTS En FastSpeech 2
FastSpeech 2 speech synthesis model trained on female English speech
NeMo
Model
>3y
Updated
04/04/2023 UTC
NVIDIA Deep Learning Examples
Tacotron2 PyTorch checkpoint (AMP)
Tacotron2 PyTorch checkpoint trained with AMP
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
TTS Vocoder Melgan
MelGAN Speech Synthesis model
NeMo
Model
>3y
Updated
04/04/2023 UTC
NVIDIA
Flowtron
Flowtron is an Autoregressive Flow-based Network for Text-to-Mel-spectrogram Synthesis.
AI
Conversational AI
+3
DL
Natural Language Processing
Text to Speech
Model
>3y
Updated
04/04/2023 UTC
24
Select item
24
48
96
192
24
48
96
192
1-24 of 52 items
1
1
2
2
3
3
π