NGC Catalog
Search
Search thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts—ready to accelerate AI, digital twins, and HPC from cloud to edge.
Container (2)
Collection (0)
Model (16)
Resource (0)
Helm Chart (0)
Use Case
Automatic Speech Recognition (15)
Speech to Text (2)
Speech enhancement (1)
Synthetic Data Generation (1)
Text to Speech (1)
NVIDIA Platform
NeMo (11)
PyTorch (3)
Clara (1)
Industry
Cloud Services (1)
Healthcare (1)
Solution
AI (11)
Conversational AI (5)
DL (5)
NVIDIA AI (1)
Publisher
NVIDIA (16)
MeetKai Inc. (1)
Displaying 18 results
Sort: Most Popular
label: PyTorch with NeMo
NVIDIA
Domain Specific NeMo ASR Application
The Domain Specific - NeMo Automatic Speech Recognition (ASR) Application facilitates training, evaluation and performance comparison of ASR models. This NeMo application enables you to train or fine-tune pre-trained ASR models with your own data.
Automatic Speech Recognition
NeMo
PyTorch with NeMo
Container
7mo
Updated
11/06/2025 UTC
MeetKai Inc.
MK-SQuIT
SQuIT (Synthesizing Questions using Iterative Template-Filling) is a generated dataset produced with little human intervention. This container provides several tutorial applications: an interactive dataset explorer, a walkthrough of the generation pipeline, and a demonstration that uses NeMo to fine-tune and evaluate a model on the dataset.
DL
Container
7mo
Updated
11/06/2025 UTC
NVIDIA
Audio Codec 16kHz Small
This model card contains a Small Audio Codec model trained on the Libri-Light audiobook recordings dataset, comprising approximately 60,000 hours of English language speech with a 16kHz sampling rate.
AI
Automatic Speech Recognition
Conversational AI
NeMo
NVIDIA AI
PyTorch
PyTorch with NeMo
Speech enhancement
Speech to Text
Text to Speech
Model
>2y
Updated
02/27/2024 UTC
NVIDIA
TitaNet-S
TitaNet Small model for Speaker Verification and Diarization tasks
AI
English
NeMo
PyTorch with NeMo
Model
3y
Updated
06/07/2023 UTC
NVIDIA
LangID PearlNet
PearlNet Lang ID model for Spoken Language Identification
AI
Automatic Speech Recognition
Conversational AI
DL
NeMo
PyTorch with NeMo
Model
3y
Updated
05/31/2023 UTC
NVIDIA
STT En Fast Conformer-CTC Large
Fast Conformer-CTC-Large model for English Automatic Speech Recognition, trained on NeMo ASRSET
AI
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Model
>3y
Updated
04/25/2023 UTC
NVIDIA
Parakeet-TDT_CTC-110M
Large version of the hybrid Fast Conformer TDT-CTC model (114M parameters), trained on a larger dataset of 36,000 hours with punctuation and capitalization. This model was jointly developed by the NVIDIA NeMo and Suno.ai teams.
Automatic Speech Recognition
Cloud Services
Conversational AI
English
NeMo
PyTorch
PyTorch with NeMo
Speech to Text
Model
18mo
Updated
11/27/2024 UTC
NVIDIA
STT En FastConformer Hybrid Transducer-CTC Large P&C
This collection contains the large version (114M) of the English speech recognition model with a FastConformer encoder and a Hybrid decoder (joint RNNT-CTC loss). The model has a vocab size of 1024 and emits text with punctuation and capitalization.
AI
Automatic Speech Recognition
DL
English
NeMo
PyTorch
PyTorch with NeMo
Model
>2y
Updated
07/20/2023 UTC
—
STT En Zh Multilingual Code-Switched FastConformer Transducer L
English + Mandarin Multilingual and Code-Switched Speech Recognition FastConformer Transducer Large Model
Automatic Speech Recognition
Conversational AI
English
Mandarin
NeMo
PyTorch with NeMo
Model
>2y
Updated
09/26/2023 UTC
NVIDIA
STT En Fast Conformer-Transducer Large
Fast Conformer-Transducer-Large model for English Automatic Speech Recognition, trained on NeMo ASRSET
AI
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Model
>3y
Updated
04/25/2023 UTC
NVIDIA
STT Fa FastConformer Hybrid Transducer-CTC Large
This collection contains the large version (114M) of the Persian speech recognition model with a FastConformer encoder and a Hybrid decoder (joint RNNT-CTC loss). The model has a vocab size of 1024.
AI
Automatic Speech Recognition
DL
NeMo
PyTorch
PyTorch with NeMo
Model
>2y
Updated
11/07/2023 UTC
NVIDIA
TTS En FastPitch SpectrogramEnhancer For-ASR-Finetuning
This collection contains FastPitch and Spectrogram Enhancer models. The main use case is English ASR domain fine-tuning. Direct TTS use is not advised.
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Synthetic Data Generation
Model
>2y
Updated
07/10/2023 UTC
NVIDIA
STT En Fast Conformer-Transducer XXLarge
Fast Conformer-Transducer-XXLarge model for English Automatic Speech Recognition, trained on NeMo ASRSET 3.0
AI
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Model
>2y
Updated
07/28/2023 UTC
NVIDIA
STT En Fast Conformer-CTC XLarge
Fast Conformer-CTC-XLarge model for English Automatic Speech Recognition, trained on NeMo ASRSET
AI
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Model
3y
Updated
06/07/2023 UTC
NVIDIA
STT En Fast Conformer-Transducer XLarge
Fast Conformer-Transducer-XLarge model for English Automatic Speech Recognition, trained on NeMo ASRSET 3.0
AI
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Model
3y
Updated
06/07/2023 UTC
NVIDIA
STT En Fast Conformer-Transducer Large LibriSpeech
Fast Conformer-Transducer-Large model for English Automatic Speech Recognition, trained with NeMo on the LibriSpeech dataset
AI
Automatic Speech Recognition
English
NeMo
PyTorch with NeMo
Model
>3y
Updated
04/25/2023 UTC
NVIDIA
STT En Fast Conformer-CTC XXLarge
Fast Conformer-CTC-XXLarge model for English Automatic Speech Recognition, pre-trained on LibriLight and fine-tuned on NeMo ASRSET 3.0
Automatic Speech Recognition
Conversational AI
English
NeMo
PyTorch with NeMo
Model
>2y
Updated
07/28/2023 UTC
NVIDIA
ESM-2nv 8M
An 8 million parameter BERT model fully pre-trained with BioNeMo
Clara
DL
Healthcare
Megatron-LM
NeMo
PyTorch
PyTorch with NeMo
Model
11mo
Updated
07/17/2025 UTC
1-18 of 18 items