Search thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts, ready to accelerate AI, digital twins, and HPC from cloud to edge.
Displaying 85 results
Speech Synthesis model trained on female English speech
Model
Base English 3-gram LM
Model
English Conformer ASR model trained on ASR set 6.0
Model
Conformer-CTC-Large model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Model
FastPitch Speech Synthesis model trained on female English speech.
Model
Neural Machine Translation (NMT) model to translate from English to Spanish
Model
Citrinet 1024 model trained on ASR Set dataset
Model
Tacotron2 Speech Synthesis model trained on female English speech
Model
FastPitch+HiFiGAN End-to-End Speech Synthesis model trained on female English speech
Model
Conformer-CTC-Medium model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Model
Conformer-Transducer-Large model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Model
Chunghwa Telecom Laboratories
SF Bilingual Speech in Chinese and English
A bilingual (Mandarin-English) Speech Dataset.
Resource
Conformer-CTC-Small model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Model
Neural Machine Translation (NMT) model to translate from English to Spanish
Model
QuartzNet is a Jasper-like network that uses separable convolutions and larger filter sizes. It has accuracy comparable to Jasper with far fewer parameters. This particular model has 15 blocks, each repeated 5 times.
Model
Conformer-CTC-XLarge model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Model
Conformer-CTC-Large model for English Automatic Speech Recognition, Trained with NeMo on LibriSpeech dataset
Model
Transformer-Large language model for English ASR, Trained on LibriSpeech text corpus with NeMo
Model
FastSpeech 2 speech synthesis model trained on female English speech
Model
Conformer-Transducer-Small model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Model
Megatron Multilingual Neural Machine Translation model to translate from English to Any* language. Supported languages: cs, da, de, el, es, fi, fr, hu, it, lt, lv, nl, no, pl, pt, ro, ru, sk, sv, zh, ja, hi, ko, et, sl, bg, uk, hr, ar, vi, tr, id
Model
Punctuation and Capitalization model with BERT
Model
This collection contains the large version (114M) of the streaming speech recognition model trained on NeMo ASRSET for English with a look-ahead of 80 ms. All models are cache-aware hybrid FastConformer models with both Transducer and CTC decoders.
Model
Named Entity Recognition model with BERT
Model
