NGC | Catalog
Welcome Guest
CatalogModels
Many AI applications have common needs: classification, object detection, language translation, text-to-speech, recommender engines, sentiment analysis, and more. When developing applications with these capabilities, it is much faster to start with a model that is pre-trained and then tune it for a specific use case. The NGC catalog offers pre-trained models for a variety of common AI tasks that are optimized for NVIDIA Tensor Core GPUs, and can be easily re-trained by updating just a few layers, saving valuable time.
Sort: Last Modified
RMIR Conformer Hindi (hi-IN) Streaming Througput
Model
Conformer trained on Riva ASR-set 1.0
RMIR Conformer Hindi (hi-IN) Offline
Model
Conformer trained on Riva ASR-set 1.0
RMIR Conformer Hindi (hi-IN) Streaming
Model
Conformer trained on Riva ASR-set 1.0 with 3 gram LM
RMIR Conformer Hindi (hi-IN) Streaming
Model
Conformer trained on Riva ASR-set 1.0 with 3 gram LM
RMIR Conformer Hindi (hi-IN) Offline
Model
Conformer trained on Riva ASR-set 1.0 with 3 gram LM
RMIR Conformer Hindi (hi-IN) Streaming Throughput
Model
Conformer trained on Riva ASR-set 1.0 with 3 gram LM
Logo for SSL En  Conformer Large
SSL En Conformer Large
Model
Self-Supervised Learning (SSL) checkpoints for Conformer Large model. These are similar to w2v-Conformer model and can be fine-tuned for Automatic Speech Recognition (ASR).
Logo for STT En Conformer-CTC XLarge
STT En Conformer-CTC XLarge
Model
Conformer-CTC-XLarge model for English Automatic Speech Recognition, Trained on NeMo ASRSET
Logo for STT En Conformer-Transducer XLarge
STT En Conformer-Transducer XLarge
Model
Conformer-Transducer-XLarge model for English Automatic Speech Recognition, trained on NeMo ASRSET
Logo for SSL En  Conformer XLarge
SSL En Conformer XLarge
Model
Self-Supervised Learning (SSL) checkpoints for Conformer XLarge model. These are similar to w2v-Conformer model and can be fine-tuned for Automatic Speech Recognition (ASR).
Logo for PeopleNet
PeopleNet
Model
3 class object detection network to detect people in an image.
Logo for PeopleSemSegnet
PeopleSemSegnet
Model
Semantic segmentation of persons in an image.
Logo for Riva ASR Mandarin LM
Riva ASR Mandarin LM
Model
Base Mandarin 4-gram LM
Logo for DashCamNet
DashCamNet
Model
4 class object detection network to detect cars in an image.
Logo for TrafficCamNet
TrafficCamNet
Model
4 class object detection network to detect cars in an image.
Logo for PeopleSegNet
PeopleSegNet
Model
1 class instance segmentation network to detect and segment instances of people in an image.
Logo for LPDNet
LPDNet
Model
Object Detection network to detect license plates in an image of a car.
Logo for Riva TTS English US Auxiliary Files
Riva TTS English US Auxiliary Files
Model
Contains files used in rmir creation
Logo for Riva TTS English Normalization Grammar
Riva TTS English Normalization Grammar
Model
Base English grammar
Logo for RIVA Punctuation and Capitalization for Mandarin
RIVA Punctuation and Capitalization for Mandarin
Model
For each word in the input text, the model: 1) predicts a punctuation mark that should follow the word (if any), the model supports commas, periods and question marks) and 2) predicts if the word should be capitalized or not.
Logo for RIVA Conformer ASR Hindi
RIVA Conformer ASR Hindi
Model
Hindi Conformer ASR model trained on ASR set 1.0
Logo for FinMegatron345m-gpt2-bpe
FinMegatron345m-gpt2-bpe
Model
FSI : Financial Megatron GPT2 345m parameters model with BPE tokenizer, gpt vocabulary and merge file, pre-trained on subsets of CC-100 text corpus.
Logo for FinMegatron345m-uncased
FinMegatron345m-uncased
Model
FSI : Financial Megatron 345m parameters model with bert vocabulary (28k size) uncased, pre-trained on subsets of CC-100 text corpus.
Logo for STT En ContextNet 1024
STT En ContextNet 1024
Model
ContextNet-1024 model for English Automatic Speech Recognition, trained on NeMo ASRSET
Logo for Megatron GPT2 345M
Megatron GPT2 345M
Model
345M parameter GPT generative Megatron model