NGC | Catalog
CatalogModels
Models
The NGC catalog offers 100s of pre-trained models for computer vision, speech, recommendation, and more. Bring AI faster to market by using these models as-is or quickly build proprietary models with a fraction of your custom data.
Sort: Most Popular
Displaying 0 models
NVIDIA AI Enterprise Support
82
Use Case
76
58
18
15
13
13
7
7
5
4
4
2
1
1
NVIDIA Platform
103
103
78
41
21
20
7
6
Framework
Industry
Solution
Publisher
Language
Other
Logo for Stable Diffusion XL
Stable Diffusion XL (SDXL) enables you to generate expressive images with shorter prompts and insert words inside images.
Logo for CLIP
The CLIP (Contrastive Language-Image Pretraining) model combines vision and language using contrastive learning. It understands images and text together, enabling tasks like image classification and object detection.
Logo for efficientnet-widese-b4 weights (PyTorch, AMP, ImageNet)
efficientnet-widese-b4 ImageNet pretrained weights
Logo for Riva TTS English US Auxiliary Files
Contains files used in rmir creation
Logo for Riva TTS English US Auxiliary Files
Contains files used in rmir creation
Logo for TrafficCamNet
4 class object detection network to detect cars in an image.
Logo for NeVA: NeMo Vision and Language Assistant
NeVA is a multi-modal vision-language model that understands text and images and generates informative responses.
Logo for BodyPoseNet
Detect body pose from an image.
Logo for TAO Pretrained Object Detection
Pretrained weights to facilitate transfer learning using TAO Toolkit.
Logo for PeopleSegNet
1 class instance segmentation network to detect and segment instances of people in an image.
Logo for PeopleNet
3 class object detection network to detect people in an image.
Logo for GPUNet-0 pretrained weights (PyTorch, AMP, ImageNet)
GPUNet-0 ImageNet pretrained weights
Logo for DashCamNet
4 class object detection network to detect cars in an image.
Logo for TAO Pretrained DetectNet V2
Pretrained weights to facilitate transfer learning using TAO Toolkit.
Logo for PeopleSemSegnet
Semantic segmentation of persons in an image.
Logo for TAO Pretrained Classification
Pretrained weights to facilitate transfer learning using TAO Toolkit.
Logo for VehicleTypeNet
Resnet18 model to classify a car crop into 1 out 6 car types.
Logo for StyleGAN3 pretrained models
StyleGAN3 pretrained models for FFHQ, AFHQv2 and MetFaces datasets.
Logo for VehicleMakeNet
Resnet18 model to classify a car crop into 1 out 20 car brands.
Logo for TTS En TalkNet
Speech Synthesis model trained on female English speech
Logo for FaceDetectIR
1 class object detection network to detect faces in an image.
Logo for Action Recognition Net
5 class action recognition network to recognize what people do in an image.
Logo for GPUNet-P0 pretrained weights (PyTorch, AMP, ImageNet)
GPUNet-P0 ImageNet pretrained weights
Logo for FaceDetect
Detect faces from an image.
Logo for License Plate Recognition
Model to recognize characters from the image crop of a License Plate.