NGC Catalog
Explore
Search
Support
API Catalog
Forum
Search
Containers
DeepSeek-R1
NVIDIA Developer Program
+1
Llama-3.1-Nemotron-70B-Instruct
NVIDIA Developer Program
+1
PyTorch
Collections
Omniverse Kit (FB)
NVIDIA AI Enterprise
+2
DeepStream SDK
Omniverse Kit App Streaming
NVIDIA AI Enterprise
+2
Models
StyleGAN3 pretrained models
PeopleNet
TrafficCamNet
Resources
Kit SDK - Windows (PB25h1)
NVIDIA AI Enterprise
+2
Kit SDK - Linux (PB25h1)
NVIDIA AI Enterprise
+2
Riva Skills Quick Start
Helm Charts
GPU Operator
NVIDIA NIM Operator
Welcome Guest
Setup
Terms of Use
Theme
Use System Settings
Light
Dark
Sign In / Sign Up
Search
Search thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts—ready to accelerate AI, digital twins, and HPC from cloud to edge.
Search
Container (8)
Collection (6)
Model (10)
Resource (1)
Helm Chart (0)
NVIDIA Enterprise
(0)
NVIDIA Enterprise
NVIDIA Enterprise
NVIDIA AI Enterprise
9
NVIDIA Developer Program
6
NVIDIA AI Enterprise Supported
5
NVIDIA AI Enterprise IGX
1
NVIDIA NIM
(0)
NVIDIA NIM
NVIDIA NIM
Accelerate custom generative AI app deployment using pre-built containers with optimized AI models.
NVIDIA NIM
6
NIM Container GPUs
(0)
NIM Container GPUs
NIM Container GPUs
Use Case
(0)
Use Case
Use Case
Automatic Speech Recognition
6
Object Detection
6
Speech to Text
6
Video Analytics
3
Video enhancement
3
Image Segmentation
2
Action Recognition
1
Application Development
1
Graph Neural Networks
1
Image Synthesis
1
Natural Language Processing
1
Natural Language Understanding
1
Question Answering
1
Recommendation
1
Text to Speech
1
Translation
1
NVIDIA Platform
(1)
NVIDIA Platform
NVIDIA Platform
NeMo
148
Omniverse
115
Riva
84
Deep Learning Examples
78
PyTorch
59
DOCA
48
Maxine
47
DeepStream
43
Metropolis
41
Clara
39
Runs on RTX
35
TAO Toolkit
35
Triton Inference Server
34
Holoscan
26
Isaac
25
TensorRT
25
Aerial
24
TensorFlow
19
HPC
18
PhysicsNeMo
16
CUDA
15
Metropolis Microservices
11
Morpheus
11
RAPIDS
11
Network Operator
10
Merlin
8
GPU Operator
5
CUDA Toolkit
4
Clara AGX
3
Clara Parabricks
3
Container Toolkit
3
Deep Learning Institute
3
Monai
3
cuOpt
3
DCGM
2
HPC SDK
2
PyTorch Geometric
2
Deep Graph Library
1
GPU Driver
1
JAX
1
Industry
(0)
Industry
Industry
Robotics
5
Automotive / Transportation
2
Media & Entertainment
2
Academia / Higher Education
1
Architecture / Engineering / Construction
1
Gaming
1
Smart Cities / Spaces
1
Solution
(0)
Solution
Solution
AI
15
Computer Vision
10
Conversational AI
7
NVIDIA AI
5
DL
4
Vision AI
4
Inference
3
ML
2
Application Development
1
Recommender Systems
1
Publisher
(0)
Publisher
Publisher
Nvidia
19
Policy
(0)
Policy
Policy
Government ready
Labeled versions meet security requirements for FedRAMP High or equivalent use cases
2
Displaying 25 results
Sort: Most Popular
Sort: Most Popular
Sort: Relevance
Sort: Most Popular
Sort: Last Updated
Sort: Alphabetical (A-Z)
Sort: Alphabetical (Z-A)
Sort: Relevance
Sort: Most Popular
Sort: Last Updated
Sort: Alphabetical (A-Z)
Sort: Alphabetical (Z-A)
Search
TensorRT
NVIDIA Platform: TensorRT
Clear Filters
NVIDIA
TensorRT LLM Release
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
AI
Conversational AI
+3
DL
PyTorch
TensorRT
Container
4d
Updated
06/23/2026 UTC
NVIDIA
NVOF
NVOF is a deep learning based optical flow estimation and stereo matching solution.
Action Recognition
Automotive / Transportation
+5
Computer Vision
Gaming
Robotics
TensorRT
Video enhancement
Model
7mo
Updated
11/17/2025 UTC
NVIDIA
TensorRT LLM Develop
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
AI
Application Development
+3
DL
PyTorch
TensorRT
Container
4d
Updated
06/23/2026 UTC
NVIDIA AI Enterprise
NVIDIA
TensorRT PB October 2025 (PB25h2)
TensorRT Production Branch October 2025 (PB 25h2) offers a 9-month lifecycle for API stability, with monthly patches for high and critical software vulnerabilities. This release includes Government Ready images for regulated environments.
AI
TensorRT
Container
1mo
Updated
05/26/2026 UTC
NVIDIA AI Enterprise
Nvidia
TensorRT May 2025 (PB25h1)
TensorRT Production Branch May 2025 (PB 25h1) offers a 9-month lifecycle for API stability, with monthly patches for high and critical software vulnerabilities. This release is a branch of TensorRT 25.03.
NVIDIA AI
TensorRT
Container
6mo
Updated
12/17/2025 UTC
NVIDIA Developer Program
+1
NVIDIA AI Enterprise
NVIDIA
relighting
AI4M Relighting is an AI-powered video relighting that dynamically re-illuminates a person with virtual studio lighting using HDR environment maps. Supports adjustable lighting direction, intensity, specular highlights, and background compositing.
Computer Vision
Media & Entertainment
+3
TensorRT
Triton Inference Server
Video enhancement
Container
2mo
Updated
04/17/2026 UTC
NVIDIA AI Enterprise IGX
NVIDIA
TensorRT LTSB2 IGX
NVIDIA TensorRT is a C++ library that facilitates high-performance inference on NVIDIA graphics processing units (GPUs). TensorRT takes a trained network and produces a highly optimized runtime engine that performs inference for that network.
AI
DL
+2
Inference
TensorRT
Container
2mo
Updated
04/02/2026 UTC
NVIDIA
Nvidia VSS CV Event Detector
Nvidia Sample CV Event Detector Microservice for detecting events for VSS Event Reviewer workflow.
AI
Computer Vision
+8
DeepStream
Metropolis
Metropolis Microservices
Object Detection
TensorRT
Triton Inference Server
Video Analytics
Vision AI
Container
7mo
Updated
11/06/2025 UTC
NVIDIA
RT-DETR 2D Warehouse
RT-DETR object detection model for 2D warehouse applications
Computer Vision
DeepStream
+8
Metropolis
Metropolis Microservices
Object Detection
Robotics
Smart Cities / Spaces
TensorRT
Video Analytics
Vision AI
Model
1mo
Updated
05/27/2026 UTC
NVIDIA
PyNvVideoCodec
PyNvVideoCodec is NVIDIA’s Python based video codec library for hardware accelerated video encode and decode on NVIDIA GPUs.
Academia / Higher Education
AI
+15
Application Development
Architecture / Engineering / Construction
Automotive / Transportation
Computer Vision
CUDA
DeepStream
Image Segmentation
Image Synthesis
Media & Entertainment
Object Detection
PyTorch
TensorRT
Video Analytics
Video enhancement
Vision AI
Resource
4mo
Updated
01/30/2026 UTC
NVIDIA AI Enterprise
NVIDIA
TensorRT Production Branch 6
TensorRT Production Branch 6 offers a 9-month lifecycle for API stability, with monthly patches for high and critical software vulnerabilities. This release includes Government Ready images for regulated environments.
NVIDIA AI
TensorRT
Container
1mo
Updated
05/28/2026 UTC
NVIDIA
SyntheticaDETR
SytheticaDETR is a real-time object detection model based on a transformer architecture trained entirely in simulation and works on real images zero-shot.
AI
Computer Vision
+8
CUDA
English
ML
Object Detection
PyTorch
Robotics
TAO Toolkit
TensorRT
Model
18mo
Updated
12/03/2024 UTC
NVIDIA
PeopleSemSeg AMR
People semantic segmentation network, finetuned on robotics AMR dataset, optimized for Issac Perceptor & Nvblox.
AI
Computer Vision
+7
Image Segmentation
NVIDIA AI
Robotics
TAO Toolkit
TensorRT
Triton Inference Server
Vision AI
Model
18mo
Updated
12/03/2024 UTC
NVIDIA
Phi-2 (TensorRT LLM)
Phi-2 is a 2.7 billion parameter language model developed by Microsoft Research. The phi-2 model is best suited for prompts using the Question-Answer (QA) format, the chat format, and the code format.
AI
TensorRT
Model
2y
Updated
06/14/2024 UTC
NVIDIA
PeopleNet AMR
People bounding box detection network, finetuned on robotics AMR dataset, optimized for multi-camera RealSense setup. Used in nvblox multi-camera optimization.
AI
Computer Vision
+8
Inference
ML
NVIDIA AI
Object Detection
Robotics
TAO Toolkit
TensorRT
Triton Inference Server
Model
18mo
Updated
12/03/2024 UTC
NVIDIA
Llama 2 7B Chat (TensorRT LLM)
Llama 2 is a large language AI model comprising a collection of models capable of generating text and code in response to prompts.
AI
TensorRT
Model
19mo
Updated
11/12/2024 UTC
NVIDIA
Mistral 7B Instruct (TensorRT LLM)
Mistral-7B-Instruct is a language model that can follow instructions, complete requests, and generate creative text formats.
AI
TensorRT
Model
2y
Updated
06/14/2024 UTC
NVIDIA
Gemma 2B Instruct (TensorRT LLM)
Gemma-2B is a 2.5B parameter model from Gemma family of models from Google. It has been instruction-tuned so it can respond to prompts in a conversational manner.
AI
TensorRT
Model
2y
Updated
06/14/2024 UTC
NVIDIA
NVSaliENC
NVSaliENC uses deep learning-based saliency maps to optimize perceptual video quality in real time, prioritizing visually important regions for efficient, bandwidth-saving compression with NVENC integration.
AI
Computer Vision
+2
CUDA
TensorRT
Model
5mo
Updated
01/07/2026 UTC
Collection
NVIDIA
Deep Learning Frameworks
This collection contains performance-optimized Deep Learning frameworks.
AI
Automatic Speech Recognition
+21
Computer Vision
Conversational AI
CUDA
CUDA Toolkit
DL
Graph Neural Networks
Inference
Natural Language Processing
Natural Language Understanding
NVIDIA AI
Object Detection
PyTorch
Question Answering
RAPIDS
Recommendation
Recommender Systems
Speech to Text
TensorFlow
TensorRT
Text to Speech
Translation
9
Container
3mo
Updated
03/06/2026 UTC
Collection
NVIDIA AI Enterprise
+1
NVIDIA Developer Program
NVIDIA
English Parakeet 0.6b-v2 TDT collection
A collection of easy to use, highly optimized Deep Learning Models for Speech Recognition. The Parakeet collection provides Data Scientists and Software Engineers with recipes to train, fine-tune, and deploy state-of-the-art ASR models.
Automatic Speech Recognition
Conversational AI
+4
Riva
Speech to Text
TensorRT
Triton Inference Server
1
Container
3
Model
8mo
Updated
10/16/2025 UTC
Collection
NVIDIA AI Enterprise
+1
NVIDIA Developer Program
NVIDIA
Mandarin-English Parakeet 0.6b CTC collection
A collection of easy to use, highly optimized Deep Learning Models for Speech Recognition. The Parakeet collection provides Data Scientists and Software Engineers with recipes to train, fine-tune, and deploy state-of-the-art ASR models.
Automatic Speech Recognition
Conversational AI
+4
Riva
Speech to Text
TensorRT
Triton Inference Server
1
Container
3
Model
8mo
Updated
10/16/2025 UTC
Collection
NVIDIA AI Enterprise
+1
NVIDIA Developer Program
NVIDIA
Spanish Parakeet 0.6b CTC collection
A collection of easy to use, highly optimized Deep Learning Models for Speech Recognition. The Parakeet collection provides Data Scientists and Software Engineers with recipes to train, fine-tune, and deploy state-of-the-art ASR models.
Automatic Speech Recognition
Conversational AI
+4
Riva
Speech to Text
TensorRT
Triton Inference Server
1
Container
3
Model
8mo
Updated
10/16/2025 UTC
Collection
NVIDIA AI Enterprise
+1
NVIDIA Developer Program
NVIDIA
Taiwanese Mandarin Parakeet 0.6b CTC collection
A collection of easy to use, highly optimized Deep Learning Models for Speech Recognition. The Parakeet collection provides Data Scientists and Software Engineers with recipes to train, fine-tune, and deploy state-of-the-art ASR models.
Automatic Speech Recognition
Conversational AI
+4
Riva
Speech to Text
TensorRT
Triton Inference Server
1
Container
3
Model
8mo
Updated
10/16/2025 UTC
24
Select item
24
48
96
192
24
48
96
192
1-24 of 25 items
1
1
2
2
π