NGC Catalog
Welcome Guest
All You Need to Build AI. All in One Place.
Welcome to the NGC Catalog - GPU Accelerated AI models and SDKs that help you infuse AI into your applications at speed of light
Explore Use Cases
NVIDIA NIM
View All
meta-llama-2-70b-chat
Container
NVIDIA NIM for GPU accelerated Llama 2 70B inference through OpenAI compatible APIs
nim-dev
+1
NVIDIA NIM
+1
NVIDIA Retrieval QA Mistral 4B Reranking v3
Container
NVIDIA NIM for GPU accelerated NVIDIA Retrieval QA Mistral 4B Reranking v3 inference
nim-dev
+1
NVIDIA NIM
Llama-3.1-8b-base
Container
NVIDIA NIM for GPU accelerated Llama 3.1 8B inference through OpenAI compatible APIs
nim-dev
+1
NVIDIA NIM
+1
Snowflake Arctic Embed Large Embedding
Container
NVIDIA NIM for GPU accelerated Snowflake Arctic Embed Large Embedding inference
nim-dev
+1
NVIDIA NIM
Getting Started
DeepStream - CV Deployment
Collection - Intelligent Video Analytics
DeepStream SDK delivers a complete streaming analytics toolkit for AI based video and image understanding and multi-sensor processing. The DeepStream SDK brings deep neural networks and other complex processing tasks into a stream processing pipeline.
Language Modelling
Collection - Natural Language Processing
A collection of easy to use, highly optimized Deep Learning Models for Language Modelling. Deep Learning Examples provides Data Scientist and Software Engineers with recipes to Train, fine-tune, and deploy State-of-the-Art Models
NeMo - Automatic Speech Recognition
Collection - Automatic Speech Recognition
This collection contains NeMo models for Automatic Speech Recognition (ASR): Speech to Text, Speech Classification, Speaker Diarization, Speaker Verification, Speaker Recognition, Command Recognition, Voice Activity Detection
LLMs optimized for RTX PCs
Collection - Windows Rtx Accelerated Models
A collection of TensorRT-LLM accelerated Windows RTX PC LLM models.
Runs on RTX
Command Line Interface
Want to get more from NGC? Everything you see here can be used and managed via our powerful CLI tools.Â
Download Now
Documentation
We've got a whole host of documentation, covering the NGC UI and our powerful CLI. You can find out more here.Â
Go to Documentation
AI Enterprise Documentation
Learn how to virtualize any application with NVIDIA virtual GPU technology.Â
Go to Documentation
Enterprise Support
Get to access to knowledgebase articles and support cases.Â
File a Ticket
Licensing Portal
Access the software & licensing portal for your products.Â
Get Your Licenses
NGC Private Registry
Private Registries from NGC allow you to secure, manage, and deploy your own assets to accelerate your journey to AI.Â
Learn More
Getting Started with NVIDIA AI Enterprise
cuOpt
Collection - High Performance Computing
NVIDIA cuOpt is a world record GPU-accelerated optimization AI microservice that empowers instant dynamic decision-making to solve routing problems with the best-known accuracy at scale.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Meta/Llama3-8b-instruct
Container
NVIDIA NIM for GPU accelerated Llama 3 8B inference through OpenAI compatible APIs
nim-dev
+1
NVIDIA NIM
+1
NVIDIA AI Enterprise Infra 5
Collection - Infrastructure
Access Infrastructure and workload management software, exclusively available with your NVIDIA AI Enterprise subscription.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Production Branch - May 2024 (PB 24h1)
Collection - Deep Learning
Access the production-ready branches of AI frameworks and SDKs. Supported for 9 months with monthly security patches.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Popular Collections
View All
HPC Collection
High Performance Computing
This collection provides access to the top HPC applications for Molecular Dynamics, Quantum Chemistry, and Scientific visualization.
Code Llama
Advanced
Code Llama is an LLM capable of generating code, and natural language about code, from both code and natural language prompts.
Llama 2
Advanced
Llama 2 is a large language AI model capable of generating text and code in response to prompts.
Build an AI Chatbot with RAG
Machine Learning
Use a reference application to build a fully functional retrieval-augmented generation (RAG)-based AI chatbot built with NVIDIA NIMTM microservices
Automatic Speech Recognition
Automatic Speech Recognition
A collection of easy to use, highly optimized Deep Learning Models for Recommender Systems. Deep Learning Examples provides Data Scientist and Software Engineers with recipes to Train, fine-tune, and deploy State-of-the-Art Models
NVIDIA Holoscan
Healthcare
The AI sensor processing platform
Clara Discovery
Healthcare
Clara Discovery is a collection of frameworks, applications, and AI models enabling GPU-accelerated computational drug discovery
Clara NLP
Healthcare
Clara NLP is a collection of SOTA biomedical pre-trained language models as well as highly optimized pipelines for training NLP models on biomedical and clinical text
Popular Containers
View All
Python Basic for AI Workbench
Python Basic - AI Workbench Default Container (Beta)
python-cuda122
Python with CUDA 12.2 - AI Workbench Default Container (Beta)
DCGM
Manage and Monitor GPUs in Cluster Environments.
Validator for NVIDIA GPU Operator
Validates NVIDIA GPU Operator components
Popular Models
View All
Llama2-13b Chat Int4
LlaMa 2 is a large language AI model capable of generating text and code in response to prompts.
monai_spleen_ct_segmentation
A pre-trained model for volumetric (3D) segmentation of the spleen from CT image.
Action Recognition Net
5 class action recognition network to recognize what people do in an image.
DashCamNet
4 class object detection network to detect cars in an image.
Popular Resources
View All
Endoscopy out of body Sample App Data
Holoscan Sample App data for Endoscopy out of body detection
Holoscan Cars Video
Video of cars for evaluating detection algorithms for Holoscan SDK.
Colonoscopy Sample App Data
Holoscan Sample App Data for AI Colonoscopy Segmentation of Polyps
Endoscopy Sample App Data
Holoscan Sample App Data for AI-based Endoscopy Tool Tracking
Popular Helm Charts
View All
RAG Application: Multimodal Chatbot
This example showcases multi modal usecase in a RAG pipeline. It can understand any kind of images in PDF or .pptx (like graphs and plots) alongside text and tables.
RAG Application: Multiturn Chatbot
This example showcases a RAG workflow with multi-turn conversation capabilities.
RAG Application: Langchain Text QA Chatbot
A helm chart demonstrating a basic RAG pipeline built using langchain leveraging Nvidia NIM LLM's and Retrievers deployed on-prem.
NVIDIA K8s Developer LLM Operator
The NVIDIA K8s Developer LLM Operator is an open source and easy to deploy Kubernetes Operator to self-host Generative AI workflows.