NGC Catalog
All You Need to Build AI. All in One Place.
Welcome to the NGC Catalog: GPU-accelerated AI models and SDKs that help you infuse AI into your applications at the speed of light.
Explore Use Cases
NVIDIA NIM
Llama-3.1-8b-instruct
Container
NVIDIA NIM for GPU-accelerated Llama 3.1 8B inference through OpenAI-compatible APIs
nim-dev
NVIDIA NIM
Phi-3-Mini-4K-Instruct
Container
NVIDIA NIM for GPU-supported Phi-3-Mini-4K-Instruct inference through OpenAI-compatible APIs
nv-ai-enterprise
NVIDIA NIM
ASR Parakeet CTC Riva 1.1b
Container
RIVA ASR NIM delivers accurate English speech-to-text transcription and enables easy-to-use, optimized ASR inference for large-scale deployments.
nim-dev
NVIDIA NIM
Llama-3.1-405b-instruct
Container
NVIDIA NIM for GPU-accelerated Llama 3.1 405B inference through OpenAI-compatible APIs
nim-dev
NVIDIA NIM
Getting Started
Language Modelling
Collection - Natural Language Processing
A collection of easy-to-use, highly optimized deep learning models for language modelling. Deep Learning Examples provides data scientists and software engineers with recipes to train, fine-tune, and deploy state-of-the-art models.
NeMo - Automatic Speech Recognition
Collection - Automatic Speech Recognition
This collection contains NeMo models for Automatic Speech Recognition (ASR): Speech to Text, Speech Classification, Speaker Diarization, Speaker Verification, Speaker Recognition, Command Recognition, and Voice Activity Detection.
DeepStream - CV Deployment
Collection - Intelligent Video Analytics
DeepStream SDK delivers a complete streaming analytics toolkit for AI-based video and image understanding and multi-sensor processing. It brings deep neural networks and other complex processing tasks into a stream processing pipeline.
LLMs optimized for RTX PCs
Collection - Windows RTX Accelerated Models
A collection of TensorRT-LLM accelerated Windows RTX PC LLM models.
Runs on RTX
Command Line Interface
Want to get more from NGC? Everything you see here can be used and managed via our powerful CLI tools.
Download Now
Documentation
We've got a whole host of documentation, covering the NGC UI and our powerful CLI. You can find out more here.
Go to Documentation
AI Enterprise Documentation
Learn how to virtualize any application with NVIDIA virtual GPU technology.
Go to Documentation
Enterprise Support
Get access to knowledge base articles and support cases.
File a Ticket
Licensing Portal
Access the software & licensing portal for your products.
Get Your Licenses
NGC Private Registry
Private Registries from NGC allow you to secure, manage, and deploy your own assets to accelerate your journey to AI.
Learn More
Getting Started with NVIDIA AI Enterprise
cuOpt
Collection - High Performance Computing
NVIDIA cuOpt is a world-record GPU-accelerated optimization AI microservice that enables instant, dynamic decision-making to solve routing problems with the best-known accuracy at scale.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Meta/Llama3-8b-instruct
Container
NVIDIA NIM for GPU-accelerated Llama 3 8B inference through OpenAI-compatible APIs
nim-dev
NVIDIA NIM
Production Branch - May 2024 (PB 24h1)
Collection - Deep Learning
Access the production-ready branches of AI frameworks and SDKs. Supported for 9 months with monthly security patches.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
NVIDIA AI Enterprise Infra 5
Collection - Infrastructure
Access Infrastructure and workload management software, exclusively available with your NVIDIA AI Enterprise subscription.
nv-ai-enterprise
NVIDIA AI Enterprise Supported
Popular Collections
Code Llama
Advanced
Code Llama is an LLM capable of generating code, and natural language about code, from both code and natural language prompts.
Llama 2
Advanced
Llama 2 is a large language AI model capable of generating text and code in response to prompts.
Build an AI Chatbot with RAG
Machine Learning
Use a reference application to build a fully functional retrieval-augmented generation (RAG) AI chatbot with NVIDIA NIM™ microservices.
Automatic Speech Recognition
Automatic Speech Recognition
A collection of easy-to-use, highly optimized deep learning models for Recommender Systems. Deep Learning Examples provides data scientists and software engineers with recipes to train, fine-tune, and deploy state-of-the-art models.
NVIDIA Holoscan
Healthcare
The AI sensor processing platform
Clara Discovery
Healthcare
Clara Discovery is a collection of frameworks, applications, and AI models enabling GPU-accelerated computational drug discovery
Clara NLP
Healthcare
Clara NLP is a collection of SOTA biomedical pre-trained language models as well as highly optimized pipelines for training NLP models on biomedical and clinical text
Clara Parabricks
Healthcare
Clara Parabricks is a collection of software tools and notebooks for next-generation sequencing, including short- and long-read applications. These tools are designed to be scalable, generating highly accurate results in an accelerated compute environment.
Popular Containers
Python Basic for AI Workbench
Python Basic - AI Workbench Default Container (Beta)
python-cuda120
Python with CUDA 12.0 - AI Workbench Default Container (Beta)
python-cuda122
Python with CUDA 12.2 - AI Workbench Default Container (Beta)
DCGM
Manage and Monitor GPUs in Cluster Environments.
Popular Models
ChatGLM3-6B Chat Int4
ChatGLM3-6B is the latest open-source model in the ChatGLM series. It introduces the following features: (1) a more powerful base model, (2) more comprehensive function support, and (3) a more comprehensive open-source series.
GPUNet-0 pretrained weights (PyTorch, AMP, ImageNet)
GPUNet-0 ImageNet pretrained weights
Llama2-13b Chat Int4
Llama 2 is a large language AI model capable of generating text and code in response to prompts.
Mistral-7B Chat Int4
The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-7B-v0.1 generative text model, trained using a variety of publicly available conversation datasets.
Popular Resources
Endoscopy out of body Sample App Data
Holoscan Sample App data for Endoscopy out of body detection
Holoscan Cars Video
Video of cars for evaluating detection algorithms for Holoscan SDK.
Colonoscopy Sample App Data
Holoscan Sample App Data for AI Colonoscopy Segmentation of Polyps
Endoscopy Sample App Data
Holoscan Sample App Data for AI-based Endoscopy Tool Tracking
Popular Helm Charts
RAG Application: Multimodal Chatbot
This example showcases a multimodal use case in a RAG pipeline. It can understand images in PDF or .pptx files (such as graphs and plots) alongside text and tables.
RAG Application: Multiturn Chatbot
This example showcases a RAG workflow with multi-turn conversation capabilities.
RAG Application: Structured Data Chatbot
A sample RAG application that handles question answering over tabular data stored in CSV format.
RAG Application: Langchain Text QA Chatbot
A Helm chart demonstrating a basic RAG pipeline built using LangChain, leveraging NVIDIA NIM LLMs and retrievers deployed on-prem.