SearchSearch thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts—ready to accelerate AI, digital twins, and HPC from cloud to edge.
NVIDIA Enterprise
NVIDIA Enterprise
68
65
60
4
1
1
1
NVIDIA NIM
NVIDIA NIM
42
NIM Container GPUs
NIM Container GPUs
Use Case
Use Case
8
5
3
3
3
3
2
2
2
2
2
2
2
2
2
2
2
1
1
1
1
NVIDIA Platform
NVIDIA Platform
7
7
6
4
3
3
3
2
2
1
1
1
1
1
1
1
Industry
Industry
124
98
64
49
48
45
45
41
29
25
24
20
17
16
15
12
11
10
10
9
2
Solution
Solution
16
14
13
13
9
8
8
7
6
6
5
5
4
3
2
2
1
1
1
1
Publisher
Publisher
88
1
1
1
1
1
Policy
Policy
2
Displaying 98 results
Validates NVIDIA GPU Operator components
Container
Docker containers distributed as part of the TAO Toolkit package
Container
Provision NVIDIA GPU Driver as a Container
Container
Plugin for the Kubernetes Node Feature Discovery for adding GPU node labels.
Container
Triton Inference Server is an open source software that lets teams deploy trained AI models from any framework, from local or cloud storage and on any GPU- or CPU-based infrastructure in the cloud, data center, or embedded devices.
Container
NVIDIA
NVIDIA
DCGM
Manage and Monitor GPUs in Cluster Environments.
Container
NVIDIA
NVIDIA
PyTorch
PyTorch is a GPU accelerated tensor computational framework. Functionality can be extended with common Python libraries such as NumPy and SciPy. Automatic differentiation is done with a tape-based system at the functional and neural network layer levels.
Container
TensorFlow is an open source platform for machine learning. It provides comprehensive tools and libraries in a flexible architecture allowing easy deployment across a variety of platforms and devices.
Container
NVIDIA
NVIDIA
TensorRT
NVIDIA TensorRT is a C++ library that facilitates high-performance inference on NVIDIA graphics processing units (GPUs). TensorRT takes a trained network and produces a highly optimized runtime engine that performs inference for that network.
Container
NVIDIA AI Enterprise
Llama 3.1 70B-Instruct NIM Production Branch October 2024 (PB 24h2) offers a 9-month lifecycle for API stability, with monthly patches for high and critical software vulnerabilities.
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Snowflake Arctic Embed Large Embedding inference
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Gemma-2-2B-IT inference through OpenAI compatible APIs
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Llama 2 70B inference through OpenAI compatible APIs
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Llama-3-Swalow-70B-Instruct-v0.1 inference through OpenAI compatible APIs
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Llama 3.1 Swallow 8B inference through OpenAI compatible APIs
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated CodeLlama-70B inference through OpenAI compatible APIs
Container
NVIDIA Optimized Deep Learning Framework, powered by Apache MXNet is a deep learning framework that allows you to mix the flavors of symbolic programming and imperative programming to maximize efficiency and productivity.
Container
NVIDIA Developer Program
NVIDIA
NVIDIA
NV-CLIP
NV-CLIP NIM microservice for multimodal embeddings model for image and text
Container
NVIDIA Developer Program
ACE Agent Model Utils Container
Container
NVIDIA AI Enterprise
Riva NMT NIM provide easy access to state-of-the-art neural machine translation (NMT) models, capable of translating text from one language to another with exceptional accuracy.
Container
NVIDIA Developer Program
The Llama 3.2 Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Llama-3-Taiwan-70B-Instruct inference through OpenAI compatible APIs
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated Llama-3.1-Nemotron-70B-Instruct inference through OpenAI compatible APIs
Container
NVIDIA Developer Program
NVIDIA NIM for GPU accelerated NVIDIA Retrieval QA Mistral 4B Reranking v3 inference
Container