GPU-optimized AI, Machine Learning, & HPC Software

NVIDIA

Validator for NVIDIA GPU Operator

Validates NVIDIA GPU Operator components

Container

NVIDIA

TAO Toolkit

Docker containers distributed as part of the TAO Toolkit package

Container

NVIDIA

NVIDIA GPU Driver

Provision NVIDIA GPU Driver as a Container

Container

NVIDIA

NVIDIA GPU Feature Discovery for Kubernetes

Plugin for the Kubernetes Node Feature Discovery for adding GPU node labels.

Container

NVIDIA

Triton Inference Server

Triton Inference Server is open source software that lets teams deploy trained AI models from any framework, from local or cloud storage, on any GPU- or CPU-based infrastructure in the cloud, in the data center, or on embedded devices.

Container

NVIDIA

DCGM

Manage and Monitor GPUs in Cluster Environments.

Container

NVIDIA

PyTorch

PyTorch is a GPU accelerated tensor computational framework. Functionality can be extended with common Python libraries such as NumPy and SciPy. Automatic differentiation is done with a tape-based system at the functional and neural network layer levels.

Container

Google

TensorFlow

TensorFlow is an open source platform for machine learning. It provides comprehensive tools and libraries in a flexible architecture allowing easy deployment across a variety of platforms and devices.

Container

NVIDIA

TensorRT

NVIDIA TensorRT is a C++ library that facilitates high-performance inference on NVIDIA graphics processing units (GPUs). TensorRT takes a trained network and produces a highly optimized runtime engine that performs inference for that network.

Container

NVIDIA AI Enterprise

NVIDIA

Llama-3.1-70b-instruct PB October 2024 (PB 24h2)

Llama 3.1 70B-Instruct NIM Production Branch October 2024 (PB 24h2) offers a 9-month lifecycle for API stability, with monthly patches for high and critical software vulnerabilities.

Container

NVIDIA Developer Program

—

Snowflake Arctic Embed Large Embedding

NVIDIA NIM for GPU accelerated Snowflake Arctic Embed Large Embedding inference

Container

NVIDIA Developer Program

NVIDIA

Gemma-2-2B-IT

NVIDIA NIM for GPU accelerated Gemma-2-2B-IT inference through OpenAI compatible APIs

Container

NVIDIA Developer Program

NVIDIA

meta-llama-2-70b-chat

NVIDIA NIM for GPU accelerated Llama 2 70B inference through OpenAI compatible APIs

Container

NVIDIA Developer Program

NVIDIA

Llama-3-Swallow-70B-Instruct-v0.1

NVIDIA NIM for GPU accelerated Llama-3-Swallow-70B-Instruct-v0.1 inference through OpenAI compatible APIs

Container

NVIDIA Developer Program

NVIDIA

Llama-3.1-Swallow-8B-Instruct-v0.1

NVIDIA NIM for GPU accelerated Llama 3.1 Swallow 8B inference through OpenAI compatible APIs

Container

NVIDIA Developer Program

NVIDIA

CodeLlama-70B-Instruct

NVIDIA NIM for GPU accelerated CodeLlama-70B inference through OpenAI compatible APIs

Container

Apache Software Foundation

NVIDIA Optimized Deep Learning Framework powered by Apache MXNet

NVIDIA Optimized Deep Learning Framework, powered by Apache MXNet, is a deep learning framework that allows you to mix the flavors of symbolic programming and imperative programming to maximize efficiency and productivity.

Container

NVIDIA Developer Program

NVIDIA

NV-CLIP

NV-CLIP NIM microservice providing a multimodal embedding model for images and text

Container

NVIDIA Developer Program

NVIDIA

ACE Agent Model Utils

ACE Agent Model Utils Container

Container

NVIDIA AI Enterprise

NVIDIA

Riva NMT NIM

Riva NMT NIM provides easy access to state-of-the-art neural machine translation (NMT) models, capable of translating text from one language to another with exceptional accuracy.

Container

NVIDIA Developer Program

NVIDIA

Llama-3.2-11B-Vision-Instruct

The Llama 3.2 Vision instruction-tuned models are optimized for visual recognition, image reasoning, captioning, and answering general questions about an image.

Container

NVIDIA Developer Program

NVIDIA

Llama-3-Taiwan-70B-Instruct

NVIDIA NIM for GPU accelerated Llama-3-Taiwan-70B-Instruct inference through OpenAI compatible APIs

Container

NVIDIA Developer Program

NVIDIA

Llama-3.1-Nemotron-70B-Instruct

NVIDIA NIM for GPU accelerated Llama-3.1-Nemotron-70B-Instruct inference through OpenAI compatible APIs

Container

NVIDIA Developer Program

NVIDIA

NVIDIA Retrieval QA Mistral 4B Reranking v3

NVIDIA NIM for GPU accelerated NVIDIA Retrieval QA Mistral 4B Reranking v3 inference

Container