Validator for NVIDIA GPU Operator

Features
Description: Validates NVIDIA GPU Operator components
Publisher: NVIDIA
Latest Tag: v24.3.0
Modified: May 17, 2024
Compressed Size: 169.32 MB
Multinode Support: No
Multi-Arch Support: Yes
v24.3.0 (Latest) Security Scan Results

[Security scan result graphics for Linux / amd64 and Linux / arm64]

Validator for NVIDIA GPU Operator

The NVIDIA GPU Operator manages NVIDIA GPU resources in a Kubernetes cluster and automates the tasks involved in bootstrapping GPU nodes. Because the GPU is a special resource in the cluster, several components must be installed before application workloads can be deployed onto it. These components include the NVIDIA drivers (to enable CUDA), the Kubernetes device plugin, the container runtime, and others that provide automatic node labelling, monitoring, and more.

The Validator for NVIDIA GPU Operator runs as a DaemonSet and ensures that all components are working as expected on every GPU node. It runs a series of validations, one init container per component, and writes a status file for each result under /run/nvidia/validations. These status files allow each component to verify its dependencies and start in the correct order.
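As an illustration of how a dependent component might use these status files, the following is a minimal sketch, not the operator's actual implementation. The function name wait_for_status_files, its parameters, and the file name "driver-ready" are assumptions made for this example; only the /run/nvidia/validations directory comes from the description above.

```python
import os
import time


def wait_for_status_files(names, directory="/run/nvidia/validations",
                          timeout=300.0, poll_interval=5.0):
    """Wait until every named status file exists in `directory`.

    Hypothetical helper for illustration only. Returns True once all
    files are present, or False if `timeout` seconds elapse first.
    """
    deadline = time.monotonic() + timeout
    while True:
        # Collect the status files that have not been written yet.
        missing = [n for n in names
                   if not os.path.exists(os.path.join(directory, n))]
        if not missing:
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(poll_interval)


if __name__ == "__main__":
    # "driver-ready" is an assumed file name used only for this example.
    ready = wait_for_status_files(["driver-ready"], timeout=10.0, poll_interval=1.0)
    print("dependencies ready" if ready else "timed out waiting for dependencies")
```

Polling for a file keeps the dependency check independent of the Kubernetes API: a component only needs read access to the shared host path to decide whether it can start.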

License Agreements

An End User License Agreement is included with this product. By pulling and using the containers from NGC, you accept the terms and conditions of this license.

  • The source code for the components in the container, including the Dockerfiles, is licensed under Apache 2.0.

NVIDIA AI Enterprise Support

This product is supported when deployed by the NVIDIA GPU Operator.