NVIDIA
NVIDIA
nvsm-multinode-arm
Container
NVIDIA
NVIDIA
nvsm-multinode-arm

NVIDIA System Management (NVSM) is a software framework for monitoring NVIDIA DGX nodes

NVSM Multinode Deployment Guide

This guide provides step-by-step instructions for deploying NVIDIA System Management (NVSM) Multinode using Docker containers.

Overview

NVSM Multinode is deployed using a single container image that supports multiple process types. The deployment consists of:

  • Aggregator (agg): Runs NVSM services via supervisord
  • Provision (pvsn): Runs Ansible provisioning
  • Prometheus (pmts): Runs Prometheus server for metrics collection
  • Grafana (gfna): Runs Grafana server for visualization

Use the -e process=<args> environment variable to deploy a specific container type (e.g., agg, pvsn, pmts, or gfna).

Additional Resources

License

License here.

Publisher
NVIDIA
NVIDIA
Latest Tag25.09.07
UpdatedApril 12, 2026 UTC
Compressed Size1013.93 MB
Multinode SupportYes
Multi-Arch SupportNo