Linux / amd64
NeMo Deployment Management microservice enables deployment and management of NVIDIA NIM and other workloads in Kubernetes environments. The service provides APIs to create, update, delete, and monitor various deployment types including NIM services, training jobs, and validation workflows.
You can use this service to deploy NVIDIA NIMs for large language models, manage model deployment configurations, set up training workflows with data handlers, and orchestrate the validation of datasets and models within your Kubernetes cluster.
Note: Use, distribution or deployment of this microservice in production requires an NVIDIA AI Enterprise License.
The software and materials are governed by the NVIDIA Software License Agreement and the Product-Specific Terms for NVIDIA AI Products.