NeMo NIM Proxy | NVIDIA NGC

NVIDIA

NeMo NIM Proxy

Container

NVIDIA

NeMo NIM Proxy

Proxy service for NIM microservices

NVIDIA AI Enterprise Supported

NeMo NIM Proxy Microservice Container

NeMo NIM Proxy microservice provides a unified access point for all NVIDIA NIM (NVIDIA Inference Microservice) deployments within your Kubernetes cluster through a single OpenAI-compatible API.

You can use the NIM Proxy to interact with multiple deployed models through standardized endpoints for retrieving model lists and making inference requests to chat completions and completions APIs.

Resources

Helm Chart | User Guide

Note: Use, distribution or deployment of this microservice in production requires an NVIDIA AI Enterprise License.

Governing Terms

The software and materials are governed by the NVIDIA Software License Agreement and the Product-Specific Terms for NVIDIA AI Products.

Publisher

NVIDIA

Latest Tag25.12

UpdatedDecember 16, 2025 UTC

Compressed Size37.5 MB

Multinode SupportNo

Multi-Arch SupportYes

System

signed images

Labels

NeMo NIM NSPECT-L3FU-DSNV