NeMo Evaluator | NVIDIA NGC

NVIDIA

NeMo Evaluator

Helm Chart

NVIDIA

NeMo Evaluator

Model evaluation service for NeMo microservices

NVIDIA AI Enterprise Supported

NeMo Evaluator Microservice Helm Chart

NeMo Evaluator microservice provides a comprehensive solution for evaluating large language models (LLMs) as part of the NeMo Microservices ecosystem. It enables systematic assessment of LLM capabilities through academic benchmarks, custom evaluations, and LLM-as-judge techniques.

You can use the Evaluator to test model performance across various dimensions, compare different models against consistent metrics, and conduct evaluations with your own custom datasets to ensure models meet your specific requirements before deployment.

Alternative: Platform Deployment

You can install NeMo Evaluator as part of the NeMo microservices platform by using the NeMo Microservices Helm Chart (chart | documentation).

Resources

Container | Helm Installation Guide | User Guide

Note: Use, distribution or deployment of this microservice in production requires an NVIDIA AI Enterprise License.

Governing Terms

The software and materials are governed by the NVIDIA Software License Agreement and the Product-Specific Terms for NVIDIA AI Products.

Publisher

NVIDIA

Latest Version25.6.0

UpdatedJune 11, 2025 UTC

Compressed Size2.11 MB

Labels

NeMo NSPECT-L3FU-DSNV