NeMo Evaluator

Description: Model evaluation service for NeMo microservices
Publisher: NVIDIA
Latest Tag: 25.04
Modified: April 18, 2025
Compressed Size: 236.56 MB
Multinode Support: No
Multi-Arch Support: No
25.04 (Latest) Security Scan Results

Linux / amd64

NeMo Evaluator Microservice Container

The NeMo Evaluator microservice evaluates large language models (LLMs) as part of the NeMo Microservices ecosystem. It supports systematic assessment of LLM capabilities through academic benchmarks, custom evaluations, and LLM-as-judge techniques.

You can use the Evaluator to test model performance across various dimensions, compare different models against consistent metrics, and run evaluations on your own custom datasets to confirm that models meet your requirements before deployment.

Resources

Helm Chart | User Guide

Note: Use, distribution or deployment of this microservice in production requires an NVIDIA AI Enterprise License.

Governing Terms

The software and materials are governed by the NVIDIA Software License Agreement and the Product-Specific Terms for NVIDIA AI Products.