Container
Evaluation benchmark for code generation models
Sign in to access this content
Big Code Benchmark container
Overview
This image is a container to run BigCode-based benchmarks. The NeMo Evaluator microservice uses this container to run BigCode benchmarks. As part of the evaluation, the container downloads a dataset, performs inference, and uploads the results and logs to the datastore.
To get started with NeMo Evaluator, refer to Evaluation Tutorials.
Note: Use, distribution or deployment of this microservice in production requires an NVIDIA AI Enterprise License.
Governing Terms
The software and materials are governed by the NVIDIA Software License Agreement and the Product-Specific Terms for NVIDIA AI Products.
Publisher
NVIDIA
Latest Tag0.12.21
UpdatedJune 11, 2025 UTC
Compressed Size3.19 GB
Multinode SupportNo
Multi-Arch SupportNo
System
Labels