NVIDIA MLPerf Inference
Container

The MLPerf Inference containers are base containers for people interested in NVIDIA's MLPerf Inference submission results.

MLPerf Inference NVIDIA-Optimized Implementations

MLPerf Inference is a benchmark suite for measuring how fast systems can run models in a variety of deployment scenarios. This collection provides the base containers that let people interested in NVIDIA’s MLPerf Inference submission reproduce NVIDIA’s leading results. The included containers are solely for benchmarking purposes and should not be used in any production environment.

Getting Started

For details on how to reproduce NVIDIA's results, please visit the MLCommons GitHub page and check the README file in NVIDIA's submission repo.

EULA

The user license is included in the container's root directory as /NVIDIA_MLPerf_Evaluation_License. By downloading this container, you agree to follow all the requirements stated in the EULA.
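
To read the license text without starting an interactive session, the file can be printed straight from the image. The following is a minimal sketch, assuming Docker is installed and the image has a `cat` binary. The helper name `license_command` and the image reference are hypothetical placeholders, since this page does not state the full registry path; only the license path comes from the text above.

```python
import shlex
from typing import List

# Path of the license file, as stated in the EULA section of this page.
LICENSE_PATH = "/NVIDIA_MLPerf_Evaluation_License"


def license_command(image: str) -> List[str]:
    """Build a `docker run` argument list that prints the license file and exits.

    `image` is the full image reference (registry/repository:tag). It is an
    assumption supplied by the caller; this page does not give the full path.
    """
    if not image:
        raise ValueError("image reference must not be empty")
    # --rm removes the container after it exits; --entrypoint replaces the
    # image's default entrypoint with `cat` so only the file is printed.
    return ["docker", "run", "--rm", "--entrypoint", "cat", image, LICENSE_PATH]


if __name__ == "__main__":
    # Hypothetical placeholder: substitute the image reference you pulled.
    image = "<registry>/<repository>:<tag>"
    # Print the shell-quoted command instead of running it, so the sketch
    # has no side effects; pass the list to subprocess.run to execute it.
    print(shlex.join(license_command(image)))
```

The command is built as an argument list rather than a single string so that it can be passed to `subprocess.run` without shell quoting issues.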

Misc

MLCommons official webpage: https://mlcommons.org/en/

Publisher: NVIDIA
Latest Tag: tensorrt_llm_release-feat-1.2-mlpinf-b5ddff4_mlperf-main-f538816_jan28_aarch64
Updated: February 11, 2026 UTC
Compressed Size: 17.76 GB
Multinode Support: No
Multi-Arch Support: No
