Meta
Meta
Llama 4 Maverick 17B 128E Instruct
Container
Meta
Meta
Llama 4 Maverick 17B 128E Instruct

This container houses Llama 4 Maverick model which is a general purpose multimodal, multilingual 128 MoE model with 17B parameters.

Join or Subscribe to get accessSubscribe to the product below to access this premium content:
NVIDIA Developer Program
NVIDIA Developer ProgramJoin the Developer Program for access to free tools, support, and tech resources.
Get Access
NVIDIA AI Enterprise
NVIDIA AI EnterpriseAccelerate your AI agent development
Subscribe Now
Note: You can gain access to hundreds more GPU-optimized artifacts by creating a free NGC account.
Already Subscribed?Log in

Llama4 Maverick 17b 128e Container Overview

Description:

This container houses Llama 4 Maverick model which is a general purpose multimodal, multilingual 128 MoE model with 17B parameters.

The container components are ready for commercial/non-commercial use.

License/Terms of Use:

GOVERNING TERMS: The NIM container is governed by the NVIDIA Software License Agreement and the Product-Specific Terms for NVIDIA AI Products; except for the model which is governed by the NVIDIA Community Model License. ADDITIONAL INFORMATION: Llama 4 Community License Agreement. Built with Llama.

Deployment Geography:

Global, except EU

Release Date:

Build.Nvidia.com April 5, 2025 via https://build.nvidia.com/meta/llama-4-maverick-17b-128e-instruct
Huggingface April 5, 2025 via https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct

Llama 4 Maverick 17b 128e :

The Llama 4 Maverick Container includes the following model:

Model Name & LinkUse CaseHow to Pull the Model
https://build.nvidia.com/meta/llama-4-maverick-17b-128e-instructA general purpose multimodal, multilingual 128 MoE model with 17B parameters.Automatic

Deployment Details:

Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems. By leveraging NVIDIA’s hardware (e.g. GPU cores) and software frameworks (e.g., CUDA libraries), the model achieves faster training and inference times compared to CPU-only solutions.

For information on how to deploy this NIM, please visit - Get started

Enterprise Support

Get access to knowledge base articles and support cases or submit a ticket.

Container Version(s):

nvcr.io/nim/meta/llama-4-maverick-17b-128e-instruct:latest

Ethical Considerations:

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal developer team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report security vulnerabilities or NVIDIA AI Concerns here.

Publisher
Meta
Meta
Latest Tag1.4
UpdatedSeptember 8, 2025 UTC
Compressed Size13.45 GB
Multinode SupportNo
Multi-Arch SupportNo

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.