Search thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts—ready to accelerate AI, digital twins, and HPC from cloud to edge.
Displaying 26 results
The MiniMax-M2.5 NIM Container is a deployable inference container for serving MiniMax-M2.5, a third-party text generation model optimized for complex agentic tasks including software engineering, tool use, and search.
Container
Next-gen Qwen 3.5 VLM (400B MoE) brings advanced vision, chat, RAG, and agentic capabilities.
Container
This container houses GLiNER PII, which detects and classifies a broad range of Personally Identifiable Information (PII) and Protected Health Information (PHI) in structured and unstructured text.
Container
Nemotron-3-Super-120B-A12B is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks.
Container
Qwen3.5-122B-A10B is a multimodal vision-language Mixture-of-Experts model designed for native multimodal agent applications, supporting text, image, and video inputs.
Container
Qwen3.5-35B-A3B is a multimodal vision-language Mixture-of-Experts model designed for native multimodal agent applications, supporting text, image, and video inputs.
Container
Step 3.5 Flash is a sparse Mixture-of-Experts (MoE) large language model developed by StepFun, engineered to deliver frontier reasoning and agentic capabilities with exceptional efficiency.
Container
Moonshot AI
kimi-k2.5-Turbo
This turbo container houses the Kimi K2.5 model, an open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed visual and text tokens atop Kimi-K2-Base.
Container
Mistral Small 4 is a powerful hybrid model that can act as both a general instruction model and a reasoning model.
Container
The NVIDIA Ising Calibration 1 NIM houses the NVIDIA-Ising-Calibration-1-35B-A3B-BF16 model, which is a purpose-built Mixture-of-Experts vision-language model (MoE VLM) built on Qwen3.5-35B-A3B,
Container
Z.Ai
GLM-5
GLM-5 is a next-generation large language model targeting complex systems engineering and long-horizon agentic tasks.
Container
Gemma 4 31B IT is an open multimodal model built by Google DeepMind that handles text and image inputs, can process video as sequences of frames, and generates text output.
Container
Nemotron Content Safety Reasoning 4B is a Large Language Model (LLM) classifier designed to function as a dynamic and adaptable guardrail for content safety and dialogue moderation (topic-following).
Container
Nemotron Nano V3 Omni is a multi-modal large language model that unifies video, audio, image, and text understanding to support enterprise-grade Q&A, summarization, transcription, and document intelligence workflows.
Container
NVIDIA
GLM-5.1
This container houses GLM-5.1, which is a next-generation flagship model for agentic engineering with significantly stronger coding capabilities than its predecessor GLM-5. The model achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5
Container
This container houses the model MiMo-V2-Flash.
Container
The Qwen3.6-27B NIM Container is a deployable inference container for serving Qwen3.6-27B, a third-party multimodal dense model capable of processing text, image, and video inputs for text generation.
Container
The Nemotron 3.5 Content Safety NIM container packages NVIDIA's small language model (SLM) that uses Google's Gemma-3-4B-it as the base and is fine-tuned by NVIDIA on multimodal, multilingual, and reasoning-oriented content-safety datasets.
Container
Deepseek AI
DeepSeek-V4-Pro
The DeepSeek-V4-Pro Container is a deployable inference container for serving DeepSeek-V4-Pro, a third-party sparse Mixture-of-Experts language model for reasoning, coding, and agentic tasks.
Container
This NIM container houses the Nemotron 3 Content Safety model, a small language model (SLM) that uses Google's Gemma-3-4B-it as the base and is fine-tuned by NVIDIA on multimodal and multilingual content-safety datasets.
Container
Mistral-Medium-3.5-128B is a dense 128B vision-language model (VLM) with a 256k context window, handling instruction-following, reasoning, and coding in a single set of weights.
Container
The Nemotron-3-Ultra-550B-A55B NIM container packages NVIDIA's large language model featuring a hybrid Latent Mixture-of-Experts (LatentMoE) architecture with Multi-Token Prediction (MTP) layers.
Container
Gemma 4 26B A4B IT is a Google multimodal instruction-tuned model packaged as an NVIDIA NIM container for deployment through NVIDIA NGC as a Downloadable NIM.
Container
The DiffusionGemma-4-26B-A4B-IT model is an open-weights multimodal generative model developed by Google DeepMind that processes text, image, and video inputs to produce text output via discrete diffusion.
Container
