NVIDIA Retrieval QA Llama 3.2 1B Embedding v2 PB6

NVIDIA

Container

NVIDIA

NVIDIA Retrieval QA Llama 3.2 1B Embedding v2 PB6

The NVIDIA Retrieval QA Llama3.2 1b Embedding NIM is an embedding NIM optimized for multilingual and crosslingual text question-answering retrieval.Please add description

NVIDIA AI Enterprise

NVIDIA AI Enterprise Supported NVIDIA NIM

Subscribe to get accessSubscribe to the product below to access this premium content:

NVIDIA AI EnterpriseAccelerate your AI agent development

Subscribe Now

Note: You can gain access to hundreds more GPU-optimized artifacts by creating a free NGC account.

Already Subscribed?Log in

Text Embedding NIM

NeMo Retriever Text Embedding NIM (Text Embedding NIM) brings the power of state-of-the-art text embedding models to your applications, offering unparalleled natural language processing and understanding capabilities. You can use Text Retriever NIM for semantic search, Retrieval Augmented Generation (RAG) pipelines—or any application that uses text embeddings. Text Embedding NIM is built on the NVIDIA software platform, incorporating CUDA, TensorRT, and Triton to offer out-of-the-box GPU acceleration.

What Is Llama 3.2 NeMo Retriever Embedding 1B NIM Production Branch 6?

The Llama 3.2 NeMo Retriever Embedding 1B NIM Production Branch, exclusively available with NVIDIA AI Enterprise, is a 9-month supported, API-stable branch that includes monthly fixes for high and critical software vulnerabilities. This branch provides a stable and secure environment for building your mission-critical AI applications. The Llama 3.2 NeMo Retriever Embedding 1B NIM production branch releases every six months with a three-month overlap in between two releases.

Getting started with the NIM

Before you start, ensure that your environment is set up by following one of the deployment guides available in the NVIDIA AI Enterprise Documentation.

Deploying and integrating the NIM is straightforward thanks to our industry standard APIs. Visit the NIM Container page for release documentation, deployment guides and more.

Government Ready: STIG/FIPS Hardening

This ensures the highest level of security and compliance for regulated environments, the x86 container image for this branch is:

STIG Ubuntu 24.04 hardened
Supports FIPS 140-2 / 3 validated crypto / uses libraries that support FIPS crypto

Learn more about NVIDIA's hardened image in the AI Software for Regulated Environments White Paper.

Compatible Infrastructure Software Versions

For the optimized performance, it is highly recommended to deploy the supported NVIDIA AI Enterprise Infrastructure software in conjunction with your AI software. Production Branch 6 (PB6) is compatible with NVIDIA AI Enterprise Infrastructure 8.

Security Vulnerabilities in Open Source Packages

Please review the Security Scanning tab to view the latest security scan results.

For certain open-source vulnerabilities listed in the scan results, NVIDIA provides a response in the form of a Vulnerability Exploitability eXchange (VEX) document. The VEX information can be reviewed and downloaded from the Security Scanning tab.