NeMo Retriever | NVIDIA NGC

NGC Catalog

CLASSIC

Welcome Guest

For contents of this collection and more information, please view on a desktop device.

Associated Products

Features

Description

Nemo Retriever is a collection of NIMs reference architectures for RAG

Curator

Modified

March 14, 2025

Containers

Helm Charts

Models

Resources

What is NeMo Retriever

NeMo Retriever provides easy access to state-of-the-art models that are foundational building blocks for enterprise semantic search applications, delivering accurate answers quickly at scale. Developers can use these APIs to create robust copilots, chatbots, and AI assistants from start to finish. Text Retriever NIM models are built on the NVIDIA software platform, incorporating CUDA, TensorRT, and Triton to offer out-of-the-box GPU acceleration.

NeMo Retriever Text Embedding NIM - Boosts text question-answering retrieval performance, providing high quality embeddings for many downstream NLP tasks. For more information, see the Text Embedding NIM documentation.
NeMo Retriever Text Reranking NIM - Includes a fine-tuned reranker and boosts the retrieval process, finding most relevant passages to provide as context when querying an LLM.

Enterprise-Ready Features

Text Embedding NIM comes with enterprise-ready features, such as a high-performance inference server, flexible integration, and enterprise-grade security.

High Performance: Text Embedding NIM is optimized for high-performance deep learning inference with NVIDIA TensorRTTM and NVIDIA TritonTM Inference Server.
Scalable Deployment: Text Embedding NIM seamlessly scales from a few users to millions.
Flexible Integration: Text Embedding NIM can be easily incorporated into existing data pipelines and applications. Developers are provided with an OpenAI-compatible API in addition to custom NVIDIA extensions.
Enterprise-Grade Security: Text Embedding NIM comes with security features such as the use of safetensors, continuous patching of CVEs, and constant monitoring with our internal penetration tests.

Compatible Infrastructure Software Versions

For optimal performance, deploy the supported NVIDIA AI Enterprise Infrastructure software with this NIM.

NVIDIA AI Enterprise Infrastructure 5

Getting Started with NVIDIA NIM

Deploying and integrating NVIDIA NIM is straightforward thanks to our API's. Follow the documentation, tutorials, and community support forums to make the most of this revolutionary language processing tool.

Get Help

Enterprise Support

Get access to knowledge base articles and support cases or submit a ticket.

NVIDIA AI Enterprise Documentation

Visit the NVIDIA AI Enterprise Documentation Hub for release documentation, deployment guides and more.

NVIDIA Licensing Portal

Go to the NVIDIA Licensing Portal to manage your software licenses. licensing portal for your products. Get Your Licenses

License

This NIM is licensed under the NVIDIA AI Product Agreement. By downloading and using the artifacts in this collection, you accept the terms and conditions of this license.