NeMo Retriever

Description
NeMo Retriever is a collection of NIMs and reference architectures for retrieval-augmented generation (RAG).
Modified
March 14, 2025

What is NeMo Retriever

NeMo Retriever provides easy access to state-of-the-art models that are foundational building blocks for enterprise semantic search applications, delivering accurate answers quickly at scale. Developers can use the models' APIs to build robust copilots, chatbots, and AI assistants from start to finish. Text Retriever NIM models are built on the NVIDIA software platform, incorporating CUDA, TensorRT, and Triton to offer out-of-the-box GPU acceleration.

  • NeMo Retriever Text Embedding NIM - Boosts text question-answering retrieval performance, providing high-quality embeddings for many downstream NLP tasks. For more information, see the Text Embedding NIM documentation.

  • NeMo Retriever Text Reranking NIM - Includes a fine-tuned reranker that refines the retrieval step by finding the most relevant passages to provide as context when querying an LLM.
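
The way the two NIMs complement each other can be sketched as a two-stage pipeline: embeddings select a broad set of candidate passages, and a reranker then reorders that short list. The sketch below is illustrative only. `toy_embed` and `toy_rerank_score` are hypothetical stand-ins for calls to an embedding model and a reranking model, and `retrieve_then_rerank`, `top_k`, and `top_n` are names chosen for this example, not part of any NVIDIA API.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]

def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def retrieve_then_rerank(
    query: str,
    passages: Sequence[str],
    embed: Callable[[str], Vector],
    rerank_score: Callable[[str, str], float],
    top_k: int = 3,
    top_n: int = 1,
) -> List[str]:
    """Stage 1: keep the top_k passages by embedding similarity.
    Stage 2: reorder those candidates with the reranker and keep top_n."""
    q_vec = embed(query)
    candidates = sorted(
        passages, key=lambda p: cosine(q_vec, embed(p)), reverse=True
    )[:top_k]
    return sorted(
        candidates, key=lambda p: rerank_score(query, p), reverse=True
    )[:top_n]

# Hypothetical stand-ins so the sketch runs without any model or service.
VOCAB = ("gpu", "embedding", "rerank", "license")

def toy_embed(text: str) -> List[float]:
    """Bag-of-words counts over a tiny fixed vocabulary (not a real embedding)."""
    words = text.lower().split()
    return [float(words.count(w)) for w in VOCAB]

def toy_rerank_score(query: str, passage: str) -> float:
    """Number of distinct words shared by query and passage (not a real reranker)."""
    return float(len(set(query.lower().split()) & set(passage.lower().split())))

PASSAGES = [
    "gpu acceleration for embedding models",
    "rerank passages with a gpu",
    "license terms and conditions",
]

best = retrieve_then_rerank(
    "gpu embedding", PASSAGES, toy_embed, toy_rerank_score, top_k=2, top_n=1
)
print(best)
```

In a real deployment the first stage would typically query a vector index rather than score every passage, and the reranker would be applied only to the short candidate list because it is the more expensive step.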

Enterprise-Ready Features

Text Embedding NIM comes with enterprise-ready features, such as a high-performance inference server, flexible integration, and enterprise-grade security.

  • High Performance: Text Embedding NIM is optimized for high-performance deep learning inference with NVIDIA TensorRT™ and NVIDIA Triton™ Inference Server.

  • Scalable Deployment: Text Embedding NIM seamlessly scales from a few users to millions.

  • Flexible Integration: Text Embedding NIM can be easily incorporated into existing data pipelines and applications. Developers are provided with an OpenAI-compatible API in addition to custom NVIDIA extensions.

  • Enterprise-Grade Security: Text Embedding NIM comes with security features such as the use of safetensors, continuous patching of CVEs, and constant monitoring with our internal penetration tests.
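
As a rough illustration of the OpenAI-compatible API mentioned under Flexible Integration, the sketch below builds a request in the shape of the OpenAI embeddings endpoint (`POST /v1/embeddings` with `model` and `input`). Several details are assumptions to check against the Text Embedding NIM documentation: the base URL `http://localhost:8000`, the model identifier, and the `input_type` field, which is shown here as an example of a custom extension. The helper names `build_embedding_request` and `embed` are hypothetical.

```python
import json
import urllib.request
from typing import List, Sequence

BASE_URL = "http://localhost:8000"   # assumption: a locally deployed NIM
MODEL = "nvidia/nv-embedqa-e5-v5"    # assumption: replace with your deployed model id

def build_embedding_request(
    texts: Sequence[str], model: str = MODEL, input_type: str = "query"
) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style embeddings request."""
    if input_type not in ("query", "passage"):
        raise ValueError("input_type must be 'query' or 'passage'")
    payload = {
        "model": model,
        "input": list(texts),
        # Assumed extension field, not part of the OpenAI schema.
        "input_type": input_type,
    }
    return urllib.request.Request(
        f"{BASE_URL}/v1/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def embed(texts: Sequence[str], **kwargs) -> List[List[float]]:
    """Send the request and return one vector per input text.
    Requires a running service at BASE_URL."""
    req = build_embedding_request(texts, **kwargs)
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # OpenAI-style responses carry the vectors under data[i].embedding.
    return [item["embedding"] for item in body["data"]]

# Building the request needs no network access; calling embed() does.
request = build_embedding_request(["What is NeMo Retriever?"], input_type="query")
```

Because the request shape follows the OpenAI convention, an existing OpenAI client library pointed at the service's base URL may work as well, with extension fields passed through whatever extra-body mechanism that client offers.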

Compatible Infrastructure Software Versions

For optimal performance, deploy the supported NVIDIA AI Enterprise Infrastructure software with this NIM.

  • NVIDIA AI Enterprise Infrastructure 5

Getting Started with NVIDIA NIM

Deploying and integrating NVIDIA NIM is straightforward thanks to our APIs. Follow the documentation and tutorials, and use the community support forums, to get the most out of NIM.

Get Help

Enterprise Support

Get access to knowledge base articles and support cases or submit a ticket.

NVIDIA AI Enterprise Documentation

Visit the NVIDIA AI Enterprise Documentation Hub for release documentation, deployment guides and more.

NVIDIA Licensing Portal

Go to the NVIDIA Licensing Portal to manage the software licenses for your products.

License

This NIM is licensed under the NVIDIA AI Product Agreement. By downloading and using the artifacts in this collection, you accept the terms and conditions of this license.