NeMo Retriever provides easy access to state-of-the-art models that are foundational building blocks for enterprise semantic search applications, delivering accurate answers quickly at scale. Developers can use these APIs to create robust copilots, chatbots, and AI assistants from start to finish. Text Retriever NIM models are built on the NVIDIA software platform, incorporating CUDA, TensorRT, and Triton to offer out-of-the-box GPU acceleration.
NeMo Retriever Text Embedding NIM - Boosts text question-answering retrieval performance, providing high quality embeddings for many downstream NLP tasks. For more information, see the Text Embedding NIM documentation.
NeMo Retriever Text Reranking NIM - Includes a fine-tuned reranker and boosts the retrieval process, finding most relevant passages to provide as context when querying an LLM.
Text Embedding NIM comes with enterprise-ready features, such as a high-performance inference server, flexible integration, and enterprise-grade security.
High Performance: Text Embedding NIM is optimized for high-performance deep learning inference with NVIDIA TensorRTTM and NVIDIA TritonTM Inference Server.
Scalable Deployment: Text Embedding NIM seamlessly scales from a few users to millions.
Flexible Integration: Text Embedding NIM can be easily incorporated into existing data pipelines and applications. Developers are provided with an OpenAI-compatible API in addition to custom NVIDIA extensions.
Enterprise-Grade Security: Text Embedding NIM comes with security features such as the use of safetensors, continuous patching of CVEs, and constant monitoring with our internal penetration tests.
For optimal performance, deploy the supported NVIDIA AI Enterprise Infrastructure software with this NIM.
Deploying and integrating NVIDIA NIM is straightforward thanks to our API's. Follow the documentation, tutorials, and community support forums to make the most of this revolutionary language processing tool.
Get access to knowledge base articles and support cases or submit a ticket.
Visit the NVIDIA AI Enterprise Documentation Hub for release documentation, deployment guides and more.
NVIDIA Licensing Portal
Go to the NVIDIA Licensing Portal to manage your software licenses. licensing portal for your products. Get Your Licenses
This NIM is licensed under the NVIDIA AI Product Agreement. By downloading and using the artifacts in this collection, you accept the terms and conditions of this license.