NeMo Retriever Extraction

NVIDIA

Container

NVIDIA

NeMo Retriever Extraction

NeMo Retriever extraction is a scalable, performance oriented document content and metadata extraction microservice.

NVIDIA NeMo Microservices NVIDIA Developer Program

Join or Subscribe to get accessSubscribe to the product below to access this premium content:

NVIDIA NeMo MicroservicesNeMo provides microservices that simplify the generative AI development and deployment process at scale, allowing organizations to connect LLMs to their enterprise data sources.

NVIDIA Developer ProgramJoin the Developer Program for access to free tools, support, and tech resources.

Get Access

Note: You can gain access to hundreds more GPU-optimized artifacts by creating a free NGC account.

Already Subscribed?Log in

NeMo Retriever extraction also known as NVIDIA Ingest and nv-ingest

NeMo Retriever extraction is a scalable, performance oriented document content and metadata extraction microservice. Including support for parsing PDFs, Word and PowerPoint documents, nv-ingest uses specialized nvidia image NIMs to find, contextualize, and extract text, tables, charts and images for use in downstream generative applications.

NeMo Retriever extraction enables parallelization of splitting documents into pages where artifacts are classified (such as text, tables, charts, and images), extracted, and further contextualized through optical character recognition (OCR) into a well defined JSON schema. From there, NeMo Retriever extraction can optionally manage computation of embeddings for the extracted content, and optionally manage storing into a vector database Milvus.

Documentation

For more details, please vis the NVIDIA Ingest GitHub Repository.

Governing Terms

The container is governed by NVIDIA Agreements | Enterprise Software | NVIDIA Software License Agreement and NVIDIA Agreements | Enterprise Software | Product Specific Terms for AI Product; and the NeMo Retriever extraction is released under the Apache-2.0 license.

You are responsible for ensuring that your use of NVIDIA AI Foundation Models complies with all applicable laws.

Publisher

NVIDIA

Latest Tag26.3.0

UpdatedMarch 17, 2026 UTC

Compressed Size1.12 GB

Multinode SupportNo

Multi-Arch SupportYes

System

signed images

Labels

NSPECT-DFYJ-JJ49