The NGC catalog offers hundreds of pre-trained models for computer vision, speech, recommendation, and more. Bring AI to market faster by using these models as-is, or quickly build proprietary models with a fraction of your custom data.
The CLIP (Contrastive Language-Image Pre-training) model combines vision and language using contrastive learning. It learns a shared embedding space for images and text, enabling tasks such as zero-shot image classification and serving as a backbone for object detection.
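As an illustration of how the shared image-text embedding space is used, the sketch below performs zero-shot image classification with a public CLIP checkpoint via the Hugging Face transformers API; the image path and label set are hypothetical, and the model hosted on NGC may be packaged or invoked differently.

```python
# Minimal sketch: zero-shot image classification with CLIP.
# Assumes the Hugging Face `transformers` implementation and a local
# image file "example.jpg" (hypothetical); the NGC-hosted model may
# use a different packaging or API.
from PIL import Image
import torch
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("example.jpg")  # hypothetical input image
labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]

# Encode the image and the candidate text labels together.
inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# Image-text similarity scores, turned into probabilities over the labels.
probs = outputs.logits_per_image.softmax(dim=-1)
for label, p in zip(labels, probs[0].tolist()):
    print(f"{label}: {p:.3f}")
```

Because classification is performed by comparing the image embedding against embeddings of arbitrary text prompts, the label set can be changed at inference time without retraining.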