Nemotron-3-8B-Chat-SteerLM is an 8 billion parameter generative language model based on the Nemotron-3-8B base model. It has been customized for user control of model outputs during inference using the SteerLM method developed by NVIDIA.
Llama 2 SteerLM Chat is a large language model, aligned using the SteerLM technique developed by NVIDIA. This allows you to adjust the preferred style of response to attributes (such as creativity, complexity and verbosity) at inference time.
A collection of easy to use, highly optimized Deep Learning Models for Recommender Systems. Deep Learning Examples provides Data Scientist and Software Engineers with recipes to Train, fine-tune, and deploy State-of-the-Art Models
Clara NLP is a collection of SOTA biomedical pre-trained language models as well as highly optimized pipelines for training NLP models on biomedical and clinical text
Clara Parabricks is a collection of software tools and notebooks for next generation sequencing, including short- and long-read applications. These tools are designed to be scalable, generating highly accurate results in an accelerated compute environmen
CUDA Toolkit provides the core, foundational development environment for creating high performance NVIDIA GPU-accelerated applications for diverse workloads from high performance computing, data science analytics and AI.
NVIDIA DeepStream SDK enables developers to build accelerated pipelines for a wide range of use-cases such as IVA, retail, industrial inspection, and many more with minimal development effort.