Riva Speech Skills

NGC Catalog

CLASSIC

Welcome Guest

For contents of this collection and more information, please view on a desktop device.

Description

Collection of common Riva Speech Skills resources.

Curator

NVIDIA

Modified

March 19, 2025

Containers

Helm Charts

Models

Resources

What Is Riva?

NVIDIA Riva is a GPU-accelerated SDK for building speech AI applications that are customized for your use case and deliver real-time performance. Riva offers pre-trained speech models in NVIDIA NGC that can be fine-tuned with NVIDIA NeMo on a custom data set, accelerating the development of domain-specific models.

Some of the major tasks that you can perform using Riva are:

Customizing a model with your data Using NeMo, you can use a custom-trained model in Riva.

Deploying a model in Riva Riva is designed for speech AI at scale. To help you efficiently serve models across different servers robustly, NVIDIA provides push-button model deployment using Helm charts.

Supported NeMo models can be easily exported, optimized, and deployed as a speech service on premises or in the cloud with a single command using Helm charts.

Riva's high performance inference is powered by NVIDIA TensorRT optimizations and served using the NVIDIA Triton Inference Server. Riva services are available as gRPC-based microservices for low-latency streaming, as well as high-throughput offline use cases. Riva is fully containerized and can easily scale to hundreds and thousands of parallel streams.

What's Included In The Riva Collection?

Containers

Riva Speech Server: Riva Speech Skills is a Docker image containing a toolkit for production-grade conversational AI inference. The Riva Speech API server exposes a simple API for performing speech recognition, speech synthesis, and a variety of NLP inferences. No GPU is required to run the sample clients.

Resources

Riva Quick Start Scripts: Riva includes Quick Start scripts to help you get started with Riva AI Services. These scripts are meant for deploying the services locally for testing and running our example applications.

Models

Models: Trainable and deployable versions of Riva Automatic Speech Recognition (ASR) and Speech Synthesis models.

Helm Charts

Riva Speech Skills Helm chart: The Helm chart is used to deploy ASR, NLP, and text-to-speech (TTS) services automatically. Specifically, it is designed to automate the steps for push-button deployment to a Kubernetes cluster.

Getting Started With Riva

Refer to the Riva Skills Quick Start page for step-by-step instructions on getting started with Riva.

License

By downloading and using Riva software, you accept the terms and conditions of this license.

Technical Blogs

Building Transcription and Entity Recognition Apps using Riva

What Is Riva?

What's Included In The Riva Collection?

Containers

Resources

Models

Helm Charts

Getting Started With Riva

License

Technical Blogs

Documentation