What Is Riva Deployment Helm Chart?
Riva Speech Skills Helm chart can be used to deploy ASR, NMT, and TTS services automatically. Specifically, it is designed to automate the steps for push-button deployment to a Kubernetes cluster.
Running Riva Deployment Helm Chart
The Helm chart configuration can be modified for your use case by modifying the values.yaml file. In this file, you can change the settings related to which models to deploy, where to store them, and how to expose the services.
Procedure
-
Download and modify the Helm chart for your use.
helm fetch https://helm.ngc.nvidia.com/nvidia/riva/charts/riva-api-2.19.1.tgz --username='$oauthtoken' --password=<YOUR API KEY>Where
<YOUR API KEY>is your API key. The result of the above operation will be a new directory calledriva-apiin your current working directory. Within that directory is avalues.yamlfile which can be modified to suit your use case. -
After the
values.yamlfile has been updated to reflect the deployment requirements, Riva can be deployed to the Kubernetes cluster.helm install riva-api riva-apiAlternatively, use the
--setoption to install without modifying thevalues.yamlfile. Make sure to set the NGC API key, email, andmodel_key_stringto the appropriate values. By default,model_key_stringistlt_encode.helm install riva-api riva-api --set ngcCredentials.password=`echo -n $NGC_API_KEY` --set ngcCredentials.email=your_email@your_domain.com --set modelRepoGenerator.modelDeployKey=`echo -n tlt_encode | base64 -w0`For additional details, see Kubernetes. For more information about using NGC, refer to the NGC Catalog User Guide.
Suggested Reading
For the latest release specific documentation, refer see the Release Notes.
For a full list of the supported hardware and software, refer to the Support Matrix.
For all other documentation, refer to the User Guide.
License
By downloading and using Riva software, you accept the terms and conditions of this license.