This collection contains end-to-end neural models for Text to Speech (TTS) that can be trained using TAO Toolkit and deployed with Riva. The models in this collection can be used for synthesizing speech from text. The current TTS pipeline requires two models:
TAO Toolkit supports training models of the following architectures from scratch:
For more information on how to train the end-to-end TTS models using TAO Toolkit and deploy them to Riva, refer to the TAO Toolkit Text-To-Speech documentation.
Deployable Riva models for FastPitch and HiFiGAN are available with this collection:
By downloading and using the models and resources packaged with TAO Conversational AI, you accept the terms of the Riva license.
NVIDIA’s platforms and application frameworks enable developers to build a wide array of AI applications. Consider potential algorithmic bias when choosing or creating the models being deployed. Work with the model’s developer to ensure that it meets the requirements for the relevant industry and use case; that the necessary instruction and documentation are provided to understand error rates, confidence intervals, and results; and that the model is being used under the conditions and in the manner intended.