SearchSearch thousands of GPU-optimized Containers, pretrained Models, SDKs, and Helm charts—ready to accelerate AI, digital twins, and HPC from cloud to edge.
NVIDIA Enterprise
NVIDIA Enterprise
NVIDIA NIM
NVIDIA NIM
NIM Container GPUs
NIM Container GPUs
Use Case
Use Case
1
NVIDIA Platform
NVIDIA Platform
4
Industry
Industry
Solution
Solution
2
2
1
1
Publisher
Publisher
9
Policy
Policy
Displaying 9 results
HiFiGAN Speech Synthesis model
Model
End to End workflow for text to speech training with TAO Toolkit and deployment using Riva.
Resource
FastPitch+HiFiGAN End-to-End Speech Synthesis model trained on female English speech
Model
GAN-based waveform generator from mel-spectrograms.
Model
This model card includes two Mandarin Chinese models: 1) FastPitch Mel-spectrogram generator trained on SF Chinese/English Bilingual Speech dataset; 2) HiFiGAN vocoder trained on Mel-spectrograms predicted by the FastPitch.
Model
This collection includes two German models: FastPitch trained on the HUI-Audio-Corpus-German clean dataset where the 5-largest amount of speakers are selected and balanced; HiFiGAN is trained on mel-spectrograms predicted by the Multi-speaker FastPitch.
Model
FastSpeech2+HiFiGAN End-to-End Speech Synthesis model trained on female English speech
Model
GAN-based waveform generator from mel-spectrograms.
Model
HifiGAN is a neural vocoder model for text-to-speech applications. It is intended as the second part of a two-stage speech synthesis pipeline, with a mel-spectrogram generator such as FastPitch as the first stage.
Model