This model is customized version of RIVA Conformer ASR English model with NeuralVAD enabled for voice activity. Conformer-CTC (around 120M parameters) is trained on ASRSet with over 35000 hours of English (en-US) speech. The model transcribes speech in lower case English alphabet along with spaces and apostrophes.
By downloading and using this software, you accept the terms and conditions of this license.