NVIDIA
VAD multilingual Marblenet
Model

MarbleNet voice activity detection (VAD) model trained on multilingual data.

Version: 1.10.0 (1 version available)
Date: 08/02/2022 8:33 PM UTC
Size: 490 KB
Epochs: 50
Batch Size: 256
Accuracy
AUROC on ALL: 0.9112
Dataset: AVA Speech
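
The accuracy figure above is an AUROC, the area under the ROC curve. It can be read as the probability that a randomly chosen speech frame receives a higher score than a randomly chosen non-speech frame. The page does not say how the number was computed, so the following is only a generic sketch of the metric in plain Python. The function name `auroc` and the toy labels and scores are illustrative assumptions, not data from this model.

```python
def auroc(labels, scores):
    """Pairwise (Mann-Whitney) estimate of the area under the ROC curve.

    labels: iterable of 0 (non-speech) or 1 (speech), hypothetical ground truth
    scores: iterable of predicted speech probabilities, one per frame
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative frame")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # speech frame ranked above non-speech frame
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy example only; these values are made up for illustration.
    print(auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

The pairwise loop is quadratic in the number of frames, which is fine for a small sanity check. For a full evaluation set, a rank-based implementation from an established library would be the more practical choice.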
Model
Architecture: MarbleNet
Outputs: frame-level voice activity prediction
Inputs: 16 kHz (16000 Hz) mono-channel audio (WAV files)
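
The Inputs row calls for mono-channel WAV audio sampled at 16000 Hz (16 kHz). A quick way to check a file against that requirement before running inference is sketched below, using only Python's standard `wave` module. The helper name `check_wav_format` and the constants are assumptions introduced for illustration; they are not part of the model or of any NVIDIA tooling.

```python
import wave

EXPECTED_RATE_HZ = 16000  # 16 kHz, taken from the Inputs row
EXPECTED_CHANNELS = 1     # mono


def check_wav_format(path):
    """Return a list of format problems; an empty list means the file matches."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()

    problems = []
    if rate != EXPECTED_RATE_HZ:
        problems.append(
            f"sample rate is {rate} Hz, expected {EXPECTED_RATE_HZ} Hz"
        )
    if channels != EXPECTED_CHANNELS:
        problems.append(
            f"file has {channels} channels, expected {EXPECTED_CHANNELS}"
        )
    return problems
```

Calling `check_wav_format("clip.wav")` on a hypothetical file returns an empty list when the audio is already 16 kHz mono. Otherwise the messages indicate whether resampling or downmixing is needed first. The `wave` module reads only uncompressed PCM WAV files, so other encodings would need a different reader.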
