Chunghwa Telecom Laboratories
SF Bilingual Speech in Chinese and English
Resource
Chunghwa Telecom Laboratories
SF Bilingual Speech in Chinese and English

A bilingual (Mandarin-English) Speech Dataset.

Sign in to access all content for this ResourceSigning in will also allow download accessSign In

Get ready-to-use bilingual Chinese and English speech dataset.

This dataset is used for training bilingual (Chinese and English) Text-to-Speech models, including training FastPitch acoustic model with NVIDIA Deep Learning Examples FastPitch training recipe. The dataset contains about 2,740 bilingual audio samples of a single female speaker and their corresponding text transcripts, each of them is an audio of around 5-6 seconds and have a total length of approximately 4.5 hours.

The dataset is provided and shared by Chunghwa Telecom Laboratories. By downloading and using this dataset, you accept the terms and conditions of the license, CC BY-NC 4.0.

Publisher
Chunghwa Telecom Laboratories
Latest Versionv1
UpdatedApril 4, 2023 UTC
Compressed Size477.68 MB

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.