This dataset consists of 40 hours of annotated female voices in Mandarin Chinese that is applicable for Text-to-Speech Synthesis, where 21,342 utterances collected from a 22-year-old woman were contained.
Contact business@magicdatatech.com to learn more.
Sample: