This dataset contains 10 hours of annotated female speech in the Sichuan dialect, suitable for text-to-speech synthesis. It comprises 9,566 utterances collected from a 19-year-old woman.
Contact business@magicdatatech.com to learn more.
Sample: