This dataset contains 10.34 hours of annotated male speech in the Sichuan dialect, suitable for text-to-speech synthesis. It comprises 9,595 utterances recorded from a 22-year-old male speaker.
Contact business@magicdatatech.com to learn more.
Sample: