This dataset contains 15 hours of annotated male speech in American English, suitable for text-to-speech synthesis. It comprises 10,246 utterances recorded from a 24-year-old male speaker.
Contact business@magicdatatech.com to learn more.
Sample: