Datasets Type

Domain

Language

Content Type

Accent

Speak Speed

Industry

Scenario

published at December 15, 2021
Open Source
NLP Corpus
100 sentences
100 paragraphs
This dataset contains 100 pieces of news.
published at November 4, 2021
Proprietary
TTS Corpus
MDT-TTS-D005 | 5,600 sentence parsinf sentences in American English applicable for Text-to-Speech Synthesis
published at November 4, 2021
Proprietary
TTS Corpus
15 hours
MDT-TTS-E009 | 10,246 utterances of annotated male voices in American English applicable for Text-to-Speech Synthesis
published at October 28, 2021
Proprietary
TTS Corpus
2.13 hours
MDT-TTS-E018 | 1,926 utterances of annotated female voices in American English applicable for Text-to-Speech Synthesis
published at October 25, 2021
Proprietary
TTS Corpus
28 hours
MDT-TTS-F003 | 30,000 utterances of annotated male and female voices in Mandarin Chinese and Chinese Engish applicable for Text-to-Speech Synthesis
published at October 15, 2021
Proprietary
ASR Corpus
405 hours
MDT-ASR-B011 | MDT-ASR-D020 405 hours of transcribed American English scripted speech on daily use sentences
published at September 28, 2021
Proprietary
ASR Corpus
832 hours
MDT-ASR-E024 | MDT-ASR-E025 | 832 hours of transcribed Thai English scripted speech
published at September 28, 2021
Proprietary
ASR Corpus
313 hours
MDT-ASR-E027 | MDT-ASR-E038 | 313 hours of transcribed Turkish English scripted speech on keyword spotting and daily use sentences
published at September 28, 2021
Proprietary
ASR Corpus
37 hours
MDT-ASR-E055 | 37 hours of transcribed Malay English Scripted Speech
published at September 28, 2021
Proprietary
ASR Corpus
250 hours
MDT-ASR-F025 | MDT-ASR-F026 | 250 hours of transcribed Filipino English Scripted Speech
published at September 28, 2021
Proprietary
ASR Corpus
1395 hours
MDT-ASR-E014 | MDT-ASR-E062 | 1395 hours of transcribed Malay English Scripted Speech
published at September 23, 2021
Proprietary
ASR Corpus
29 hours
MDT-ASR-F011 | 28 hours of transcribed Singaporean English scripted speech
published at September 23, 2021
Proprietary
ASR Corpus
28 hours
MDT-ASR-F010 | 28 hours of transcribed Singaporean English scripted speech
published at September 23, 2021
Proprietary
ASR Corpus
632 hours
MDT-ASR-F012 | MDT-ASR-F013 | 632 hours of transcribed Singaporean English scripted speech on daily use sentences
published at September 23, 2021
Proprietary
ASR Corpus
1386 hours
MDT-ASR-D004 | MDT-ASR-E075 | 1,386 hours of transcribed Indian English scripted speech on daily use sentences
published at September 23, 2021
Proprietary
ASR Corpus
645 hours
MDT-ASR-E070 | 645 hours of transcribed Indonesian English scripted speech on daily use sentences
published at September 22, 2021
Proprietary
ASR Corpus
180 hours
MDT-ASR-D021 | 180 hours of transcribed English conversational speech
published at September 22, 2021
Proprietary
ASR Corpus
356 hours
MDT-ASR-C004 | MDT-ASR-D008 | 356 hours of transcribed English conversational speech
published at September 16, 2021
Proprietary
ASR Corpus
64 hours
MDT-ASR-A009 | 64 hours of transcribed English scripted speech on command and query
published at September 16, 2021
Proprietary
ASR Corpus
1172 hours
MDT-ASR-A004 | MDT-ASR-E042 | 1,172 hours of transcribed Chinese English scripted speech