Dataset Overview

Dataset Type

speech corpus for TTS

Language

en-US

Speech Style

scripted

Content

daily-use sentences

Audio Parameters

48 kHz, 24 bits

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Neumann U87-Neve 1073-RME Fireface

Recording Environment

recording studio
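
The audio parameters listed above can be checked against a delivered file. The following is a minimal sketch using Python's standard `wave` module, assuming the files are plain PCM WAV. The function name `check_wav` is hypothetical, and the channel count is not stated in this listing, so it is not checked.

```python
import wave

EXPECTED_RATE_HZ = 48_000      # 48 kHz, from the listing
EXPECTED_SAMPLE_WIDTH = 3      # 24 bits = 3 bytes per sample


def check_wav(path):
    """Return a list of mismatches between a WAV header and the listed parameters."""
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        if rate != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {rate} Hz, expected {EXPECTED_RATE_HZ}")
        if width != EXPECTED_SAMPLE_WIDTH:
            problems.append(
                f"sample width is {width * 8} bits, expected {EXPECTED_SAMPLE_WIDTH * 8}"
            )
    return problems
```

An empty list means the header matches the listed sample rate and bit depth. This only inspects the header and says nothing about recording quality.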

American English Speech Corpus for TTS — Male Voice

MDT-TTS-E009 | 10,246 annotated utterances from a male American English speaker, suitable for text-to-speech synthesis

License

This dataset contains 15 hours of annotated male speech in American English for text-to-speech synthesis, comprising 10,246 utterances recorded from a 24-year-old male speaker.
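
The figures above imply an average utterance length and a rough uncompressed audio size. The short calculation below is a sketch under stated assumptions: the 15-hour total is taken as exact although it is likely rounded, and mono audio is assumed because the listing does not state a channel count.

```python
TOTAL_HOURS = 15            # from the listing (likely rounded)
UTTERANCES = 10_246         # from the listing
SAMPLE_RATE_HZ = 48_000     # 48 kHz
BYTES_PER_SAMPLE = 3        # 24 bits
CHANNELS = 1                # assumption: channel count is not stated

total_seconds = TOTAL_HOURS * 3600
avg_utterance_seconds = total_seconds / UTTERANCES
raw_pcm_bytes = total_seconds * SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS

print(f"{avg_utterance_seconds:.2f} s per utterance on average")  # 5.27 s
print(f"{raw_pcm_bytes / 1e9:.2f} GB of raw PCM audio")           # 7.78 GB
```

Under these assumptions, each utterance averages a little over five seconds, which is consistent with the short daily-use sentences described in the overview.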

Contact business@magicdatatech.com to learn more.
