Dataset Overview

Dataset Type

speech corpus for TTS

Language

en-US

Speech Style

scripted

Content

daily-use sentences

Audio Parameters

48 kHz, 24 bits

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Sennheiser MKH 416
SSL2 by Solid State Logic

Recording Environment

recording studio

American English Speech Corpus for TTS

MDT-TTS-E018 | 1,926 utterances of annotated female voices in American English applicable for Text-to-Speech Synthesis

License

This dataset contains 2.13 hours of annotated female speech in American English for text-to-speech synthesis. It comprises 1,926 utterances recorded from a 27-year-old woman.
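As a rough illustration of how the listed figures could be checked on a local copy, the sketch below computes the average utterance length implied by the totals (about 4 seconds) and tests whether a WAV file matches the stated 48 kHz, 24-bit parameters. The helper names `check_wav` and `make_silence` are hypothetical, and the mono channel count used for the test fixture is an assumption, since the listing does not state it. The sketch relies only on Python's standard `wave` module and assumes files with a plain PCM header.

```python
import io
import wave

EXPECTED_RATE_HZ = 48_000   # from the listing: 48 kHz
EXPECTED_SAMPLE_WIDTH = 3   # 24 bits = 3 bytes per sample


def check_wav(source):
    """Return (matches_spec, duration_seconds) for a WAV path or file-like object."""
    with wave.open(source, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        duration = wav.getnframes() / rate
    matches = rate == EXPECTED_RATE_HZ and width == EXPECTED_SAMPLE_WIDTH
    return matches, duration


def make_silence(seconds=1.0):
    """Build an in-memory 48 kHz / 24-bit WAV of silence (fixture only; mono is assumed)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(EXPECTED_SAMPLE_WIDTH)
        wav.setframerate(EXPECTED_RATE_HZ)
        frames = int(seconds * EXPECTED_RATE_HZ)
        wav.writeframes(b"\x00" * frames * EXPECTED_SAMPLE_WIDTH)
    buf.seek(0)
    return buf


# Average utterance length implied by the listing's totals.
TOTAL_HOURS = 2.13
UTTERANCES = 1926
avg_seconds = TOTAL_HOURS * 3600 / UTTERANCES

if __name__ == "__main__":
    print(f"average utterance length: {avg_seconds:.2f} s")
    print(check_wav(make_silence(1.0)))
```

On a real copy of the corpus, `check_wav` would be called with each file path in place of the in-memory fixture.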

Contact business@magicdatatech.com to learn more.
