MagicData

Dataset Overview

Dataset Type

speech corpus for TTS

Language

zh&en-CN

Speech Style

Scripted

Content

daily-use sentence

Audio Parameters

48 kHz, 16 bits

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Neumann U87-Neve 1073-RME Fireface

Recording Environment

recording studio
Proprietary
TTS Corpus
28 hours

TTS-SDemoBalCSC: A Scripted Demographic-Balanced Chinese Speech Corpus

MDT-TTS-F003 | 30,000 utterances of annotated male and female voices in Mandarin Chinese and Chinese English, applicable to text-to-speech synthesis

This dataset consists of 28 hours of annotated male and female voices in Mandarin Chinese and Chinese English, applicable to text-to-speech synthesis. It contains 30,000 utterances contributed by 25 men and 25 women.

Contact business@magicdatatech.com to learn more.
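
The listed audio parameters (48 kHz, 16 bits, WAV PCM) can be checked on any delivered file with Python's standard `wave` module. The following is a minimal sketch, not a vendor tool: the function names and the file path in the usage note are hypothetical, and the channel count is left unchecked because the listing does not state it.

```python
import wave

# Values taken from the "Audio Parameters" entry of the listing.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample


def matches_listed_format(path):
    """Return True if the WAV file at `path` is 48 kHz, 16-bit PCM.

    wave.open raises wave.Error for files it cannot parse as PCM WAV.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
        )


def duration_seconds(path):
    """Return the duration of the WAV file at `path` in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

For example, `matches_listed_format("utterance_0001.wav")` (a hypothetical file name) would confirm a file before it enters a training pipeline. Summing `duration_seconds` over all files gives a total to compare with the stated 28 hours. If that figure counts only utterance audio, 28 × 3,600 s / 30,000 utterances works out to about 3.4 seconds per utterance on average.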

License
