Dataset Overview

Dataset Type

speech corpus for TTS

Language

zh-CN

Speech Style

scripted monologue

Content

daily-use sentence

Audio Parameters

48 kHz, 24 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

Nuemann U87-Neve 1073-RME Fireface

Recording Environment

Recording Studio

Popular Datasets

Proprietary
1602 hours
ASR Corpus
Proprietary
427 hours
ASR Corpus
Proprietary
1386 hours
ASR Corpus
Proprietary
313 hours
ASR Corpus
Open Source
NLP Corpus
2 KB
Proprietary
973 hours
ASR Corpus
Proprietary
65 hours
ASR Corpus
Proprietary
TTS Corpus
16006 sentences

Mandarin Chinese Speech Corpus for TTS – Daily Use Sentences

MDT-TTS-F010 | 16,006 utterances of annotated female voices in Mandarin Chinese applicable for Text-to-Speech Synthesis

Dataset Overview

Dataset Type

speech corpus for TTS

Language

zh-CN

Speech Style

scripted monologue

Content

daily-use sentence
48 kHz, 24 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

Nuemann U87-Neve 1073-RME Fireface

Recording Environment

Recording Studio

License

This dataset consists of 20 hours of annotated female voices in Mandarin Chinese that is applicable for Text-to-Speech Synthesis, where 16,006 utterances collected from a 38-year-old women were contained.

Contact business@magicdatatech.com to learn more.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}