MagicData

sign in

Total Size: 46 MB

Dataset Overview

Dataset Type

speech corpus
for TTS

Language

zh-CN,
Mandarin Chinese (China)

Speech Style

scripted monologue

Content

daily use sentences, fables, and stories

Audio Parameters

44.1 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Philips K38003

Recording Environment

quiet indoor
environment

License

Magic Data
open-source license

Open Source
TTS Corpus

TTS-SCFChilSC: A Scripted Chinese Female Child's Speech Corpus

224 utterances of annotated female voices in Mandarin Chinese applicable for Text-to-Speech Synthesis

This open-source dataset consists of 15 minutes of annotated female voices in Mandarin Chinese that is applicable for Text-to-Speech Synthesis, where 224 utterances collected from a five-year-old girl were contained.

Sample:

小刺猬#1向#1妈妈#1敬礼#4。
xiao3 ci4 wei5 xiang4 ma1 ma5 jing4 li3
小刺猬/n 向/p 妈妈/n 敬礼/v

Dataset Overview

Dataset Type

speech corpus
for TTS

Language

zh-CN,
Mandarin Chinese (China)

Speech Style

scripted monologue

Content

daily use sentences, fables, and stories

Audio Parameters

44.1 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Philips K38003

Recording Environment

quiet indoor
environment

License

Magic Data
open-source license

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email