MagicData

Total Size: 92 MB

Dataset Overview

Dataset Type

speech corpus
for TTS

Language

zh-CN,
Mandarin Chinese (China)

Speech Style

scripted monologue

Content

customer service language

Audio Parameters

48 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

Neumann U87 (microphone), Neve 1073 (preamplifier), RME Fireface (audio interface)

Recording Environment

recording studio

License

Magic Data
open-source license
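
The audio parameters listed above (48 kHz, 16 bits, mono, PCM WAV) can be checked against a downloaded file with Python's standard `wave` module. The following is a minimal sketch and not part of any Magic Data tooling: the function names `matches_spec` and `duration_seconds` are hypothetical, and the path to a file from the corpus has to be supplied by the reader.

```python
import wave

# Expected values taken from the "Audio Parameters" entry:
# 48 kHz sample rate, 16-bit samples (2 bytes), mono (1 channel).
EXPECTED_RATE_HZ = 48000
EXPECTED_SAMPLE_WIDTH_BYTES = 2
EXPECTED_CHANNELS = 1


def matches_spec(path):
    """Return True if the PCM WAV file at `path` matches the listed parameters."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )


def duration_seconds(path):
    """Return the duration of the WAV file at `path` in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

Calling `matches_spec` on each WAV file and summing `duration_seconds` over all of them gives a quick sanity check of a local copy against the figures stated on this page.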

Open Source
TTS Corpus

TTS-SCCusSerFSC: A Scripted Chinese Customer Service Female Speech Corpus

250 annotated utterances of a female voice in Mandarin Chinese,
suitable for text-to-speech synthesis in customer service scenarios

This open-source dataset contains 22 minutes of annotated female speech in Mandarin Chinese: 250 utterances recorded by a 22-year-old woman. It is intended for text-to-speech synthesis, especially in customer service scenarios.

Sample in finance:

您是#1想了解#1重大#1疾病#1保险#1A款吗#4?
nin2 shi4 xiang3 liao6 jie3 zhong4 da4 ji2 bing4 bao6 xian3 / EY1 / kuan3 ma5
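
The text line of the sample carries inline `#N` markers. This page does not define them; the sketch below assumes they are prosodic break levels attached to the chunk that precedes them, as is common in Mandarin TTS corpora. The function names `split_prosody` and `strip_marks` are hypothetical and not part of the dataset.

```python
import re

# A break marker is "#" followed by a single digit, e.g. "#1" or "#4".
BREAK_MARK = re.compile(r"#(\d)")


def split_prosody(annotated):
    """Split annotated text into (chunk, break_level) pairs plus trailing text.

    Assumption: each marker applies to the text immediately before it.
    """
    pairs = []
    pos = 0
    for match in BREAK_MARK.finditer(annotated):
        chunk = annotated[pos:match.start()]
        pairs.append((chunk, int(match.group(1))))
        pos = match.end()
    tail = annotated[pos:]  # whatever follows the last marker, e.g. punctuation
    return pairs, tail


def strip_marks(annotated):
    """Return the plain sentence with all break markers removed."""
    return BREAK_MARK.sub("", annotated)


sample = "您是#1想了解#1重大#1疾病#1保险#1A款吗#4?"
pairs, tail = split_prosody(sample)
for chunk, level in pairs:
    print(chunk, level)
print(strip_marks(sample))
```

Keeping the markers as a separate list of pairs, rather than discarding them, leaves the break levels available as training labels while `strip_marks` recovers the plain sentence.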

