Dataset Overview

Dataset Type

speech corpus for TTS

Language

zh&en-CN

Speech Style

Scripted

Content

daily-use sentence

Audio Parameters

48 kHz, 16 bits

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

Neumann U87-Neve 1073-RME Fireface

Recording Environment

recording studio

Popular Datasets

Proprietary
1602 hours
ASRデータセット
Proprietary
427 hours
ASRデータセット
Proprietary
1386 hours
ASRデータセット
Proprietary
313 hours
ASRデータセット
Open Source
NLPコーパス
2 KB
Proprietary
973 hours
ASRデータセット
Proprietary
65 hours
ASRデータセット
Proprietary
TTSデータセット
28 hours

Mandarin Chinese Speech Corpus for TTS — from Non-Voice Actors

MDT-TTS-F003 | 30,000 utterances of annotated male and female voices in Mandarin Chinese and Chinese Engish applicable for Text-to-Speech Synthesis

Dataset Overview

Dataset Type

speech corpus for TTS

Language

zh&en-CN

Speech Style

Scripted

Content

daily-use sentence
48 kHz, 16 bits

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

Neumann U87-Neve 1073-RME Fireface

Recording Environment

recording studio

License

This dataset consists of 28 hous of annotated male & female voices in Mandarin Chinese and Chinese English that is applicable for Text-to-Speech Synthesis, where 30,000 utterances contributed by 25 men and 25 women were contained.

連絡先 business@magicdatatech.com to learn more.

Sample:

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}