MagicData
Dataset Overview

Dataset Type

N/A

Language

Chinese dialects

Speech Style

Scripted

Content

N/A

Audio Parameters

48 kHz, 16 bits

File Format

WAV (PCM)

Recording Equipment

microphone

Recording Environment

quiet indoor environment
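
The audio parameters listed above (48 kHz, 16 bits, WAV PCM) can be checked on a downloaded file with Python's standard `wave` module. The following is a minimal sketch, not an official tool from Magic Data: the function name `check_wav` and the constants are illustrative assumptions, and the channel count is not checked because the dataset card does not state it.

```python
import wave

# Values taken from the dataset card; the constant names are hypothetical.
EXPECTED_RATE_HZ = 48_000      # "48 kHz"
EXPECTED_SAMPLE_WIDTH = 2      # 16 bits = 2 bytes per sample


def check_wav(path):
    """Return (matches_card, duration_seconds) for one PCM WAV file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()       # samples per second
        width = wav.getsampwidth()      # bytes per sample
        frames = wav.getnframes()       # number of frames (samples per channel)
    matches = rate == EXPECTED_RATE_HZ and width == EXPECTED_SAMPLE_WIDTH
    return matches, frames / rate
```

Calling `check_wav` on each file of a subset and summing the returned durations gives a quick sanity check against the stated length of roughly 10 minutes per dialect.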
Open Source
TTS dataset
0.83 hours

MagicData-Dialect-TTS-Lite

Dataset Introduction

MagicData-Dialect-TTS-Lite is an open-source Chinese dialect TTS dataset collection released by Magic Data. It includes five Chinese dialect varieties: Northeastern Chinese, Henan Dialect, Sichuanese, Wu Chinese, and Cantonese.

The full collection contains approximately 50 minutes of speech data, recorded by five native dialect speakers aged between 30 and 60. Each dialect subset contains around 10 minutes of audio and is released as an independent open-source dataset.

If you are interested in a specific dialect, please click the corresponding dataset link below for more details.

Dataset Overview

Dialect Region       | City      | Code | Duration   | Sentences     | Speaker
Northeastern Chinese | Siping    | NED  | 10 minutes | 75 sentences  | 1 female, 30 years old
Henan Dialect        | Zhengzhou | HEN  | 10 minutes | 74 sentences  | 1 male, 34 years old
Sichuanese           | Chengdu   | SIC  | 10 minutes | 77 sentences  | 1 female, 40 years old
Wu Chinese           | Suzhou    | JSU  | 10 minutes | 102 sentences | 1 female, 50 years old
Cantonese            | Guangzhou | GUD  | 10 minutes | 54 sentences  | 1 female, 55 years old

Total: 50 minutes / 5 native dialect speakers
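
As a quick cross-check of the totals, the per-subset figures from the overview table can be summed directly. This is an illustrative sketch only: the variable names are hypothetical, and the durations are the rounded "10 minutes" values from the table, so the real audio length of each subset may differ slightly.

```python
# Rows copied from the overview table: (code, approx. minutes, sentences).
SUBSETS = [
    ("NED", 10, 75),
    ("HEN", 10, 74),
    ("SIC", 10, 77),
    ("JSU", 10, 102),
    ("GUD", 10, 54),
]

total_minutes = sum(minutes for _, minutes, _ in SUBSETS)
total_sentences = sum(sentences for _, _, sentences in SUBSETS)
total_hours = round(total_minutes / 60, 2)

print(total_minutes, total_hours, total_sentences)  # prints: 50 0.83 382
```

The 50-minute total matches the "0.83 hours" figure shown in the page header, and the collection holds 382 sentences across the five subsets.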

Dataset Links

Recommended Use

This dataset collection is suitable for:

  • Multi-dialect TTS research
  • Zero-shot / few-shot TTS baseline testing
  • Dialect acoustic analysis
  • Academic research and model evaluation

For detailed dataset features, annotation guidelines, and file structure, please refer to each individual dialect dataset page.

Open-source License

This dataset collection is released under the CC BY-NC-ND 4.0 license for non-commercial use only. It is suitable for academic research, personal development, and model evaluation.

📧 For the full commercial version, please contact: business@magicdatatech.com
