MagicData

Total Size: 15GB

Dataset Overview

Dataset Type

ASR speech corpus

Language

Mandarin Chinese (China)

Speech Style

scripted monologue

Content

daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM), TXT (UTF-8)

Recording Equipment

Recording Environment

indoor environment
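
The audio parameters listed above (16 kHz, 16 bits, mono, PCM WAV) can be checked against a downloaded file before training. The following is a minimal sketch using only Python's standard `wave` module; the function name `check_wav_format` and the `EXPECTED` table are illustrative assumptions and are not part of the dataset's own tooling.

```python
import wave

# Expected header values, taken from the "Audio Parameters" entry:
# 16 kHz sample rate, 16-bit samples (2 bytes), mono (1 channel).
EXPECTED = {"framerate": 16000, "sampwidth": 2, "nchannels": 1}


def check_wav_format(path):
    """Return a dict of mismatched fields as name -> (actual, expected).

    An empty dict means the file header matches the listed parameters.
    Hypothetical helper for illustration only.
    """
    with wave.open(path, "rb") as wf:
        actual = {
            "framerate": wf.getframerate(),
            "sampwidth": wf.getsampwidth(),
            "nchannels": wf.getnchannels(),
        }
    return {
        name: (actual[name], expected)
        for name, expected in EXPECTED.items()
        if actual[name] != expected
    }
```

Calling `check_wav_format("some_utterance.wav")` (the file name is a placeholder) returns an empty dict when the header agrees with the listing; any other result names the fields that differ, which may indicate a resampled or re-encoded copy.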
Third Party

ASR-AIShell-MCSC: A Mandarin Chinese Speech Corpus from AISHELL

178 hours of transcribed Mandarin Chinese scripted speech

This open-source dataset consists of 178 hours of transcribed Mandarin Chinese scripted speech contributed by 400 speakers.
