Total Size: 15GB

Dataset Overview

Dataset Type

ASR speech corpus

Language

Mandarin Chinese (China)

Speech Style

scripted monologue

Content

everyday sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM), TXT (UTF-8)

Recording Equipment

Recording Environment

indoor environment
Third Party
ASR Corpus

Chinese Mandarin Speech Corpus from Aishell

178 hours of transcribed Mandarin Chinese scripted speech

This open-source dataset consists of 178 hours of transcribed Mandarin Chinese scripted speech contributed by 400 speakers.
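
The audio parameters listed in the overview (16 kHz, 16 bits, mono, WAV PCM) can be checked on a downloaded file before it is fed to an ASR pipeline. The following is a minimal sketch using only Python's standard `wave` module; the function names `wav_matches_spec` and `wav_duration_seconds` are hypothetical helpers introduced for illustration and are not part of the dataset or any toolkit.

```python
import wave

# Values taken from the "Audio Parameters" entry of the overview.
EXPECTED_RATE_HZ = 16000          # 16 kHz
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16 bits per sample
EXPECTED_CHANNELS = 1             # mono


def wav_matches_spec(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono.

    Hypothetical helper: the name and the return convention are
    assumptions made for this sketch.
    """
    with wave.open(path, "rb") as wav_file:
        return (
            wav_file.getframerate() == EXPECTED_RATE_HZ
            and wav_file.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav_file.getnchannels() == EXPECTED_CHANNELS
        )


def wav_duration_seconds(path):
    """Return the duration of the WAV file in seconds (frames / sample rate)."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()
```

Running `wav_matches_spec` over every file in a local copy, and summing `wav_duration_seconds`, is one way to confirm that the download is complete and in the advertised format.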
