Total Size: 15GB

Sign In to Download.

Dataset Overview

Dataset Type

ASR speech corpus

Language

Mandarin Chinese (China)

Speech Style

scripted monologue

Content

daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Recording Environment

indoor environment

Popular Datasets

Third Party
ASR Corpus

Chinese Mandarin Speech Corpus from Aishell

178 hours of transcribed Mandarin Chinese scripted speech

Dataset Overview

Dataset Type

ASR speech corpus

Language

Mandarin Chinese (China)

Speech Style

scripted monologue

Content

daily use sentences
16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Recording Environment

indoor environment

This open-source dataset consists of 178 hours of transcribed Mandarin Chinese scripted speech contributed by 400 speakers.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}