Sign In to Download.

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

Daily-Use Sentence

Audio Parameters

16 kHz, 16 bits, 3 channels

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone, mobile & bluetooth headset

Recording Environment

indoor, outdoors, in-vehicle, public place

Popular Datasets

Proprietary
ASR Corpus
506 hours

Mandarin Chinese Scripted Speech Corpus – Daily-Use Sentence

MDT-ASR-F001 | 506 hours of transcribed Mandarin Chinese scripted speech on daily use sentences

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

Daily-Use Sentence
16 kHz, 16 bits, 3 channels

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone, mobile & bluetooth headset

Recording Environment

indoor, outdoors, in-vehicle, public place

License

This dataset consists of 506 hours of transcribed Mandarin Chinese scripted speech focusing on daily use sentences contributed by 768 speakers.

Contact business@magicdatatech.com to learn more.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}