Sign In to Download.

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

command and queries

Audio Parameters

48 kHz, 16 bits, dual

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor far/near field

Popular Datasets

Proprietary
ASR Corpus
1812 hours

Mandarin Chinese Far-Field Scripted Speech Corpus – Command and Query

MDT-ASR-D025 | MDT-ASR-E001 | 1,812 hours of transcribed Mandarin Chinese scripted speech on command and query

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

command and queries
48 kHz, 16 bits, dual

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor far/near field

License

This datasets collection consists of 1,812 hours of transcribed Mandarin Chinese scripted speech on command and query contributed by 2,200 speakers.

Contact business@magicdatatech.com to learn more.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}