Sign In to Download.

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

keyword spotting, command and query

Audio Parameters

48 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor environment
Proprietary
ASR Corpus
433 hours

Mandarin Chinese Scripted Speech Corpus

MDT-ASR-F058 | 433 hours of transcribed Mandarin Chinese scripted speech on keyword spotting, command and query

This open-source dataset consists of 433 hours of transcribed Mandarin Chinese Scripted Speech focusing on keyword spotting and command & Query contributed by 2,296 senior people.

Contact business@magicdatatech.com to learn more.

Sample:

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

keyword spotting, command and query

Audio Parameters

48 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor environment

License

Sign In to Download.
{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}