MagicData
SIGN IN

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

digits

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor environment
Proprietary
ASRデータセット
117 hours

ASR-SCDgSC: A Scripted Chinese Digits Speech Corpus

MDT-ASR-C007 | 117 hours of transcribed Mandarin Chinese scripted speech on digits

This dataset consists of 117 hours of transcribed Mandarin Chinese scripted speech on digits contributed by 1,465 speakers.

連絡先 business@magicdatatech.com to learn more.

Sample:

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN

Speech Style

Scripted

Content

digits

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor environment

License

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email