MagicData
SIGN IN

Dataset Overview

Dataset Type

ASR speech corpus

Language

en-CN

Speech Style

35

Content

daily-use sentence (rated)

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment
Proprietary
ASR Corpus
35 hours

ASR-SESEvalC: A Scripted English Speech Evaluation Corpus (by Chinese Speakers)

MDT-ASR-C015 | MDT-ASR-E013 | 35 hours of transcribed Mandarin Chinese scripted speech on daily use sentences (rated)

This dataset consists of 35 hours of transcribed Mandarin Chinese scripted speech focusing on daily-use sentences (rated) contributed by 2055 speakers.

Contact business@magicdatatech.com to learn more.

Sample:

Dataset Overview

Dataset Type

ASR speech corpus

Language

en-CN

Speech Style

35

Content

daily-use sentence (rated)

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email