MagicData
SIGN IN

Total Size: 151 MB

Dataset Overview

Dataset Type

ASR speech corpus

Language

en-CN,
English (China)

Speech Style

scripted monologue

Content

words, phrases, and daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

Magic Data
open-source license

Open Source
ASR Corpus
1.44 hours

ASR-SCEChilSC: A Scripted Chinese English Children's Speech Corpus

1.44 hours of transcribed Chinese English scripted speech from children

This open-source dataset consists of 1.44 hours of transcribed Chinese English scripted speech from children, where 2,266 utterances contributed by ten speakers, aged 7 or less, were contained.

Sample:

"Let's go swimming!"

Dataset Overview

Dataset Type

ASR speech corpus

Language

en-CN,
English (China)

Speech Style

scripted monologue

Content

words, phrases, and daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

Magic Data
open-source license

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email