Dataset Overview

Dataset Type

ASR speech corpus

Language

cmn-Zhengzhou

Speech Style

scripted

Content

daily-use sentence

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

Popular Datasets

Proprietary
1602 hours
ASRデータセット
Proprietary
427 hours
ASRデータセット
Proprietary
1386 hours
ASRデータセット
Proprietary
313 hours
ASRデータセット
Open Source
NLPコーパス
2 KB
Proprietary
65 hours
ASRデータセット
Proprietary
973 hours
ASRデータセット
Proprietary
ASRデータセット
703 hours

Zhengzhou Dialect Scripted Speech Corpus

MDT-ASR-E005 | 703 hours of transcribed Zhengzhou dialect scripted speech on daily use sentences

Dataset Overview

Dataset Type

ASR speech corpus

Language

cmn-Zhengzhou

Speech Style

scripted

Content

daily-use sentence
16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

This dataset consists of 703 hours of transcribed Zhengzhou dialect scripted speech focusing on daily use sentences contributed by 1055 speakers.

連絡先 business@magicdatatech.com to learn more.

Sample:

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}