Dataset Overview

Dataset Type

ASR speech corpus

Language

cmn-Zhengzhou

Speech Style

conversational

Content

spontaneous conversation

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

Popular Datasets

Proprietary
ASR Corpus
500 hours

Zhengzhou Dialect Conversational Speech Corpus

MDT-ASR-F068 | 500 hours of transcribed Zhengzhou dialect conversational speech

Dataset Overview

Dataset Type

ASR speech corpus

Language

cmn-Zhengzhou

Speech Style

conversational

Content

spontaneous conversation
16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

This dataset consists of 500 hours of transcribed Zhengzhou dialect spontaneous speech contributed by 274 speakers.

Contact business@magicdatatech.com to learn more.

Sample:

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}