MagicData

sign in

Dataset Overview

Dataset Type

ASR speech corpus

Language

cmn-Northeast

Speech Style

Conversational

Content

Spontaneous Conversation

Audio Parameters

16 kHz, 16 bits

File Format

WAV TXT

Recording Equipment

mobile

Recording Environment

indoor
Proprietary
ASR Corpus
160 hours

ASR-CNEAcstCSC: A Chinese Northeastern Accent Conversational Speech Corpus

MDT-ASR-E047 | MDT-ASR-E054 | 185 hours of transcribed Northeastern Mandarin Scripted Speech

This dataset consists of 185 hours of transcribed Northeastern Mandarin Conversational Speech on certain topics

Contact business@magicdatatech.com to learn more.

Dataset Overview

Dataset Type

ASR speech corpus

Language

cmn-Northeast

Speech Style

Conversational

Content

Spontaneous Conversation

Audio Parameters

16 kHz, 16 bits

File Format

WAV TXT

Recording Equipment

mobile

Recording Environment

indoor

License

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email