MagicData

sign in

Total Size: 202 MB

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN,
Mandarin Chinese (China)

Speech Style

spontaneous conversation

Content

conversations
(web meetings)

Audio Parameters

8 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

laptop and mobile

Recording Environment

indoor environment

License

Magic Data
open-source license

Open Source
ASR Corpus
5.2 hours

ASR-CCMeetingSC: A Chinese Conversational Meeting (Web) Speech Corpus

5.2 hours of transcribed Mandarin Chinese conversational speech
on web meetings

This open-source dataset consists of 5.2 hours of transcribed Mandarin Chinese conversational speech on web meetings between laptops and mobiles, where ten conversations were contained.

Sample:

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN,
Mandarin Chinese (China)

Speech Style

spontaneous conversation

Content

conversations
(web meetings)

Audio Parameters

8 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

laptop and mobile

Recording Environment

indoor environment

License

Magic Data
open-source license

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email