Total Size: 5.4 GB

Dataset Overview

Dataset Type: ASR speech corpus
Language: zh-CN, Mandarin Chinese (China)
Speech Style: spontaneous conversation
Content: themed conversations
Audio Parameters: 16 kHz, 16 bits, 6 channels
File Format: WAV (PCM), TXT (UTF-8)
Recording Equipment: microphone, mobile, and recorder
Recording Environment: indoor environment
License: Magic Data open-source license


ASR-MultiDeviCCSC: A Chinese Conversational Speech Corpus

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

10 hours of transcribed Mandarin Chinese conversational speech collected by multiple devices

This open-source dataset contains 10 hours of transcribed Mandarin Chinese conversational speech on selected topics. It comprises 360 conversations contributed by 30 pairs of speakers, recorded as six-channel audio collected by three devices.

Audio samples are available for the iOS phone, Android phone, recorder, and microphone.
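
The audio parameters listed in the Dataset Overview (16 kHz, 16 bits, 6 channels, WAV PCM, UTF-8 transcripts) can be checked after download with a short script. The following is a minimal sketch using only the Python standard library. The function names, the `EXPECTED` table, and the assumption that all six channels live in a single WAV file are illustrative and not taken from the dataset documentation; if the channels are distributed as separate files, the channel check will not match. Python versions before 3.12 may also reject WAV files that use the extensible header format.

```python
import wave

# Values copied from the Dataset Overview. Treating them as a per-file
# requirement is an assumption made for this sketch.
EXPECTED = {"sample_rate_hz": 16000, "bits_per_sample": 16, "channels": 6}


def describe_wav(path):
    """Read a WAV header and return its basic audio parameters."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "sample_rate_hz": rate,
            "bits_per_sample": wav.getsampwidth() * 8,  # bytes to bits
            "channels": wav.getnchannels(),
            "duration_s": frames / rate,
        }


def matches_overview(info):
    """Return True if the file matches the parameters in EXPECTED."""
    return all(info[key] == value for key, value in EXPECTED.items())


def read_transcript(path):
    """Read a transcript file, decoding it as UTF-8."""
    with open(path, encoding="utf-8") as handle:
        return handle.read()
```

Calling `describe_wav` on a downloaded recording and passing the result to `matches_overview` gives a quick sanity check before any training or evaluation pipeline is built on top of the files.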


