1663323011-logo2022

sign in

Total Size: 908 MB

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN, Mandarin Chinese (China)

Speech Style

spontaneous conversation

Content

conversational speech on interview and self-media

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF-8)

Recording Equipment

mobile

Recording Environment

indoor environment
Open Source
ASR Corpus
8 hours

Mandarin Chinese Conversational Speech Corpus - Test Set

This open-source dataset consists of 8 hours of transcribed Mandarin Chinese conversational speech on interview and self-media, where 26 pieces of conversations are contained.

Sample:

Dataset Overview

Dataset Type

ASR speech corpus

Language

zh-CN, Mandarin Chinese (China)

Speech Style

spontaneous conversation

Content

conversational speech on interview and self-media

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF-8)

Recording Equipment

mobile

Recording Environment

indoor environment
{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}