MagicData
SIGN IN

Total Size: 8.3GB

Dataset Overview

Dataset Type

Language

Chinese

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

Apache License v.2.0

Third Party
ASR Corpus

ASR-THCHS30-CSC: A Chinese Speech Corpus from Tsinghua University

About this resource:

THCHS30 is an open Chinese speech database published by Center for Speech and Language Technology (CSLT) at Tsinghua University. The origional recording was conducted in 2002 by Dong Wang, supervised by Prof. Xiaoyan Zhu, at the Key State Lab of Intelligence and System, Department of Computer Science, Tsinghua Universeity, and the original name was 'TCMSD', standing for 'Tsinghua Continuous Mandarin Speech Database'. The publication after 13 years has been initiated by Dr. Dong Wang and was supported by Prof. Xiaoyan Zhu. We hope to provide a toy database for new researchers in the field of speech recognition. Therefore, the database is totally free to academic users. You can cite the data using the following BibTeX entry:

@misc{THCHS30_2015,
  title={THCHS-30 : A Free Chinese Speech Corpus},
  author={Dong Wang, Xuewei Zhang, Zhiyong Zhang},
  year={2015},
  url={http://arxiv.org/abs/1512.01882}
}

PEOPLE

Dong Wang, Xuewei Zhang, Zhiyong Zhang @CSLT, Tsinghua Univ.

CONTACTOR

ROOM1-303, BLDG FIT

CSLT, Tsinghua University

http://cslt.org

http://cslt.riit.tsinghua.edu.cn

Dataset Overview

Dataset Type

Language

Chinese

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

Apache License v.2.0

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email