MagicData
SIGN IN

Total Size: 97G

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

mult-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

License

Attribution-ShareAlike 4.0 International

Third Party
ASR Corpus

ASR-CCelebSC: A Chinese Celebrities' Speech Corpus

A Free Chinese Speaker Recognition Corpus Released by CSLT@Tsinghua University

This is a large-scale speaker recognition dataset collected `in the wild'. The dataset consists of 3,000 Chinese celebrities, and covers 11 different genres in real world.

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

mult-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

License

Attribution-ShareAlike 4.0 International

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email