Total Size: 97G

Sign In to Download.

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

mult-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

Popular Datasets

Open Source
6.55 hours
ASR Corpus
406 MB
Proprietary
1602 hours
ASR Corpus
Proprietary
427 hours
ASR Corpus
Proprietary
1386 hours
ASR Corpus
Proprietary
313 hours
ASR Corpus
Open Source
NLP Corpus
2 KB
Proprietary
973 hours
ASR Corpus
Third Party
ASR Corpus

CN-Celeb Speech Recognition Corpus

A Free Chinese Speaker Recognition Corpus Released by CSLT@Tsinghua University

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text
16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

mult-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

This is a large-scale speaker recognition dataset collected `in the wild’. The dataset consists of 3,000 Chinese celebrities, and covers 11 different genres in real world.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}