Total Size: 97G

Sign In to Download.

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

mult-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

Popular Datasets

Open Source
6.13 hours
ASR Corpus
3.09 GB
Open Source
4.25 hours
ASR Corpus
378 MB
Open Source
5.08 hours
ASR Corpus
436.14 MB
Open Source
0.71 hours
ASR Corpus
62 MB
Open Source
10.43 hours
ASR Corpus
785 MB
Open Source
4.1 hours
ASR Corpus
355 MB
Open Source
Lexicon
16 KB
Third Party
ASR Corpus

CN-Celeb Speech Recognition Corpus

A Free Chinese Speaker Recognition Corpus Released by CSLT@Tsinghua University

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text
16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

mult-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

This is a large-scale speaker recognition dataset collected `in the wild’. The dataset consists of 3,000 Chinese celebrities, and covers 11 different genres in real world.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}