Total Size: 97G

Dataset Overview

Dataset Type

speaker recognition corpus

Language

Chinese

Speech Style

multi-media sources

Content

free text

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM) TXT (UTF8)

Recording Equipment

Bilibili, Changba, Himalaya, NetEase Cloud, Tik Tok

Recording Environment

multi-genre: Advertisement, Drama, Entertainment, Interview, Live Broadcast, Movie, Play, Recitation, Singing, Speech, Vlog

CN-Celeb Speaker Recognition Corpus

A Free Chinese Speaker Recognition Corpus Released by CSLT@Tsinghua University

This is a large-scale speaker recognition dataset collected "in the wild". It contains speech from 3,000 Chinese celebrities and covers 11 different real-world genres.
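
After downloading, it can be useful to confirm that an audio file matches the audio parameters listed in the overview (16 kHz, 16 bits, mono, PCM WAV). The following is a minimal sketch using only the Python standard library. The function name `wav_matches_spec` and the constant names are hypothetical, chosen for illustration, and the corpus's directory layout is not assumed here.

```python
import wave

# Values taken from the "Audio Parameters" entry of the overview.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1            # mono


def wav_matches_spec(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono."""
    # wave.open raises wave.Error for malformed files or encodings
    # that the standard-library reader does not support.
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
```

Calling `wav_matches_spec("some_file.wav")` on each file is a cheap sanity check before feature extraction; files that return False may need resampling or channel mixing.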
