Here we present a conversational dataset in Mandarin Chinese, code-mixed with English words and phrases.
The total duration of the original dataset is about 22.54 hours, with an effective duration of about 9.57 hours. We split the dataset into two parts: the DEV set and the test set.
Only the DEV part, with a total duration of about 12 hours, is released here for open access. The dataset contains audio files (.wav) with segments and manually annotated transcriptions.
The audio data was collected from 10 participants (4 male and 6 female) aged 21 to 25. In total, 42 audio recordings were collected, corresponding to 42 annotated texts.
In our own testing and evaluation of this set, the word correct rate is above 99%.
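For readers unfamiliar with the metric, word correct rate is commonly defined as H / N, where N is the number of reference tokens and H is the number of them matched in an alignment with the compared text. The sketch below is only an illustration of that common definition, not a description of how the figure above was obtained. The tokenization rule (one token per Chinese character, one token per English word, case-insensitive) and the function names `tokenize` and `word_correct_rate` are assumptions made for this example.

```python
import re

# Assumed tokenization for code-mixed text: each CJK character is one token,
# each run of Latin letters/digits (optionally with an apostrophe part) is one token.
_TOKEN_RE = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9]+(?:'[A-Za-z]+)?")


def tokenize(text):
    """Split mixed Mandarin-English text into lowercase tokens."""
    return [t.lower() for t in _TOKEN_RE.findall(text)]


def word_correct_rate(reference, hypothesis):
    """Return H / N using a minimum-edit-distance alignment.

    Among alignments with the same minimum cost, the one with the most
    matching tokens is chosen, so the result is deterministic.
    """
    ref, hyp = tokenize(reference), tokenize(hypothesis)
    if not ref:
        raise ValueError("reference has no tokens")

    # prev[j] = (edit cost, hits) for aligning ref[:i-1] with hyp[:j]
    prev = [(j, 0) for j in range(len(hyp) + 1)]
    for i in range(1, len(ref) + 1):
        curr = [(i, 0)]
        for j in range(1, len(hyp) + 1):
            match = ref[i - 1] == hyp[j - 1]
            diag_cost, diag_hits = prev[j - 1]
            candidates = [
                # match or substitution (hits negated so min() prefers more hits)
                (diag_cost + (0 if match else 1), -(diag_hits + (1 if match else 0))),
                # deletion: a reference token has no counterpart
                (prev[j][0] + 1, -prev[j][1]),
                # insertion: an extra token appears in the hypothesis
                (curr[j - 1][0] + 1, -curr[j - 1][1]),
            ]
            cost, neg_hits = min(candidates)
            curr.append((cost, -neg_hits))
        prev = curr

    hits = prev[-1][1]
    return hits / len(ref)


if __name__ == "__main__":
    ref = "我今天要去 meeting 你呢"   # 8 tokens under the assumed tokenization
    hyp = "我今天去 meeting 你呢"     # one character missing
    print(word_correct_rate(ref, hyp))  # 0.875
```

Note that this metric ignores inserted tokens, so it can be higher than word accuracy computed as 1 minus the word error rate.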
Before accessing this dataset, please note our usage agreement.
Recommended applications: ASR, chatbots, TTS, low-resource research