Sign In to Download.

Dataset Overview

Dataset Type

text corpus for NLP

Language

yue-Guangdong

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A

Popular Datasets

Proprietary
1602 hours
ASR Corpus
Proprietary
427 hours
ASR Corpus
Proprietary
1386 hours
ASR Corpus
Proprietary
313 hours
ASR Corpus
Open Source
NLP Corpus
2 KB
Proprietary
973 hours
ASR Corpus
Proprietary
65 hours
ASR Corpus
Open Source
NLP Corpus
100 sentences

Guangzhou Cantonese Text Corpus

This dataset consists of 100 daily-use sentences in Guangzhou Cantonese.

Dataset Overview

Dataset Type

text corpus for NLP

Language

yue-Guangdong

Speech Style

N/A

Content

daily-use sentence
N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A

This dataset consists of 100 daily-use sentences in Guangzhou Cantonese.

Sample:

Chinese Guangzhou Cantonese
你漫画看多了吧 你漫画睇多咗啊
没问道怎么说 冇问到哦点讲啊
写了我发现我好有文采 写咗嘞我发现我好有文采啊
The dataset is provided on an “As Is” basis, and no warranty, either expressed or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages; including but not limited to, damages for loss of profits, goodwill, use, data or other intangible losses related to the datasets.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.

Similar datasets are available! Please feel free to CONTACT US if you have any questions or data requirements.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}