Sign In to Download.

Dataset Overview

Dataset Type

text corpus for NLP

Language

yue-Guangdong

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A
Open Source
NLP Corpus
100 sentences

Guangzhou Cantonese Text Corpus

This dataset consists of 100 daily-use sentences in Guangzhou Cantonese.

This dataset consists of 100 daily-use sentences in Guangzhou Cantonese.

Sample:

Chinese Guangzhou Cantonese
你漫画看多了吧 你漫画睇多咗啊
没问道怎么说 冇问到哦点讲啊
写了我发现我好有文采 写咗嘞我发现我好有文采啊

Dataset Overview

Dataset Type

text corpus for NLP

Language

yue-Guangdong

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A
The dataset is provided on an "As Is" basis, and no warranty, either expressed or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages; including but not limited to, damages for loss of profits, goodwill, use, data or other intangible losses related to the datasets.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.

Similar datasets are available! Please feel free to CONTACT US if you have any questions or data requirements.
Sign In to Download.
{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}