MagicData

Dataset Overview

Dataset Type: text corpus for NLP
Language: yue-Guangdong
Speech Style: N/A
Content: daily-use sentence
Audio Parameters: N/A
File Format: TXT (UTF-8)
Recording Equipment: N/A
Recording Environment: N/A
License: Magic Data open-source license

Open Source
NLP Corpus
100 sentences

NLP-CCantDuC: A Chinese Cantonese (Canton) Daily-use Corpus

This dataset consists of 100 daily-use sentences in Guangzhou Cantonese.

Sample:

Chinese                   Guangzhou Cantonese
你漫画看多了吧            你漫画睇多咗啊
没问道怎么说              冇问到哦点讲啊
写了我发现我好有文采      写咗嘞我发现我好有文采啊
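
The following is a minimal sketch of how such sentence pairs might be loaded in Python. It assumes each line of the TXT file holds one Chinese sentence and its Guangzhou Cantonese counterpart separated by whitespace; the actual layout of the distributed file is not documented here, and the function name `parse_pairs` is hypothetical.

```python
from typing import List, Tuple


def parse_pairs(text: str) -> List[Tuple[str, str]]:
    """Split each non-empty line into a (chinese, cantonese) pair.

    Assumption: the two sentences on a line are separated by whitespace
    and neither sentence contains whitespace itself.
    """
    pairs = []
    for line in text.splitlines():
        # Split on the first run of whitespace only.
        parts = line.strip().split(None, 1)
        if len(parts) == 2:
            pairs.append((parts[0], parts[1].strip()))
    return pairs


# The three sample lines from this page, used in place of the real file.
sample = (
    "你漫画看多了吧 你漫画睇多咗啊\n"
    "没问道怎么说 冇问到哦点讲啊\n"
    "写了我发现我好有文采 写咗嘞我发现我好有文采啊\n"
)

pairs = parse_pairs(sample)
for chinese, cantonese in pairs:
    print(chinese, "->", cantonese)
```

To read the real file, the text could be obtained with `open(path, encoding="utf-8").read()`, where `path` is wherever the downloaded TXT file was saved. Lines that do not split into exactly two fields are skipped rather than guessed at.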
