MagicData
SIGN IN

Dataset Overview

Dataset Type

text corpus for NLP

Language

cmn-Wuhan

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A
Proprietary
NLP Corpus
108477 sentences

NLP-CWuhDiaParaC: A Chinese-Wuhan Dialect Parallel Corpus

MDT-NLP-F020 | 108,477 daily-use sentences in Wuhan dialect

This dataset consists of 108,477 daily-use sentences in Wuhan dialect.

Contact business@magicdatatech.com to learn more.

Sample:

ChineseWuhan Dialect
十二月是什么月十二月是么斯月
百家姓中复姓形都有什么百家姓豆里复姓姓都有么斯
外籍孩子在北京上学怎么办外籍的伢在北京上学么昂办
哪里有高一的试卷儿啊哪里有高一的试卷呐
动名词和不定式有什么不同动名词和不定式有么斯不同
皮肤没有光泽用什么护肤品皮肤冇得光泽用么斯护肤品

Dataset Overview

Dataset Type

text corpus for NLP

Language

cmn-Wuhan

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A

License

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email