MagicData
SIGN IN

Dataset Overview

Dataset Type

text corpus for NLP

Language

wuu-Shanghai

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A
Proprietary
NLP Corpus
724186 sentences

NLP-CShhiParaDuSC: A Chinese-Shanghainese Parallel Daily-use Speech Corpus

MDT-NLP-F017 | 724,186 daily-use sentences in Shanghai dialect

This dataset consists of 724,186 daily-use sentences in Shanghai dialect.

Contact business@magicdatatech.com to learn more.

Sample:

ChineseShanghai Dialect
没有这么晚还没睡觉啊吾没介晚还没困觉啊
你是你发错了是么侬是啥拧侬发错了是伐
想让你开心一下吗想让侬开心一记伐
哈哈现在好开心哦哈哈现在老开心哦
查了没过啊我就郁闷了查过了没吾就郁闷了
明天去的话早点叫我明朝去呃闲话早点叫吾

Dataset Overview

Dataset Type

text corpus for NLP

Language

wuu-Shanghai

Speech Style

N/A

Content

daily-use sentence

Audio Parameters

N/A

File Format

TXT (UTF8)

Recording Equipment

N/A

Recording Environment

N/A

License

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email