Dataset Overview

Dataset Type

text corpus for NLP

Language

nan-Fujian

Speech Style

N/A

Content

daily-use sentences

Audio Parameters

N/A

File Format

TXT (UTF-8)

Recording Equipment

N/A

Recording Environment

N/A

Proprietary
NLP Corpus
535,270 sentences

Minnan Text Corpus (Parallel Corpus)

MDT-NLP-F018 | 535,270 daily-use sentences in Minnan dialect

License

This parallel corpus consists of 535,270 daily-use sentences in the Minnan dialect, paired with Chinese text as in the sample below.

Contact business@magicdatatech.com to learn more.

Sample:

Chinese | Minnan Dialect
我想和你好好聊聊可以给个机会吗 | 阮想甲汝好好聊聊会使诶互阮一个机会呣
没事不用管他 | 无要紧毋免管伊
呵呵想知道我是谁阿 | 呵呵想知影阮是啥侬阿
不行啦我现在真的睡了 | 毋行啦阮即阵真诶困了
你晚上到底回不回来了啊 | 汝明晚上到底倒来毋倒来啊
你为什么要这样做 | 汝为虾米要安尼做
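
The listing does not describe how the two columns are laid out inside the delivered TXT files. As a rough sketch only, assuming one sentence pair per line with a tab between the Chinese and Minnan text (the separator and the helper name `read_parallel_pairs` are both hypothetical, not part of the dataset documentation), the pairs could be read like this:

```python
import io


def read_parallel_pairs(lines, sep="\t"):
    """Yield (chinese, minnan) tuples from lines shaped like 'chinese<sep>minnan'.

    The tab separator is an assumption; adjust `sep` to match the real files.
    """
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        parts = line.split(sep)
        if len(parts) != 2:
            continue  # skip lines that do not hold exactly one pair
        yield parts[0], parts[1]


# Two pairs taken from the sample above, joined with the assumed tab separator.
sample = "没事不用管他\t无要紧毋免管伊\n你为什么要这样做\t汝为虾米要安尼做\n"
pairs = list(read_parallel_pairs(io.StringIO(sample)))
```

For a file on disk, the same function accepts a handle opened with `open(path, encoding="utf-8")`, which matches the UTF-8 encoding stated in the overview.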
