MagicData

Dataset Overview

Dataset Type: text corpus for NLP
Language: zh-CN
Speech Style: N/A
Content: Polyphony
Audio Parameters: N/A
File Format: TXT (UTF-8)
Recording Equipment: N/A
Recording Environment: N/A

Proprietary
NLP Corpus
244630 sentences

NLP-CPolyPhC: A Chinese Polyphone Corpus

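MDT-NLP-F025 | 244,630 sentences with 138 polyphonic characters in Chinese

This dataset consists of 244,630 Chinese sentences covering 138 polyphonic characters. In the samples below, each polyphonic character is followed by its pinyin reading, with a tone number, in parentheses.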

Contact business@magicdatatech.com to learn more.

Sample:

拍摄期间休杰克曼和他的4个特技替身演员总共用了700只爪(zhua3)子。
一遍思考,一边伸出小爪(zhua3)子试探。
不过确实是想不明白为啥是螃蟹的八个爪(zhua3)子。
跑步冰爪(zhao3)别让冬天把你限制在跑步机上3个月!
蝙蝠翅膀携带利爪(zhao3),大蝙蝠被捕蝙工具抓到后很容易喷血。
圣徒爆冷力克曼城秘籍超远距离吊射八爪(zhao3)鱼守门员。
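
The annotation format in the samples above can be read with a short parser. The following is a minimal sketch, not an official loader: it assumes each label is a single Chinese character followed directly by half-width parentheses containing lowercase pinyin and a tone digit from 1 to 5, which is inferred only from the six sample lines. The names `ANNOTATION` and `parse_line` are hypothetical.

```python
import re
from typing import List, Tuple

# Assumed pattern (inferred from the samples, not from a published spec):
# one CJK character, then "(" + lowercase pinyin + tone digit 1-5 + ")".
ANNOTATION = re.compile(r"([\u4e00-\u9fff])\(([a-zü:]+[1-5])\)")


def parse_line(line: str) -> Tuple[str, List[Tuple[int, str, str]]]:
    """Strip annotations from one line.

    Returns the plain sentence and a list of
    (index in plain sentence, character, pinyin) labels.
    """
    labels: List[Tuple[int, str, str]] = []
    parts: List[str] = []
    pos = 0      # read position in the annotated line
    out_len = 0  # length of the plain text built so far
    for m in ANNOTATION.finditer(line):
        # Keep the text before the match plus the labeled character itself.
        segment = line[pos:m.start()] + m.group(1)
        parts.append(segment)
        out_len += len(segment)
        labels.append((out_len - 1, m.group(1), m.group(2)))
        pos = m.end()  # skip over the "(pinyin)" part
    parts.append(line[pos:])
    return "".join(parts), labels


if __name__ == "__main__":
    plain, labels = parse_line("跑步冰爪(zhao3)别让冬天")
    print(plain)
    print(labels)
```

Storing the character index alongside the reading keeps the plain sentence usable as model input while the labels stay aligned to it. Lines that do not match the assumed pattern are returned unchanged with an empty label list.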
