Datasets
Chinese

Chinese Financial Customer Service Text Corpus
Category : NLPコーパス
Datasets Source : MagicData
Language : zh-CN
Content : customer service
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Mandarin Chinese Prosody Text Corpus
Category : NLPコーパス
Datasets Source : MagicData
Language : zh-CN
Content : prosody annotaion
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Mandarin Chinese Polyphone Text Corpus
Category : NLPコーパス
Datasets Source : MagicData
Language : zh-CN
Content : Polyphony
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Mandarin Chinese Text Normalization Text Corpus
Category : NLPコーパス
Datasets Source : MagicData
Language : zh-CN
Content : text normalization
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Mandarin Chinese Human–Computer Interaction Text Corpus
Category : NLPコーパス
Datasets Source : MagicData
Language : zh-CN
Content : human–computer interaction
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Wuhan Text Corpus (Parallel Corpus)
Category : NLPコーパス
Datasets Source : MagicData
Language : cmn-Wuhan
Content : daily-use sentence
Tags : 中国語の方言
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Minnan Text Corpus (Parallel Corpus)
Category : NLPコーパス
Datasets Source : MagicData
Language : nan-Fujian
Content : daily-use sentence
Tags : 中国語の方言
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Shanghai Text Corpus (Parallel Corpus)
Category : NLPコーパス
Datasets Source : MagicData
Language : wuu-Shanghai
Content : daily-use sentence
Tags : 中国語の方言
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Guangzhou Cantonese Text Corpus (Parallel Corpus)
Category : NLPコーパス
Datasets Source : MagicData
Language : yue-Guangdong
Content : daily-use sentence
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.
Chinese Chatting Text Corpus
Category : NLPコーパス
Datasets Source : MagicData
Language : zh-CN
Content : Chatting
Tags : マンダリン
Size : Not Described.
File Format : TXT (UTF8)
License : Not Described.