来源类型

数据类型

语种

内容类型

重口音

语速

行业

场景

published at December 15, 2021
开源数据集
ASR数据集
6 hours
This open-source dataset consists of 6 hours of transcribed Mandarin Chinese scripted speech of keyword spotting in fast, normal, and slow speed, where 11,030 utterances contributed by 37 speakers were contained.
published at December 15, 2021
开源数据集
NLP语料库
100 sentences
This dataset consists of 100 daily-use sentences in Guangzhou Cantonese.
published at December 15, 2021
开源数据集
NLP语料库
100 sentences
100 paragraphs
This dataset contains 100 pieces of news.
published at November 24, 2021
Magic Data自有数据集
NLP语料库
12,600句
MDT-NLP-F027 | 12,600条中文金融领域客服场景文本语料
published at November 24, 2021
Magic Data自有数据集
NLP语料库
330,000句
MDT-NLP-F026 | 330,000句中文韵律标注语料
published at November 24, 2021
Magic Data自有数据集
NLP语料库
244,630句
MDT-NLP-F025 | 244,630句共包含138个多音字的中文语料
published at November 24, 2021
Magic Data自有数据集
NLP语料库
100,736句
MDT-NLP-F024 | 100,736条中文TN正则文本语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
2480组
MDT-NLP-F023 | 2,480句中文人机交互语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
828,114句
MDT-NLP-F017 | 828,114条广式粤语日常用语语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
2,095,686句
MDT-NLP-F016 | 2,095,686句中文聊天语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
750,194句
MDT-NLP-F015 | 750,194条中文地标地址文本语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
613,482句
MDT-NLP-F014 | 613,482句中文通讯类语料
published at
Magic Data自有数据集
NLP语料库
127,035句
MDT-NLP-F013 | 127,035条中导航语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
15,264句
MDT-NLP-F012 | 15,264句中文智能家居命令控制语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
357,468句
MDT-NLP-F011 | 357,486句播放音乐相关语料
published at November 23, 2021
Magic Data自有数据集
NLP语料库
10,488句
MDT-NLP-F010 | 10,488句中文人机交互语料
published at November 12, 2021
开源数据集
ASR数据集
3.23小时
总时长为3.23小时的中文普通话朗读数据集和转写文本,有快—中—慢三种语速
published at November 12, 2021
开源数据集
NLP语料库
600条中文金融领域客服场景文本语料
published at November 4, 2021
Magic Data自有数据集
TTS数据集
40小时
MDT-TTS-D003 | 21,343条适用于语音合成的中文女声情感标注语音
published at November 4, 2021
Magic Data自有数据集
TTS数据集
1 hours
MDT-TTS-D007 | 697条适用于语音合成的中文普通话女声标注语音