MagicData
SIGN IN

Total Size: 4.2G

概览

数据集类型

语音识别(ASR)音频数据集

语种

英语和捷克语

语音类型

自由对话

内容

给定主题的对话

音频参数

文件格式

WAV (PCM) TXT (UTF8)

录音设备

录音环境

/

授权方式

Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0 US)

第三方
ASR数据集

ASR-Vystadial: An English and Czech Telephone Conversational Corpus from the Vystadial Project

此数据集包含41小时的英语音频和15小时的捷克语音频,附有转写文本。

About this resource:

This data is transcribed from telephone conversation data, in English and Czech.

The data collection process and development of these training scripts were partly funded by the Ministry of Education, Youth and Sports of the Czech Republic under the grant agreement LK11221 and core research funding from Charles University in Prague.

You can cite the data using the following BibTeX entry:

@inproceedings{korvas_2014,
  title={{Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license}},
  author={Korvas, Mat\v{e}j and Pl\'{a}tek, Ond\v{r}ej and Du\v{s}ek, Ond\v{r}ej and \v{Z}ilka, Luk\'{a}\v{s} and Jur\v{c}\'{i}\v{c}ek, Filip},
  booktitle={Proceedings of the Eigth International Conference on Language Resources and Evaluation (LREC 2014)},
  pages={To Appear},
  year={2014},
}

概览

数据集类型

语音识别(ASR)音频数据集

语种

英语和捷克语

语音类型

自由对话

内容

给定主题的对话

音频参数

文件格式

WAV (PCM) TXT (UTF8)

录音设备

录音环境

/

授权方式

Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0 US)

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}评论
写评论
*访客无法进行评论

Verifying Email