MagicData

Total Size: 60

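Overview

Dataset type

Automatic speech recognition (ASR) audio dataset

Language

Hebrew

Speech type

Content

Audio parameters

File format

wav

Recording device

Recording environment

License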

No formal license but free to use for any purpose.

Third party
ASR dataset

ASR-BiAnHebSC: A Binary Answers Hebrew Speech Corpus

About this resource:

This dataset was created for the Kaldi project (see kaldi.sf.net), by a contributor who prefers to remain anonymous. The main point of the dataset is to provide an easy and fast way to test out the Kaldi scripts for free.

The archive "waves_yesno.tar.gz" contains 60 .wav files, sampled at 8 kHz. All were recorded by the same male speaker, in Hebrew. In each file, the speaker says 8 words; each word is the Hebrew for either "yes" or "no", so each file is a random sequence of 8 yeses and noes. No separate transcription is provided; the sequence is encoded in the filename, with 1 for yes and 0 for no, for instance:

# tar -xvzf waves_yesno.tar.gz
waves_yesno/1_0_1_1_1_0_1_0.wav
waves_yesno/0_1_1_0_0_1_1_0.wav
...
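
Because the transcription lives only in the filename, a small helper can recover it. The following is a minimal sketch in Python, not part of the original resource: the function names `labels_from_filename` and `transcript_from_filename` are hypothetical, and the "YES"/"NO" tokens are an assumed convention for the output text.

```python
from pathlib import Path


def labels_from_filename(path):
    """Return the eight 0/1 labels encoded in a waves_yesno file name.

    Hypothetical helper: assumes names like "1_0_1_1_1_0_1_0.wav",
    where 1 means "yes" and 0 means "no".
    """
    stem = Path(path).stem  # file name without directory or ".wav"
    labels = [int(token) for token in stem.split("_")]
    if len(labels) != 8 or any(label not in (0, 1) for label in labels):
        raise ValueError(f"unexpected file name: {path}")
    return labels


def transcript_from_filename(path):
    """Map the labels to a space-separated transcript (assumed tokens)."""
    return " ".join("YES" if label else "NO" for label in labels_from_filename(path))


print(transcript_from_filename("waves_yesno/1_0_1_1_1_0_1_0.wav"))
# prints: YES NO YES YES YES NO YES NO
```

Applying the helper to every file in the extracted `waves_yesno` directory would yield one transcript per recording, which is the kind of label file an ASR recipe typically expects.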
