Total Size: 913MB

Dataset Overview

Dataset Type

ASR

Language

Iban

Speech Style

Content

News

Audio Parameters

File Format

Recording Equipment

Recording Environment

Iban Speech Corpora for ASR

This package contains Iban language text and speech suitable for Automatic Speech Recognition (ASR) experiments. In addition to the transcribed speech, a 2M-token text corpus crawled from an online newspaper site is provided. The news data was provided by a local radio station in Sarawak, Malaysia.

The dataset is provided on an “As Is” basis, and no warranty, either expressed or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages, including but not limited to damages for loss of profits, goodwill, use, data or other intangible losses related to the dataset.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.
