Total Size: 311 MB

Dataset Overview

Dataset Type

ASR speech corpus

Language

id-ID, Indonesian (Indonesia)

Speech Style

scripted monologue

Content

daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF-8)

Recording Equipment

mobile

Recording Environment

indoor environment

Indonesian Scripted Speech Corpus – Daily Use Sentence

3.5 hours of transcribed Indonesian scripted speech on daily use sentences

This open-source dataset consists of 3.5 hours of transcribed Indonesian scripted speech focusing on daily use sentences. It contains 3,296 utterances contributed by ten speakers.

Sample:

“Sepanjang pertandingan, saya telah melihat banyak hal.”
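
Before training on the corpus, it can be useful to confirm that each recording matches the audio parameters listed in the overview (16 kHz, 16 bits, mono, PCM WAV) and that transcripts decode as UTF-8. The following is a minimal sketch using only the Python standard library. The function names and the file names in the usage note are hypothetical, and the layout of the downloaded archive is not described on this page, so paths will need adjusting.

```python
import wave
from pathlib import Path

# Expected values taken from the dataset overview: 16 kHz, 16 bits, mono.
EXPECTED_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1            # mono


def describe_wav(path):
    """Read the header of a PCM WAV file and return its basic parameters."""
    with wave.open(str(path), "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
            "duration_s": frames / rate if rate else 0.0,
        }


def matches_spec(info):
    """Return True if the parameters agree with the stated corpus format."""
    return (
        info["rate_hz"] == EXPECTED_RATE_HZ
        and info["sample_width_bytes"] == EXPECTED_SAMPLE_WIDTH_BYTES
        and info["channels"] == EXPECTED_CHANNELS
    )


def read_transcript(path):
    """Read a transcript file as UTF-8 and strip surrounding whitespace."""
    return Path(path).read_text(encoding="utf-8").strip()
```

With a hypothetical pair of files, `matches_spec(describe_wav("utt_0001.wav"))` reports whether the recording follows the stated format, and `read_transcript("utt_0001.txt")` returns its text. Summing `duration_s` over all recordings gives a total that can be compared against the stated 3.5 hours.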

The dataset is provided on an “As Is” basis, and no warranty, either express or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages, including but not limited to damages for loss of profits, goodwill, use, data or other intangible losses related to the datasets.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.

Similar datasets are available! Please feel free to CONTACT US if you have any questions or data requirements.
