Sign In to Download.

Dataset Overview

Dataset Type

ASR speech corpus

Language

en-SG

Speech Style

scripted

Content

keyword spotting

Audio Parameters

48 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor environment

License

Popular Datasets

Open Source
6.13 hours
ASR Corpus
3.09 GB
Open Source
4.25 hours
ASR Corpus
378 MB
Open Source
5.08 hours
ASR Corpus
436.14 MB
Open Source
0.71 hours
ASR Corpus
62 MB
Open Source
10.43 hours
ASR Corpus
785 MB
Open Source
4.1 hours
ASR Corpus
355 MB
Proprietary
ASR Corpus
28 hours

Singaporean English Scripted Speech Corpus – Keyword Spotting

MDT-ASR-F010 | 28 hours of transcribed Singaporean English scripted speech

Dataset Overview

Dataset Type

ASR speech corpus

Language

en-SG

Speech Style

scripted

Content

keyword spotting
48 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

microphone

Recording Environment

indoor environment

License

This dataset consists of 28 hours of transcribed Singaporean English scripted speech focusing on daily use sentences contributed by 197 speakers.

Contact business@magicdatatech.com to learn more.

Sample:

The dataset is provided on an “As Is” basis, and no warranty, either expressed or implied, is given. Your use of the dataset is at your sole risk. You expressly understand and agree that MagicHub and/or Beijing Magic Data Technology Co., Ltd. shall not be liable for any direct, indirect, incidental, special or consequential damages; including but not limited to, damages for loss of profits, goodwill, use, data or other intangible losses related to the datasets.

Copyright © 2021 Beijing Magic Data Technology Co., Ltd. All rights reserved.

Similar datasets are available! Please feel free to CONTACT US if you have any questions or data requirements.

Comments

{{ reviewsTotal }} Review
{{ reviewsTotal }} Reviews
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}