MagicData

sign in

Dataset Overview

Dataset Type

ASR speech corpus

Language

ms-MY

Speech Style

Conversational

Content

Spontaneous Conversation

Audio Parameters

16 kHz, 16 bits

File Format

WAV TXT

Recording Equipment

mobile

Recording Environment

outdoor
Proprietary
ASR Corpus
372 hours

ASR-BigMalCSC: A Malay Conversational Speech Corpus

MDT-ASR-E079 | 372 hours of transcribed Malay conversational speech

This dataset consists of 372 hours of transcribed Malay Conversational Speech on certain topics contributed by 188 speakers.

Contact business@magicdatatech.com to learn more.

Dataset Overview

Dataset Type

ASR speech corpus

Language

ms-MY

Speech Style

Conversational

Content

Spontaneous Conversation

Audio Parameters

16 kHz, 16 bits

File Format

WAV TXT

Recording Equipment

mobile

Recording Environment

outdoor

License

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email