MagicData

sign in

Total Size: 414 MB

Dataset Overview

Dataset Type

ASR speech corpus

Language

tl-PH,
Filipino (Philippines)

Speech Style

scripted monologue

Content

daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

Magic Data
open-source license

Open Source
ASR Corpus
4.58 hours

ASR-SFDuSC: A Scripted Filipino Daily-use Speech Corpus

4.58 hours of transcribed Filipino scripted speech
on daily use sentences

This open-source dataset consists of 4.58 hours of transcribed Filipino scripted speech focusing on daily use sentences, where 4,073 utterances contributed by ten speakers were contained.

Sample:

"Kung kelan ako nagcollege saka naglabasan."

Dataset Overview

Dataset Type

ASR speech corpus

Language

tl-PH,
Filipino (Philippines)

Speech Style

scripted monologue

Content

daily use sentences

Audio Parameters

16 kHz, 16 bits, mono

File Format

WAV (PCM)
TXT (UTF8)

Recording Equipment

mobile

Recording Environment

indoor environment

License

Magic Data
open-source license

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email