MagicData
SIGN IN

Dataset Overview

Dataset Type

Language

Speech Style

Content

Amharic, Swahili and Wolof data, mirrored from the ALFFA git repository

Audio Parameters

File Format

Recording Equipment

Recording Environment

Third Party
ASR Corpus

ASR-ALFFA: African Languages in the Field (Speech Fundamentals and Automation)

About this resource:

This data is transcribed speech data, in Amharic and Swahili and Wolof.

This repository is a result of the ALFFA project http://alffa.imag.fr
A summary of these resources and ASR performances, as well as a description of the ALFFA project has been published in the following paper:

Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of Wolof.
Elodie Gauthier, Laurent Besacier, Sylvie Voisin, Michael Melese and Uriel Pascal Elingui.
To appear at LREC 2016.

So far, the ASR directory contains Kaldi recipes for 4 languages: Amharic, Swahili, Hausa and Wolof.

  • AMHARIC
  • SWAHILI
  • WOLOF

External URLs:
https://github.com/besacier/ALFFA_PUBLIC/tree/master/ASR/AMHARIC   (Amharic data )
https://github.com/besacier/ALFFA_PUBLIC/tree/master/ASR/SWAHILI   (Swahili data )
https://github.com/besacier/ALFFA_PUBLIC/tree/master/ASR/WOLOF   (Wolof data)

Dataset Overview

Dataset Type

Language

Speech Style

Content

Amharic, Swahili and Wolof data, mirrored from the ALFFA git repository

Audio Parameters

File Format

Recording Equipment

Recording Environment

License

MIT

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Verifying Email