MagicData

Dataset Overview

Dataset Type

Language

English

Speech Style

Content

Audio Parameters

File Format

Recording Equipment

Recording Environment

Third Party
NLP Corpus

NLP-LM4ASR: A Text Dataset for Modelling Language of ASR Systems

About this resource:

These language modelling resources are intended for use with the (soon-to-be-released) LibriSpeech ASR corpus.

The corpus and these resources were prepared by Vassil Panayotov with the assistance of Daniel Povey and Sanjeev Khudanpur. We hope to finalize the resources and release the corpus here by the ICASSP deadline (early October 2014).

License
