MagicData
SIGN IN

Total Size: 131.6GB

概览

数据集类型

语种

英语

语音类型

The AMI Corpus

内容

meeting recordings

音频参数

文件格式

XML

录音设备

录音环境

室内

授权方式

THE CREATIVE COMMONS ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE v2.0 LICENCE

第三方
ASR数据集

ASR-AMImeeting-BigESC: An English Speech Corpus from The AMI Meeting

About this resource:

此AMI会议语料数据集由100小时的会议录像组成。本录像由同步到同一时间轴的多个信号组成,包括近场和远场麦克风、单人视角和全景视角摄像机、幻灯机及电子白板。

This is a mirror of The AMI Corpus acoustic data originally hosted on http://groups.inf.ed.ac.uk/ami/corpus/

The AMI Meeting Corpus consists of 100 hours of meeting recordings. The recordings use a range of signals synchronized to a common timeline. These include close-talking and far-field microphones, individual and room-view video cameras, and output from a slide projector and an electronic whiteboard. During the meetings, the participants also have unsynchronized pens available to them that record what is written. The meetings were recorded in English using three different rooms with different acoustic properties and included mostly non-native speakers. The associated paper(s) describing the data:

  • Jean Carletta (2007). Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus. Language Resources and Evaluation Journal 41(2): 181-190. pdf
  • Steve Renals, Thomas Hain, and Hervé Bourlard (2007). Recognition and interpretation of meetings: The AMI and AMIDA projects. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU '07). pdf

External URL: http://groups.inf.ed.ac.uk/ami/corpus   The official AMI corpus webpage

概览

数据集类型

语种

英语

语音类型

The AMI Corpus

内容

meeting recordings

音频参数

文件格式

XML

录音设备

录音环境

室内

授权方式

THE CREATIVE COMMONS ATTRIBUTION-NONCOMMERCIAL-SHAREALIKE v2.0 LICENCE

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}评论
写评论
*访客无法进行评论

Verifying Email