DocumentCode
394264
Title
The ICSI Meeting Corpus
Author
Janin, Adam ; Baron, Don ; Edwards, Jane ; Ellis, Dan ; Gelbart, David ; Morgan, Nelson ; Peskin, Barbara ; Pfau, Thilo ; Shriberg, Elizabeth ; Stolcke, Andreas ; Wooters, Chuck
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
We have collected a corpus of data from natural meetings that occurred at the International Computer Science Institute (ICSI) in Berkeley, California over the last three years. The corpus contains audio recorded simultaneously from head-worn and table-top microphones, word-level transcripts of meetings, and various metadata on participants, meetings, and hardware. Such a corpus supports work in automatic speech recognition, noise robustness, dialog modeling, prosody, rich transcription, information retrieval, and more. We present details on the contents of the corpus, as well as rationales for the decisions that led to its configuration. The corpus were delivered to the Linguistic Data Consortium (LDC).
Keywords
audio recording; microphones; speech processing; speech recognition; Berkeley; California; ICSI; ICSI Meeting Corpus; International Computer Science Institute; Linguistic Data Consortium; audio recordings; automatic speech recognition; data corpus; dialog modeling; head-worn microphones; information retrieval; noise robustness; prosody; table-top microphones; transcription;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198793
Filename
1198793
Link To Document