DocumentCode
1857334
Title
Information extraction from speech
Author
Makhoul, J.
Author_Institution
BBN Technol., Cambridge, MA
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
3
Lastpage
3
Abstract
Summary form only given. The state of the art in automatic speech recognition has reached the point that searching for and extracting information from large speech repositories or streaming audio has become a growing reality. This paper summarizes the technologies that have been instrumental in making audio as searchable as text, including speech recognition, speaker clustering, segmentation, and identification; topic classification; and story segmentation. Once speech is turned into text, information extraction methods can then be applied, such as named entity extraction, finding relationships between named entities, and resolution of anaphoric references. Examples of deployed systems for information extraction from speech, which incorporate some of the aforementioned technologies, will be given.
Keywords
information retrieval; speech recognition; automatic speech recognition; information extraction; large speech repositories; named entity extraction; speaker clustering; story segmentation; topic classification; Automatic speech recognition; Data mining; Instruments; Speech recognition; Streaming media;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location
Palm Beach
Print_ISBN
1-4244-0872-5
Type
conf
DOI
10.1109/SLT.2006.326780
Filename
4123343
Link To Document