Information extraction from speech

Author

Makhoul, J.

Author_Institution

BBN Technol., Cambridge, MA

fYear

2006

fDate

10-13 Dec. 2006

Firstpage

3

Lastpage

3

Abstract

Summary form only given. The state of the art in automatic speech recognition has reached the point that searching for and extracting information from large speech repositories or streaming audio has become a growing reality. This paper summarizes the technologies that have been instrumental in making audio as searchable as text, including speech recognition, speaker clustering, segmentation, and identification; topic classification; and story segmentation. Once speech is turned into text, information extraction methods can then be applied, such as named entity extraction, finding relationships between named entities, and resolution of anaphoric references. Examples of deployed systems for information extraction from speech, which incorporate some of the aforementioned technologies, will be given.

Keywords

information retrieval; speech recognition; automatic speech recognition; information extraction; large speech repositories; named entity extraction; speaker clustering; story segmentation; topic classification; Automatic speech recognition; Data mining; Instruments; Speech recognition; Streaming media;

fLanguage

English

Publisher

ieee

Conference_Titel

Spoken Language Technology Workshop, 2006. IEEE

Conference_Location

Palm Beach

Print_ISBN

1-4244-0872-5

Type

conf

DOI

10.1109/SLT.2006.326780

Filename

4123343