• DocumentCode
    1857334
  • Title

    Information extraction from speech

  • Author

    Makhoul, J.

  • Author_Institution
    BBN Technol., Cambridge, MA
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    3
  • Lastpage
    3
  • Abstract
    Summary form only given. The state of the art in automatic speech recognition has reached the point that searching for and extracting information from large speech repositories or streaming audio has become a growing reality. This paper summarizes the technologies that have been instrumental in making audio as searchable as text, including speech recognition, speaker clustering, segmentation, and identification; topic classification; and story segmentation. Once speech is turned into text, information extraction methods can then be applied, such as named entity extraction, finding relationships between named entities, and resolution of anaphoric references. Examples of deployed systems for information extraction from speech, which incorporate some of the aforementioned technologies, will be given.
  • Keywords
    information retrieval; speech recognition; automatic speech recognition; information extraction; large speech repositories; named entity extraction; speaker clustering; story segmentation; topic classification; Automatic speech recognition; Data mining; Instruments; Speech recognition; Streaming media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326780
  • Filename
    4123343