• DocumentCode
    2180468
  • Title

    A segment-level confidence measure for Spoken Document Retrieval

  • Author

    Senay, Grégory ; Linarès, Georges ; Lecouteux, Benjamin

  • Author_Institution
    Lab. Inf. d´´Avignon (LIA-CERI), Univ. of Avignon, Avignon, France
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5548
  • Lastpage
    5551
  • Abstract
    This paper presents a semantic confidence measure that aims to predict the relevance of automatic transcripts for a task of Spoken Document Retrieval (SDR). The proposed predicting method relies on the combination of Automatic Speech Recognition (ASR) confidence measure and a Semantic Compacity Index (SCI), that estimates the relevance of the words considering the semantic context in which they occurred. Experiments are conducted on the French Broadcast news corpus ESTER, by simulating a classical SDR usage scenario : users submit text-queries to a search engine that is expected to return the most relevant documents regarding the query. Results demonstrate the interest of using semantic level in formation to predict the transcription indexability.
  • Keywords
    indexing; query processing; search engines; speech recognition; text analysis; French Broadcast news corpus ESTER; SDR; automatic speech recognition confidence measure; automatic transcripts relevance; search engine; segment level confidence measure; semantic compacity index; semantic confidence measure; semantic level information; spoken document retrieval; text queries; transcription indexability; Acoustics; Indexing; Measurement; Semantics; Speech; Speech recognition; Speech recognition; confidence measures; spoken document retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947616
  • Filename
    5947616