• DocumentCode
    700175
  • Title

    Using comparison of parallel phoneme probability streams for OOV word detection

  • Author

    Tosic, Tamara ; Magimai-Doss, Mathew ; Hermansky, Hynek

  • Author_Institution
    IDIAP Res. Inst., Centre du Parc, Martigny, Switzerland
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper, we investigate the approach of comparing two different parallel streams of phoneme posterior probability estimates for OOV word detection. The first phoneme posterior probability stream is estimated using only the knowledge of short-term acoustic observation. In our work we refer this stream as “out-of-context posteriors”. The second posterior probability stream, referred also as “in-context posteriors” is estimated using the knowledge of the whole acoustic observation sequence: the acoustic model and the language model of an ASR system. In particular, we focus our study on different types of distance measures, namely KL-divergence and Euclidean distance, to compare the two phoneme posterior probability streams. Our experiments on large vocabulary automatic speech recognition task shows that using KL-divergence measure estimated with the in-context posteriors as reference distribution consistently yields a better OOV word detection system.
  • Keywords
    probability; speech recognition; ASR system; Euclidean distance; KL-divergence; OOV word detection; large vocabulary automatic speech recognition task; out-of-context posteriors; parallel posterior phoneme probability streams; short-term acoustic observation; whole acoustic observation sequence; Acoustics; Context; Euclidean distance; Lattices; Speech; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080707