• DocumentCode
    591897
  • Title

    Using syntactic and confusion network structure for out-of-vocabulary word detection

  • Author

    Marin, A. ; Kwiatkowski, Tom ; Ostendorf, Mari ; Zettlemoyer, Luke

  • Author_Institution
    Univ. of Washington, Seattle, WA, USA
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    159
  • Lastpage
    164
  • Abstract
    This paper addresses the problem of detecting words that are out-of-vocabulary (OOV) for a speech recognition system to improve automatic speech translation. The detection system leverages confidence prediction techniques given a confusion network representation and parsing with OOV word tokens to identify spans associated with true OOV words. Working in a resource-constrained domain, we achieve OOV detection F-scores of 60-66 and reduce word error rate by 12% relative to the case where OOV words are not detected.
  • Keywords
    speech recognition; vocabulary; OOV; automatic speech translation; confusion network structure; out-of-vocabulary word detection; resource constrained domain; speech recognition system; syntactic network structure; Error analysis; Grammar; Lattices; Speech; Speech recognition; Syntactics; Vocabulary; OOV detection; parsing; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424215
  • Filename
    6424215