• DocumentCode
    180488
  • Title

    Subword-based modeling for handling OOV words inkeyword spotting

  • Author

    Yanzhang He ; Hutchinson, Brian ; Baumann, Philipp ; Ostendorf, Mari ; Fosler-Lussier, Eric ; Pierrehumbert, Janet

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    7864
  • Lastpage
    7868
  • Abstract
    This work compares ASR decoding at different subword levels crossed with alternative keyword search strategies to handle the OOV issue for keyword spotting in the low-resource setting. We show that a morpheme-based subword modeling approach is effective in recovering OOV keywords within a Turkish low-resource keyword spotting task, where mixed word and morpheme decoding approach outperforms the traditional subword-based search from word-decoded lattices that are broken down to subword lattices. Furthermore, unsupervised learning of morphology works almost as well as a rule-based system designed for the language despite the low-resource condition. A staged keyword search strategy benefits from both methods of morphological analysis.
  • Keywords
    speech recognition; unsupervised learning; vocabulary; ASR decoding; OOV words inkeyword spotting; morpheme based subword modeling approach; subword based modeling; unsupervised learning; Conferences; Decoding; Lattices; Speech; Speech processing; Speech recognition; Vocabulary; Automatic Speech Recognition; Keyword Spotting; Morphology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6855131
  • Filename
    6855131