• DocumentCode
    591769
  • Title

    A hybrid fragment / syllable-based system for improved OOV term detection

  • Author

    Yong Xu ; Wu Guo ; Lirong Dai

  • Author_Institution
    Dept. of Electron. Eng. & Inf. Sci., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    378
  • Lastpage
    382
  • Abstract
    Spoken term detection (STD) is a task for open vocabulary search in large recordings of speech. Although the term detection performance for in-vocabulary (INV) terms has achieved a great improvement, the detection performance for out of vocabulary (OOV) terms is still disappointing. In this paper, we propose to combine fragment-based with syllable-based search into a hybrid STD system for OOV terms. Syllable is a kind of knowledge-based subword while fragment is data-driven. We initially compare their different modeling ability for OOVs. Considering the potential complementarities between them, we explore two methods of fusion: index fusion (combining the triphone indexes of a fragment-based and a syllable-based system) and result fusion (merging search results of the two systems). After the result fusion, we achieve a 9.4% relative improvement on NIST STD06 English conversational telephone speech (CTS) EvalSet in actual term weighted value (ATWV).
  • Keywords
    natural language processing; speech recognition; vocabulary; English conversational telephone speech; OOV term detection; actual term weighted value; hybrid STD system; in-vocabulary terms; index fusion; knowledge based subword; large recordings; modeling ability; open vocabulary search; out of vocabulary terms; spoken term detection; syllable based search; syllable based system; term detection performance; Hidden Markov models; Indexes; NIST; Speech; Speech processing; Training; Vocabulary; fragment; fusion method; out of vocabulary; spoken term detection; syllable; triphone index;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423479
  • Filename
    6423479