• DocumentCode
    2790090
  • Title

    Application of out-of-language detection to spoken term detection

  • Author

    Motlicek, Petr ; Valente, Fabio

  • Author_Institution
    Idiap Res. Inst., Martigny, Switzerland
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5098
  • Lastpage
    5101
  • Abstract
    This paper investigates the detection of English spoken terms in a conversational multi-language scenario. The speech is processed using a large vocabulary continuous speech recognition system. The recognition output is represented in the form of word recognition lattices which are then used to search required terms. Due to the potential multi-lingual speech segments at the input, the spoken term detection system is combined with a module performing out-of-language detection to adjust its confidence scores. First, experimental results of spoken term detection are provided on the conversational telephone speech database distributed by NIST in 2006. Then, the system is evaluated on a multi-lingual database with and without employment of the out-of-language detection module, where we are only interested in detecting English terms (stored in the index database). Several strategies to combine these two systems in an efficient way are proposed and evaluated. Around 7% relative improvement over a stand-alone STD is achieved.
  • Keywords
    linguistics; speech recognition; multilingual speech segments; out-of-language detection application; speech recognition system; spoken term detection; telephone speech database; word recognition; Birth disorders; Databases; Dictionaries; Indexing; Lattices; Natural languages; Speech processing; Speech recognition; Telephony; Vocabulary; Confidence Measure (CM); Large Vocabulary Continuous Speech Recognition (LVCSR); Out-Of-Language (OOL) detection; Spoken Term Detection (STD);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495038
  • Filename
    5495038