• DocumentCode
    2789131
  • Title

    Subword-based spoken term detection in audio course lectures

  • Author

    Rose, Richard ; Norouzian, Atta ; Reddy, Aarthi ; Coy, Andre ; Gupta, Vishwa ; Karafiat, Martin

  • Author_Institution
    Dept. of ECE, McGill Univ., Montreal, QC, Canada
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5282
  • Lastpage
    5285
  • Abstract
    This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented where lattice paths which are likely to contain search terms are identified and an efficient phone based distance is used to detect the occurrence of search terms in phonetic expansions of promising lattice paths. STD and ASR results are reported for both in-vocabulary (IV) and out-of-vocabulary (OOV) search terms in this lecture speech domain.
  • Keywords
    audio recording; educational computing; speech recognition; audio course lectures; audio recordings; automatic speech recognition system; in-vocabulary search terms; lattice paths; lecture speech domain; media repository; out-of-vocabulary search terms; phone based distance; phonetic expansions; subword-based spoken term detection; word lattices; Audio recording; Automatic speech recognition; Broadcasting; Decoding; Delay; Lattices; Speech recognition; Telephony; Vocabulary; Voice mail; Speech recognition; spoken term detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5494982
  • Filename
    5494982