DocumentCode :
2789131
Title :
Subword-based spoken term detection in audio course lectures
Author :
Rose, Richard ; Norouzian, Atta ; Reddy, Aarthi ; Coy, Andre ; Gupta, Vishwa ; Karafiat, Martin
Author_Institution :
Dept. of ECE, McGill Univ., Montreal, QC, Canada
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5282
Lastpage :
5285
Abstract :
This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented where lattice paths which are likely to contain search terms are identified and an efficient phone based distance is used to detect the occurrence of search terms in phonetic expansions of promising lattice paths. STD and ASR results are reported for both in-vocabulary (IV) and out-of-vocabulary (OOV) search terms in this lecture speech domain.
Keywords :
audio recording; educational computing; speech recognition; audio course lectures; audio recordings; automatic speech recognition system; in-vocabulary search terms; lattice paths; lecture speech domain; media repository; out-of-vocabulary search terms; phone based distance; phonetic expansions; subword-based spoken term detection; word lattices; Audio recording; Automatic speech recognition; Broadcasting; Decoding; Delay; Lattices; Speech recognition; Telephony; Vocabulary; Voice mail; Speech recognition; spoken term detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494982
Filename :
5494982
Link To Document :
بازگشت