DocumentCode
2789131
Title
Subword-based spoken term detection in audio course lectures
Author
Rose, Richard ; Norouzian, Atta ; Reddy, Aarthi ; Coy, Andre ; Gupta, Vishwa ; Karafiat, Martin
Author_Institution
Dept. of ECE, McGill Univ., Montreal, QC, Canada
fYear
2010
fDate
14-19 March 2010
Firstpage
5282
Lastpage
5285
Abstract
This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented in which lattice paths that are likely to contain search terms are identified, and an efficient phone-based distance is used to detect occurrences of search terms in phonetic expansions of the promising lattice paths. STD and ASR results are reported for both in-vocabulary (IV) and out-of-vocabulary (OOV) search terms in this lecture speech domain.
Keywords
audio recording; educational computing; speech recognition; audio course lectures; audio recordings; automatic speech recognition system; in-vocabulary search terms; lattice paths; lecture speech domain; media repository; out-of-vocabulary search terms; phone based distance; phonetic expansions; subword-based spoken term detection; word lattices; Audio recording; Automatic speech recognition; Broadcasting; Decoding; Delay; Lattices; Speech recognition; Telephony; Vocabulary; Voice mail; spoken term detection
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5494982
Filename
5494982