DocumentCode :
3531298
Title :
Improving multi-lattice alignment based spoken keyword spotting
Author :
Lin, Hui ; Stupakov, Alex ; Bilmes, Jeff
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4877
Lastpage :
4880
Abstract :
In previous work, we showed that using a lattice instead of the 1-best path to represent both the query and the utterance being searched is beneficial for spoken keyword spotting. In this paper, we introduce several techniques that further improve our multi-lattice alignment approach, including edit operation modeling and supervised training of the conditional probability table, something which cannot be directly trained by traditional maximum likelihood estimation. Experiments on TIMIT show that the proposed methods significantly improve the performance of spoken keyword spotting.
Keywords :
audio databases; learning (artificial intelligence); maximum likelihood estimation; probability; query processing; speech processing; TIMIT; conditional probability table; edit operation modeling; maximum likelihood estimation; multilattice alignment-based spoken keyword spotting; search query; speech database; supervised training; Audio recording; Auditory displays; Cameras; Graphical models; Keyboards; Lattices; Legged locomotion; Maximum likelihood estimation; Microphones; Video recording; Spoken keyword spotting; auxiliary training; edit operation modeling; lattice alignment; negative training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960724
Filename :
4960724
Link To Document :
بازگشت