DocumentCode
3531298
Title
Improving multi-lattice alignment based spoken keyword spotting
Author
Lin, Hui ; Stupakov, Alex ; Bilmes, Jeff
Author_Institution
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
fYear
2009
fDate
19-24 April 2009
Firstpage
4877
Lastpage
4880
Abstract
In previous work, we showed that using a lattice instead of the 1-best path to represent both the query and the utterance being searched is beneficial for spoken keyword spotting. In this paper, we introduce several techniques that further improve our multi-lattice alignment approach, including edit operation modeling and supervised training of the conditional probability table, something which cannot be directly trained by traditional maximum likelihood estimation. Experiments on TIMIT show that the proposed methods significantly improve the performance of spoken keyword spotting.
Keywords
audio databases; learning (artificial intelligence); maximum likelihood estimation; probability; query processing; speech processing; TIMIT; conditional probability table; edit operation modeling; maximum likelihood estimation; multilattice alignment-based spoken keyword spotting; search query; speech database; supervised training; Audio recording; Auditory displays; Cameras; Graphical models; Keyboards; Lattices; Legged locomotion; Maximum likelihood estimation; Microphones; Video recording; Spoken keyword spotting; auxiliary training; edit operation modeling; lattice alignment; negative training;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960724
Filename
4960724
Link To Document