Title :
Improving multi-lattice alignment based spoken keyword spotting
Author :
Lin, Hui ; Stupakov, Alex ; Bilmes, Jeff
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA
Abstract :
In previous work, we showed that using a lattice instead of the 1-best path to represent both the query and the utterance being searched is beneficial for spoken keyword spotting. In this paper, we introduce several techniques that further improve our multi-lattice alignment approach, including edit operation modeling and supervised training of the conditional probability table, something which cannot be directly trained by traditional maximum likelihood estimation. Experiments on TIMIT show that the proposed methods significantly improve the performance of spoken keyword spotting.
Keywords :
audio databases; learning (artificial intelligence); maximum likelihood estimation; probability; query processing; speech processing; TIMIT; conditional probability table; edit operation modeling; maximum likelihood estimation; multilattice alignment-based spoken keyword spotting; search query; speech database; supervised training; Audio recording; Auditory displays; Cameras; Graphical models; Keyboards; Lattices; Legged locomotion; Maximum likelihood estimation; Microphones; Video recording; Spoken keyword spotting; auxiliary training; edit operation modeling; lattice alignment; negative training;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960724