DocumentCode :
2789090
Title :
Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder
Author :
Wöllmer, Martin ; Eyben, Florian ; Schuller, Björn ; Rigoll, Gerhard
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. Munchen, München, Germany
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5274
Lastpage :
5277
Abstract :
This paper proposes a novel system for robust keyword detection in continuous speech. Our decoder is composed of a bidirectional Long Short-Term Memory recurrent neural network using a Connectionist Temporal Classification (CTC) output layer, and a Dynamic Bayesian Network (DBN). The CTC network exploits bidirectional context information to reliably identify phonemes, whereas the DBN is able to discriminate between keywords and arbitrary speech while explicitly modeling substitutions, deletions, and insertions in the CTC phoneme output string. Our technique is vocabulary independent and does not require an explicit garbage model. Experiments show that our system architecture prevails over a standard Hidden Markov Model approach.
Keywords :
Bayes methods; recurrent neural nets; signal classification; speaker recognition; speech coding; vocabulary; CTC network; CTC phoneme output string; arbitrary speech; bidirectional context information; bidirectional long short-term memory recurrent neural network; connectionist temporal classification; continuous speech; dynamic Bayesian network; hybrid CTC-DBN decoder; keyword detection; spoken term detection; system architecture; vocabulary; Bayesian methods; Context modeling; Decoding; Graphical models; Hidden Markov models; Man machine systems; Recurrent neural networks; Robustness; Speech recognition; Vocabulary; Connectionist Temporal Classification; Dynamic Bayesian Networks; Keyword Spotting; Spoken Term Detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494980
Filename :
5494980
Link To Document :
بازگشت