DocumentCode :
3527171
Title :
Posterior features applied to speech recognition tasks with user-defined vocabulary
Author :
Aradilla, Guillermo ; Bourlard, Hervé ; Magimai-Doss, Mathew
Author_Institution :
Idiap Res. Inst., Martigny
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
3809
Lastpage :
3812
Abstract :
This paper presents a novel approach for those applications where vocabulary is defined by a set of acoustic samples. In this approach, the acoustic samples are used as reference templates in a template matching framework. The features used to describe the reference templates and the test utterances are estimates of phoneme posterior probabilities. These posteriors are obtained from a MLP trained on an auxiliary database. Thus, the speech variability present in the features is reduced by applying the speech knowledge captured by the MLP on the auxiliary database. Moreover, information theoretic dissimilarity measures can be used as local distances between features. When compared to state-of-the-art systems, this approach outperforms acoustic-based techniques and obtains comparable results to orthography-based methods. The proposed method can also be directly combined with other posterior-based HMM systems. This combination successfully exploits the complementarity between templates and parametric models.
Keywords :
hidden Markov models; information theory; speech recognition; information theoretic dissimilarity measures; phoneme posterior probabilities; posterior features; speech recognition; template matching framework; user-defined vocabulary; Acoustic applications; Acoustic measurements; Acoustic testing; Automatic speech recognition; Decoding; Hidden Markov models; Parametric statistics; Spatial databases; Speech recognition; Vocabulary; Kullback-Leibler divergence; Speech recognition; posterior features; template matching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960457
Filename :
4960457
Link To Document :
بازگشت