DocumentCode :
454591
Title :
Using Pitch as Prior Knowledge in Template-Based Speech Recognition
Author :
Aradilla, Guillermo ; Vepa, Jithendra ; Bourlard, Hervé
Author_Institution :
IDIAP Res. Inst.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
In a previous paper on speech recognition, we showed that templates can better capture the dynamics of speech signal compared to parametric models such as hidden Markov models. The key point in template matching approaches is finding the most similar templates to the test utterance. Traditionally, this selection is given by a distortion measure on the acoustic features. In this work, we propose to improve this template selection with the use of meta-linguistic information as prior knowledge. In this way, similarity is not only based on acoustic features but also on other sources of information that are present in the speech signal. Results on a continuous digit recognition task confirm the statement that similarity between words does not only depend on acoustic features since we obtained 24% relative improvement over the baseline. Interestingly, results are better even when compared to a system with no prior information but a larger number of templates
Keywords :
speech recognition; continuous digit recognition task; distortion measure; hidden Markov models; meta-linguistic information; parametric models; speech signal; template matching; template-based speech recognition; test utterance; Acoustic distortion; Acoustic measurements; Acoustic testing; Automatic speech recognition; Distortion measurement; Hidden Markov models; Information resources; Parametric statistics; Scalability; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660053
Filename :
1660053
Link To Document :
بازگشت