DocumentCode :
2403852
Title :
Linear interpolation of spectrotemporal excitation pattern representations for automatic speech recognition in the presence of noise
Author :
Stan, Adriana
Author_Institution :
Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear :
2009
fDate :
18-21 June 2009
Firstpage :
1
Lastpage :
6
Abstract :
This article is based on the study of new methods to improve recognition capabilities of automatic speech recognition in the presence of noise systems. Instead of trying to modify complex recognition models, the study is aimed at enhancing the input data´s reliability. This is achieved through processing of the acoustic representations of speech. One of these representations, called SpectroTemporal Excitation Pattern (STEP) is used in recognition systems with missing or unreliable data. One of the ideas behind this study was to increase the glimpsing areas in the STEP representations. And, because the glimpsing algorithm requires previous knowledge of the noise, another idea was to estimate noise characteristics, and base the glimpsing areas determination on these estimations. Preliminary tests were conducted with an HMM recognition system, but this will be the object of a future study.
Keywords :
hidden Markov models; interpolation; speech recognition; automatic speech recognition; glimpsing algorithm; hidden Markov models; linear interpolation; noise systems; spectrotemporal excitation pattern representations; Acoustic noise; Automatic speech recognition; Hidden Markov models; Humans; Interpolation; Pattern recognition; Speech enhancement; Speech processing; Speech recognition; Working environment noise; STEP representation; glimpsing; speech recognition in noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Technology and Human-Computer Dialogue, 2009. SpeD '09. Proceedings of the 5-th Conference on
Conference_Location :
Constant
Print_ISBN :
978-1-4244-4727-5
Type :
conf
DOI :
10.1109/SPED.2009.5156188
Filename :
5156188
Link To Document :
بازگشت