DocumentCode
2326456
Title
Improving speech recognition robustness using non-standard windows
Author
Rozman, R. ; Kodek, DuSan M.
Author_Institution
Comput. & Inf. Sci. Fac., Ljubljana Univ., Slovenia
Volume
2
fYear
2003
fDate
22-24 Sept. 2003
Firstpage
171
Abstract
The windowing problem of short-time frequency analysis in speech recognition systems (SRSs) is considered. Design possibilities for different non-standard window sequences are presented. The traditional "digital filtering" approach to the design of finite window sequences with linear and nonlinear phase response is examined. Since human hearing is relatively insensitive to phase distortions of speech signals, ideas of alternative windows with nonlinear phase response are also investigated. The two most promising design methods for nonlinear phase windows are discussed. Practical performance comparison of such windows with the Hamming window on two real SRSs is presented. They show that the non-standard window sequences can contribute to greater SRS robustness. Additional research on non-standard windows and the parameterization process as a whole is suggested.
Keywords
Fourier transforms; speech recognition; time-frequency analysis; Hamming window; digital filtering approach; human hearing; linear phase response; nonlinear phase response; nonstandard windows; parameterization process; phase distortions; short-time frequency analysis; speech processing; speech recognition robustness; speech signal; windowing problem; Auditory system; Digital filters; Filtering; Frequency; Humans; Nonlinear filters; Phase distortion; Robustness; Speech analysis; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
EUROCON 2003. Computer as a Tool. The IEEE Region 8
Print_ISBN
0-7803-7763-X
Type
conf
DOI
10.1109/EURCON.2003.1248175
Filename
1248175
Link To Document