DocumentCode
319590
Title
An acoustic front-end using warped frequency and temporal resolutions
Author
Lilly, B.T. ; Paliwal, K.K.
Author_Institution
Sch. of Microelectron. Eng., Griffith Univ., Brisbane, Qld., Australia
Volume
1
fYear
1997
fDate
4-4 Dec. 1997
Firstpage
133
Abstract
Typically, the power spectrum of a speech frame used in speech recognition is estimated for a fixed length window using the fast Fourier transform. Each frequency component represented in this power spectrum is an estimate over that speech frame. The power spectrum calculated in this way has a constant time and frequency resolution. An example of this type of front-end is the LPC-derived cepstral front-end commonly used is recognition systems today. The acoustic front-end presented in this paper employs both a warped frequency and temporal resolutions. We show that a front-end that utilises both warping functions, outperforms a front-end that employs only a warped frequency scale. We also show that this new front-end is unsuitable for noisy conditions.
Keywords
acoustic signal processing; FIR bandpass filter bank; LPC-derived cepstral front-end; acoustic front-end; fast Fourier transform; fixed length window; frequency component; human auditory system; power spectrum; speech frame; speech recognition; time resolution; warped frequency resolution; warped temporal resolution; Band pass filters; Bandwidth; Cepstral analysis; Ear; Filter bank; Finite impulse response filter; Frequency estimation; Humans; Signal resolution; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location
Brisbane, Qld., Australia
Print_ISBN
0-7803-4365-4
Type
conf
DOI
10.1109/TENCON.1997.647275
Filename
647275
Link To Document