DocumentCode
1109599
Title
A high-quality digital filterbank for speech recognition which runs in real time on a standard microprocessor
Author
Silverman, Harvey F.
Author_Institution
Brown University, Providence, RI, USA
Volume
34
Issue
5
fYear
1986
fDate
10/1/1986
Firstpage
1064
Lastpage
1073
Abstract
Asynchronous digital filterbank or LPC output is widely used for high-quality research and commercial speech recognition applications. Special-purpose hardware is usually applied. In this paper, a decimation/DFT filterbank system is described which may be implemented in real time on a standard microprocessor. Key features of the approach are a dithering technique, which allows output sampling at a rate lower than would be theoretically predicted as adequate, and the Winograd Fourier transform algorithm (WFTA) implementation of the DFT. The contributions of this paper are the dithering idea, the demonstration that it is feasible to implement a high-quality digital filterbank on a standard microprocessor, and the discussion of starting-phase error. The nonlinear effects of dithering are discussed in detail, and it is shown that the dithering technique significantly reduces starting-phase error, which can be important in very close discrete utterance recognition (DUR) situations.
Keywords
Digital filters; Discrete Fourier transforms; Filter bank; Fourier transforms; Hardware; Linear predictive coding; Microprocessors; Real time systems; Sampling methods; Speech recognition
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1986.1164947
Filename
1164947