• DocumentCode
    319590
  • Title

    An acoustic front-end using warped frequency and temporal resolutions

  • Author

    Lilly, B.T. ; Paliwal, K.K.

  • Author_Institution
    Sch. of Microelectron. Eng., Griffith Univ., Brisbane, Qld., Australia
  • Volume
    1
  • fYear
    1997
  • fDate
    4-4 Dec. 1997
  • Firstpage
    133
  • Abstract
    Typically, the power spectrum of a speech frame used in speech recognition is estimated for a fixed length window using the fast Fourier transform. Each frequency component represented in this power spectrum is an estimate over that speech frame. The power spectrum calculated in this way has a constant time and frequency resolution. An example of this type of front-end is the LPC-derived cepstral front-end commonly used is recognition systems today. The acoustic front-end presented in this paper employs both a warped frequency and temporal resolutions. We show that a front-end that utilises both warping functions, outperforms a front-end that employs only a warped frequency scale. We also show that this new front-end is unsuitable for noisy conditions.
  • Keywords
    acoustic signal processing; FIR bandpass filter bank; LPC-derived cepstral front-end; acoustic front-end; fast Fourier transform; fixed length window; frequency component; human auditory system; power spectrum; speech frame; speech recognition; time resolution; warped frequency resolution; warped temporal resolution; Band pass filters; Bandwidth; Cepstral analysis; Ear; Filter bank; Finite impulse response filter; Frequency estimation; Humans; Signal resolution; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
  • Conference_Location
    Brisbane, Qld., Australia
  • Print_ISBN
    0-7803-4365-4
  • Type

    conf

  • DOI
    10.1109/TENCON.1997.647275
  • Filename
    647275