  • DocumentCode
    2966510
  • Title
    An approach for formant based speech recognition in noise
  • Author
    Fattah, Shaikh Anowarul ; Ghosh, T. ; Das, Amal K. ; Goswami, Ramasis ; Shafin, A. ; Jameel, M.M. ; Shahnaz, Celia
  • Author_Institution
    Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
  • fYear
    2012
  • fDate
    19-22 Nov. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, a noise-robust formant frequency estimation scheme is developed that exploits the properties of the autocorrelation function of the band-limited noisy speech signal. It is shown that applying the autocorrelation operation to a speech signal band-limited to a particular formant zone, rather than to the full-band signal, provides higher noise immunity, especially under severely noisy conditions. To extract each formant, a modified higher-order Yule-Walker method is applied to the resulting autocorrelation sequence. Within a band, the pole with the maximum energy is selected as the formant. The estimated formants are used as features, along with conventional Mel-frequency cepstral coefficients, in a vowel recognition system that uses a linear-discriminant-based classifier. Extensive experiments are carried out on speech samples taken from the TIMIT standard speech database. The proposed algorithm is found to provide better formant estimation accuracy than some state-of-the-art methods, even at a very low signal-to-noise ratio (SNR), for both male and female speakers. Moreover, formant estimates obtained by the proposed method also yield better vowel recognition accuracy in the presence of significant background noise.
  • Keywords
    autoregressive processes; correlation methods; speech recognition; Mel frequency cepstral coefficients; SNR; TIMIT standard speech database; Yule-Walker method; autocorrelation function; band-limited noisy speech signal; formant frequency estimation; maximum energy; signal-to-noise ratio; speech recognition; vowel recognition system; Accuracy; Estimation; Noise measurement; Signal to noise ratio; Speech; Speech recognition; Formant estimation; higher order Yule-Walker equations; noise; speech analysis; vowel recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    TENCON 2012 - 2012 IEEE Region 10 Conference
  • Conference_Location
    Cebu
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4673-4823-2
  • Electronic_ISBN
    2159-3442
  • Type
    conf
  • DOI
    10.1109/TENCON.2012.6412340
  • Filename
    6412340
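
The estimation idea summarized in the abstract (autocorrelation of a band-limited signal, followed by higher-order Yule-Walker equations and pole selection) can be illustrated with a minimal sketch. This is not the authors' modified method: it assumes the input has already been band-limited to a single formant zone, models that band as a second-order autoregressive (AR(2)) resonance so there is only one pole pair to choose from, and omits the "maximum energy" pole selection entirely. All function names and the `num_eqs` parameter are hypothetical.

```python
import math


def autocorr(x, max_lag):
    """Biased autocorrelation estimate r[0..max_lag] of a real sequence."""
    n = len(x)
    return [sum(x[t] * x[t + k] for t in range(n - k)) / n
            for k in range(max_lag + 1)]


def hoyw_ar2(r, num_eqs=4):
    """Least-squares AR(2) fit from higher-order Yule-Walker equations.

    Uses r[k] + a1*r[k-1] + a2*r[k-2] = 0 for k = 3 .. 2 + num_eqs, so r[0]
    (where additive white noise concentrates) never enters the fit.
    Requires len(r) >= 3 + num_eqs.
    """
    s11 = s12 = s22 = b1 = b2 = 0.0
    for k in range(3, 3 + num_eqs):
        u, v, y = r[k - 1], r[k - 2], -r[k]
        s11 += u * u
        s12 += u * v
        s22 += v * v
        b1 += u * y
        b2 += v * y
    # Solve the 2x2 normal equations by Cramer's rule.
    det = s11 * s22 - s12 * s12
    if abs(det) <= 1e-12 * s11 * s22:
        raise ValueError("ill-conditioned Yule-Walker system")
    a1 = (b1 * s22 - b2 * s12) / det
    a2 = (s11 * b2 - s12 * b1) / det
    return a1, a2


def pole_to_formant(a1, a2, fs):
    """Return (frequency in Hz, pole radius) for 1 + a1*z^-1 + a2*z^-2.

    Returns None when the poles are real, i.e. there is no resonance.
    """
    if a2 <= 0 or a1 * a1 >= 4 * a2:
        return None
    radius = math.sqrt(a2)
    theta = math.acos(-a1 / (2 * radius))  # pole angle in radians
    return theta * fs / (2 * math.pi), radius


def estimate_formant(x, fs, num_eqs=4):
    """Estimate one resonance from an (assumed) band-limited signal x."""
    r = autocorr(x, 2 + num_eqs)
    a1, a2 = hoyw_ar2(r, num_eqs)
    return pole_to_formant(a1, a2, fs)
```

Calling `estimate_formant(x, fs)` on a band-limited frame returns a `(frequency, radius)` pair, or `None` if no complex pole pair is found. Starting the equations at lag 3 is the design choice that matters here: for additive white noise, the noise power appears mainly in `r[0]`, so skipping the low lags is what gives Yule-Walker methods of this kind their noise immunity.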