• DocumentCode
    677156
  • Title

    Instantaneous fundamental frequency estimation of speech signals using DESA in low-frequency region

  • Author

    Rathore, P.S. ; Pachori, Ram Bilas

  • Author_Institution
    Eng. Dept., Akashvani Indore, Indore, India
  • fYear
    2013
  • fDate
    12-14 Dec. 2013
  • Firstpage
    470
  • Lastpage
    473
  • Abstract
    Instantaneous fundamental frequency (IFF) is one of the basic signal parameters of speech signals. Accurate estimation of the IFF of speech signals can be used for speaker recognition system, gender identification and emotion recognition. Various methods for estimation of the IFF of speech signals have been proposed in the literature. In this paper, we propose a new method to estimate IFF of speech signals using the discrete energy separation algorithm (DESA) in the low-frequency region. The Fourier-Bessel (FB) series expansion has been used to filter out the low-frequency region from the speech signal. The reconstructed band-limited signal from the FB coefficients is modeled by using the amplitude and frequency modulated (AM-FM) signal model. Amplitude envelope of the reconstructed band-limited signal is estimated using the DESA. The estimation of amplitude envelope function has been used for detection of glottal closure instants (GCIs). Inverse of time elapsed between two consecutive GCIs provides IFF of speech signals. Simulation results of proposed method have been compared with the IFF computed from reference differenced electroglottograph (EGG) signal. The CMU-Arctic database has been used in this work for validation of the proposed method for IFF determination. It has been observed that the proposed method provides accurate estimation of IFF of speech signals at most of time instants.
  • Keywords
    Bessel functions; Fourier series; frequency estimation; signal reconstruction; source separation; speaker recognition; AM-FM signal model; DESA; EGG signal; FB coefficient; Fourier-Bessel series expansion; GCI detection; IFF estimation; amplitude envelope function estimation; amplitude modulated signal model; band-limited signal reconstruction; discrete energy separation algorithm; eMU-Arctic database; electroglottograph signal; emotion recognition; frequency modulated signal model; gender identification; glottal closure instant detection; instantaneous fundamental frequency estimation; low-frequency region; speaker recognition system; speech signal; Databases; Estimation; Frequency estimation; Speech; Speech processing; Transforms; Amplitude and frequency modulated (AM-FM) signal model; Discrete energy separation algorithm (DESA); Fourier-Bessel (FB) series expansion; Glottal closure instants (GCIs); Instantaneous fundamental frequency (IFF); Speech signal analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communication (ICSC), 2013 International Conference on
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-1605-4
  • Type

    conf

  • DOI
    10.1109/ICSPCom.2013.6719836
  • Filename
    6719836