Title :
Instantaneous fundamental frequency estimation of speech signals using DESA in low-frequency region
Author :
Rathore, P.S. ; Pachori, Ram Bilas
Author_Institution :
Eng. Dept., Akashvani Indore, Indore, India
Abstract :
Instantaneous fundamental frequency (IFF) is one of the basic signal parameters of speech signals. Accurate estimation of the IFF of speech signals can be used for speaker recognition system, gender identification and emotion recognition. Various methods for estimation of the IFF of speech signals have been proposed in the literature. In this paper, we propose a new method to estimate IFF of speech signals using the discrete energy separation algorithm (DESA) in the low-frequency region. The Fourier-Bessel (FB) series expansion has been used to filter out the low-frequency region from the speech signal. The reconstructed band-limited signal from the FB coefficients is modeled by using the amplitude and frequency modulated (AM-FM) signal model. Amplitude envelope of the reconstructed band-limited signal is estimated using the DESA. The estimation of amplitude envelope function has been used for detection of glottal closure instants (GCIs). Inverse of time elapsed between two consecutive GCIs provides IFF of speech signals. Simulation results of proposed method have been compared with the IFF computed from reference differenced electroglottograph (EGG) signal. The CMU-Arctic database has been used in this work for validation of the proposed method for IFF determination. It has been observed that the proposed method provides accurate estimation of IFF of speech signals at most of time instants.
Keywords :
Bessel functions; Fourier series; frequency estimation; signal reconstruction; source separation; speaker recognition; AM-FM signal model; DESA; EGG signal; FB coefficient; Fourier-Bessel series expansion; GCI detection; IFF estimation; amplitude envelope function estimation; amplitude modulated signal model; band-limited signal reconstruction; discrete energy separation algorithm; eMU-Arctic database; electroglottograph signal; emotion recognition; frequency modulated signal model; gender identification; glottal closure instant detection; instantaneous fundamental frequency estimation; low-frequency region; speaker recognition system; speech signal; Databases; Estimation; Frequency estimation; Speech; Speech processing; Transforms; Amplitude and frequency modulated (AM-FM) signal model; Discrete energy separation algorithm (DESA); Fourier-Bessel (FB) series expansion; Glottal closure instants (GCIs); Instantaneous fundamental frequency (IFF); Speech signal analysis;
Conference_Titel :
Signal Processing and Communication (ICSC), 2013 International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-1605-4
DOI :
10.1109/ICSPCom.2013.6719836