Title :
Improving accuracy of glottal closure instant detection methods in nonstationary noise
Author :
Deshpande, Pranav S. ; Manikandan, M. Sabarimalai
Author_Institution :
Sch. of Electr. Sci., Indian Inst. of Technol. Bhubaneswar, Bhubaneswar, India
Abstract :
The glottal closure instant (GCI) detection is crucial in all kinds of speech processing applications. A variety of GCI detection methods were proposed but the performance of most existing methods was evaluated using only voiced parts of the speech signal. Under noisy conditions, the recorded speech consists of voiced, unvoiced and non-speech parts. Thus, most GCI detection methods demand an accurate voice activity detection (VAD) design. In this paper, we present post-processing techniques to improve the GCI detection accuracy and robustness in additive nonstationary noisy environments. We study the performance of the four methods: center of gravity (CoG), group delay function (GDF), zero frequency resonator (ZFR), and speech event detection using the residual excitation and a mean-based signal (SEDREAMS). The performance of each method with the proposed method-specific post-processing technique is tested and validated under both clean and noisy environments. Experimental results show that the method with post-processing technique outperforms the conventional GCI detection methods under noisy conditions. Results further show that the proposed technique can not only improve the overall detection accuracy but also reduce the complexity by avoiding VAD algorithm.
Keywords :
signal denoising; signal detection; speech synthesis; CoG; GCI detection method accuracy improvement; GDF; SEDREAMS; VAD algorithm; ZFR; additive nonstationary noisy environments; center of gravity; complexity reduction; glottal closure instant detection; group delay function; recorded speech signal; speech event detection using residual excitation and mean-based signal; speech processing applications; speech synthesis; voice activity detection design; zero frequency resonator; Databases; Delays; Feature extraction; Larynx; Noise measurement; Speech; Speech processing; Epoch extraction; glottal closure instant;
Conference_Titel :
Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-5990-7
DOI :
10.1109/SPIN.2015.7095390