DocumentCode
704697
Title
Improving accuracy of glottal closure instant detection methods in nonstationary noise
Author
Deshpande, Pranav S. ; Manikandan, M. Sabarimalai
Author_Institution
Sch. of Electr. Sci., Indian Inst. of Technol. Bhubaneswar, Bhubaneswar, India
fYear
2015
fDate
19-20 Feb. 2015
Firstpage
736
Lastpage
741
Abstract
The glottal closure instant (GCI) detection is crucial in all kinds of speech processing applications. A variety of GCI detection methods were proposed but the performance of most existing methods was evaluated using only voiced parts of the speech signal. Under noisy conditions, the recorded speech consists of voiced, unvoiced and non-speech parts. Thus, most GCI detection methods demand an accurate voice activity detection (VAD) design. In this paper, we present post-processing techniques to improve the GCI detection accuracy and robustness in additive nonstationary noisy environments. We study the performance of the four methods: center of gravity (CoG), group delay function (GDF), zero frequency resonator (ZFR), and speech event detection using the residual excitation and a mean-based signal (SEDREAMS). The performance of each method with the proposed method-specific post-processing technique is tested and validated under both clean and noisy environments. Experimental results show that the method with post-processing technique outperforms the conventional GCI detection methods under noisy conditions. Results further show that the proposed technique can not only improve the overall detection accuracy but also reduce the complexity by avoiding VAD algorithm.
Keywords
signal denoising; signal detection; speech synthesis; CoG; GCI detection method accuracy improvement; GDF; SEDREAMS; VAD algorithm; ZFR; additive nonstationary noisy environments; center of gravity; complexity reduction; glottal closure instant detection; group delay function; recorded speech signal; speech event detection using residual excitation and mean-based signal; speech processing applications; speech synthesis; voice activity detection design; zero frequency resonator; Databases; Delays; Feature extraction; Larynx; Noise measurement; Speech; Speech processing; Epoch extraction; glottal closure instant;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-5990-7
Type
conf
DOI
10.1109/SPIN.2015.7095390
Filename
7095390
Link To Document