Title :
Spectral local harmonicity feature for voice activity detection
Author :
Khoa, Pham Chau ; Siong, Chng Eng
Author_Institution :
Temasek Lab.@NTU, Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper, we propose a method to exploit the harmonicity of human voiced speech using only the most harmonic sub-part of the spectrum. This technique searches for all the potential sub-windows of the spectrum, and measures their local harmonicity, using a newly proposed metric, which works in the spectral autocorrelation domain and employs a novel sinusoidal fitting approach. Experiments show that the new feature can be used to detect noisy voiced speech frames heavily corrupted by non-stationary noise even at 0dB SNR with high precision and recall, which gives better results than the Windowed Autocorrelation Lag Energy (WALE), a recently proposed voicing features, under a complex factory noise scenarios.
Keywords :
audio signal processing; correlation methods; feature extraction; signal denoising; spectral analysis; speech processing; SNR; WALE; complex factory noise scenarios; harmonic spectrum subpart; human voiced speech harmonicity; local harmonicity; noisy voiced speech frame detection; nonstationary noise; sinusoidal fitting approach; spectral autocorrelation domain; spectral local harmonicity feature; subwindows; voice activity detection; windowed autocorrelation lag energy; Correlation; Discrete cosine transforms; Feature extraction; Harmonic analysis; Noise; Noise measurement; Speech;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376652