Title :
Voice activity detection based on ensemble empirical mode decomposition and teager kurtosis
Author :
Chong Feng ; Chunhui Zhao
Author_Institution :
Coll. of Inf. & Commun. Eng., Harbin Eng. Univ., Harbin, China
Abstract :
This paper proposes an improved voice activity detection (VAD) methodology based on ensemble empirical mode decomposition (EEMD) algorithm and the teager kurtosis to avoid the defect of empirical mode decomposition (EMD) in mode mixing. The teager energy operator is used to track the modulation energy of each intrinsic mode function (IMF), decomposed by ensemble empirical mode decomposition. The root power function and order statistics filter are used on the teager kurtosis for feature extraction. Voice activity detection can be implemented over the suitable threshold which can be automatically estimated by tracking the minimum of the extracted feature values. Experiments show that the proposed VAD can achieve comparable results at high signal-to-noise ratio (SNR). For low SNR conditions, it is able to maintain lower error detection ratio and higher detection ratio, compared with those of the original algorithm.
Keywords :
error detection; feature extraction; filtering theory; speech processing; EEMD; IMF; SNR; VAD; VOICE ACTIVITY DETECTION; ensemble empirical mode decomposition; error detection ratio; feature extraction; intrinsic mode function; root power function; signal-to-noise ratio; statistic filter; teager kurtosis; Empirical mode decomposition; Feature extraction; Noise measurement; Signal to noise ratio; Speech; White noise; ensemble empirical mode decomposition; root power function; teager kurtosis; voice activity detection;
Conference_Titel :
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2188-1
DOI :
10.1109/ICOSP.2014.7015047