DocumentCode :
1793229
Title :
Voice activity detection in presence of transient noise using spectral clustering and diffusion kernels
Author :
Rosen, Oren ; Mousazadeh, Saman ; Cohen, Israel
Author_Institution :
Technion - Israel Inst. of Technol., Haifa, Israel
fYear :
2014
fDate :
3-5 Dec. 2014
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, we introduce a voice activity detection (VAD) algorithm based on spectral clustering and diffusion kernels. The proposed algorithm is a supervised learning algorithm comprising of learning and testing stages: A sample cloud is produced for every signal frame by utilizing a moving window. Mel-frequency cepstrum coefficients (MFCCs) are then calculated for every sample in the cloud in order to produce an MFCC matrix and subsequently a covariance matrix for every frame. Utilizing the covariance matrix, we calculate a similarity matrix using spectral clustering and diffusion kernels methods. Using the similarity matrix, we cluster the data and transform it to a new space where each point is labeled as speech or nonspeech. We then use a Gaussian Mixture Model (GMM) in order to build a statistical model for labeling data as speech or nonspeech. Simulation results demonstrate its advantages compared to a recent VAD algorithm.
Keywords :
Gaussian processes; covariance matrices; learning (artificial intelligence); mixture models; signal detection; speech recognition; voice communication; GMM; Gaussian mixture model; MFCC matrix; Mel-frequency cepstrum coefficients; covariance matrix; diffusion kernels; labeling data; moving window; sample cloud; signal frame; similarity matrix; spectral clustering; statistical model; supervised learning algorithm; transient noise; voice activity detection; Clustering algorithms; Covariance matrices; Signal to noise ratio; Speech; Training; Transient analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical & Electronics Engineers in Israel (IEEEI), 2014 IEEE 28th Convention of
Conference_Location :
Eilat
Print_ISBN :
978-1-4799-5987-7
Type :
conf
DOI :
10.1109/EEEI.2014.7005743
Filename :
7005743
Link To Document :
بازگشت