Title :
Classifying NMF components based on vector similarity for speech and music separation
Author :
Nengheng Zheng ; Yi Cai ; Xia Li ; Tan Lee
Author_Institution :
Shenzhen Key Lab. of Telecommun. & Inf. Process., Shenzhen Univ., Shenzhen, China
Abstract :
This paper presents a nonnegative matrix factorization (NMF) components classification algorithm for single-channel speech and music separation. Music only and music-speech mixture segments are firstly classified from the audio stream via audio segmentation technique. Then NMF is applied for signal decomposition. The basis matrix of the NMF output of music only segments provides the prior knowledge of music component in the mixture signal. NMF components, i.e. basis and gain vectors of the mixture signal are classified into speech and music based on the vector similarity between each basis vector and the priori music basis matrix. A set of SNR-dependent thresholding coefficients are empirically determined for the classification. The separated speech and music signals are reconstructed from the respectively classified NMF components. Experimental results show the effectiveness of the proposed method for speech and music separation, and its superior performance over the traditional NMF-based separation methods.
Keywords :
audio signal processing; matrix decomposition; music; signal classification; signal reconstruction; source separation; speech processing; vectors; SNR-dependent thresholding coefficient; audio segmentation; audio stream; basis vector; mixture signal classification; music only segment classification; music separation; music signal reconstruction; music-speech mixture segment classification; nonnegative matrix factorization component classification; priori music basis matrix; separated speech reconstruction; signal decomposition; single-channel speech separation; vector similarity; Matrix decomposition; Multiple signal classification; Signal to noise ratio; Source separation; Spectrogram; Speech; Support vector machine classification;
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8