DocumentCode :
2237093
Title :
Separation of Voice and Music by Harmonic Structure Stability Analysis
Author :
Zhang, Yun-Gang ; Zhang, Chang-Shui
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing
fYear :
2005
fDate :
6-6 July 2005
Firstpage :
562
Lastpage :
565
Abstract :
Separation of voice and music is an interesting but difficult problem. It is useful for many other researches such as audio content analysis. In this paper, the difference between voice and music signals is carefully studied. It is proposed that the harmonic structure stability is the key difference between them. A separation algorithm based on this theory is proposed. The main idea is to learn the average harmonic structure of the music, and then separate signals by using it to distinguish voice and music harmonic structures. Experimental results show that the algorithm can separate mixed signals and obtains not only a very high signal-to-noise ratio (SNR) but also a rather good subjective audio quality
Keywords :
audio signal processing; music; source separation; speech processing; audio quality; harmonic structure stability analysis; music signal separation; voice signal separation; Entropy; Frequency; Independent component analysis; Instruments; Multiple signal classification; Music; Power harmonic filters; Speech analysis; Speech enhancement; Stability analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-7803-9331-7
Type :
conf
DOI :
10.1109/ICME.2005.1521485
Filename :
1521485
Link To Document :
بازگشت