DocumentCode
3413637
Title
On fusion of timbre-motivated features for singing voice detection and singer identification
Author
Nwe, Tin Lay ; Li, Haizhou
Author_Institution
Inst. for Infocomm Res., Singapore
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
2225
Lastpage
2228
Abstract
Timbre is the quality of sound which allows the ear to distinguish between musical sounds. In this paper, we study timbre effects in identification of singing voice segments in popular songs. Firstly, we identify between singing voice and instrumental segments in a song. Then, singing voice segments are further categorized according to their singer identity. Timbre-motivated effects are formulated by fusion of systems that use the features from vibrato, harmonic information and other features extracted using Mel and Log frequency scale filter banks. Statistical methods to select singing voice segments with high confidence measure are proposed for better performance in singer identification process. The experiments conducted on a database of 214 popular songs show that the proposed approach is effective.
Keywords
feature extraction; music; Log frequency scale filter bank; Mel frequency scale filter bank; musical sound; singer identification process; statistical method; timbre-motivated feature; voice detection; Data mining; Ear; Feature extraction; Filter bank; Frequency; Instruments; Power harmonic filters; Spatial databases; Statistical analysis; Timbre; Singing voice; harmonic; singer identification; timbre; vibrato;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518087
Filename
4518087
Link To Document