DocumentCode :
3490005
Title :
Combining Evidences from Mel Cepstral Features and Cepstral Mean Subtracted Features for Singer Identification
Author :
Patil, Hemant A. ; Radadia, Purushotam G. ; Basu, T.K.
Author_Institution :
Dhirubhai Ambani Inst. of Inf. & Commun. Technol. (DA-IICT), Gandhinagar, India
fYear :
2012
fDate :
13-15 Nov. 2012
Firstpage :
145
Lastpage :
148
Abstract :
One of the challenging and difficult problems under the category of Music Information Retrieval (MIR) is to identify a singer of a given song under the strong influence of instrumental sounds. The performance of Singer Identification (SID) system is also severely affected by the quality of recording devices, transmission channels and singing voice(s) of other singer(s). We have proposed a large database of 500 songs, prepared from Hindi Bollywood songs. The State-of-the-art Mel Frequency Cepstral Coefficients (MFCC) are used as feature vectors and 2nd order polynomial classifier is employed as a pattern classifier in our work. We also used Cepstral Mean Subtraction (CMS) based MFCC (CMSMFCC) feature vectors for SID and are found to give better results than the MFCC on proposed database. The SID accuracy for MFCC and CMSMFCC was found to be 75.75% and 84.5%, respectively and Equal Error Rate (EER) for MFCC and CMSMFCC was found to be 9.48% and 8.45%, respectively. While score-level-fusion of both gave improvement in SID accuracy and EER by 10.25% and 2.08% respectively than MFCC alone.
Keywords :
audio databases; cepstral analysis; information retrieval; music; pattern classification; speaker recognition; CMS-based MFCC; CMSMFCC; EER; Hindi Bollywood songs; MFCC; MIR; Mel cepstral features; Mel frequency cepstral coefficients; SID system; cepstral mean subtracted features; cepstral mean subtraction; equal error rate; feature vectors; instrumental sounds; music information retrieval; pattern classifier; recording device quality; second order polynomial classifier; singer identification system; singing voice; songs database; transmission channels; Accuracy; Databases; Instruments; Mel frequency cepstral coefficient; Testing; Training; cepstral mean subtraction; database of Hindi songs; polynomial classifier; singer identification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
Type :
conf
DOI :
10.1109/IALP.2012.33
Filename :
6473717
Link To Document :
بازگشت