DocumentCode :
3348834
Title :
Automatic singer identification based on auditory features
Author :
Wei Cai ; Qiang Li ; Xin Guan
Author_Institution :
Sch. of Electron. & Inf. Eng., Tianjin Univ., Tianjin, China
Volume :
3
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1624
Lastpage :
1628
Abstract :
The paper describes a method of identifying singers´ voice from the monophonic music including sounds of various musical instruments based on auditory features. In this system, there are four problems to solve, vocal segment detection, feature extraction, modeling of the singing voice and identification. For a song to be identified, the vocal/nonvocal segment is detected via a new classifier - Sparse Representation-based Classification (SRC). The feature extraction is of the most importance. Human ear can distinguish among different types of sounds, so auditory features to describe the singer´s voice are important. To describe the auditory features, we calculate features of each frame including Mel-frequency Cepstral Coefficient (MFCC), Liner Prediction Mel-frequency Cepstral Coefficient (LPMCC) and Gammatone Cepstral Coefficient (GTCC). Finally, we introduce the Gaussian Mixture Model (GMM) to model the singers´ voice. This system is demonstrated to improve the performance of an automatic singer identification system in Music Information Retrieval (MIR).
Keywords :
Gaussian processes; cepstral analysis; feature extraction; information retrieval; musical instruments; signal classification; speaker recognition; GMM; Gaussian mixture model; LPMCC; MFCC; auditory feature extraction; automatic singer identification; human ear; liner prediction Mel-frequency cepstral coefficient; monophonic music; music information retrieval; musical instruments; nonvocal segment detection; singer voice identification; singing voice modeling; sparse representation-based classification; vocal segment detection; Feature extraction; Filter banks; Humans; Low pass filters; Mel frequency cepstral coefficient; GMM; Gammatone Cepstrum Coefficient; SRC; auditory feature; singer identification; singing voice detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation (ICNC), 2011 Seventh International Conference on
Conference_Location :
Shanghai
ISSN :
2157-9555
Print_ISBN :
978-1-4244-9950-2
Type :
conf
DOI :
10.1109/ICNC.2011.6022500
Filename :
6022500
Link To Document :
بازگشت