مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic singer identification using missing feature methods

DocumentCode :

635483

Title :

Automatic singer identification using missing feature methods

Author :

Ying Hu ; Guizhong Liu

Author_Institution :

Sch. of Electron. & Inf. Eng., Xi´an Jiaotong Univ., Xi´an, China

fYear :

2013

fDate :

15-19 July 2013

Firstpage :

Lastpage :

Abstract :

This study for singer identification of mono popular music is in two stages. In the first stage, computational auditory scene analysis (CASA) is exploited to segregate singing voice units. For each frame, the estimated binary T-F mask indicates the time-frequency (T-F) units dominated by singing voice which are considered reliable, and other units are unreliable or missing. Thus the spectrum is incomplete. In the second stage, two missing feature methods, reconstruction and marginalization are used to identify the singer by dealing with the incomplete spectrum data. In the reconstruction module, the complete spectrum is first reconstructed and then converted to obtain the Gammatone frequency cepstral coefficients (GFCCs), which are further used to identify the singer. In the marginalization module, the probabilities of the singer´s voice are computed on the basis of only the reliable components. We find that the reconstruction module outperforms the marginalization module, while both modules have significantly good performances, especially at signal-to-accompaniment ratios (SARs) of 0 dB and -3 dB, in contrast to other system.

Keywords :

cepstral analysis; feature extraction; music; probability; signal reconstruction; speaker recognition; time-frequency analysis; CASA; GFCC; SAR; automatic singer identification; binary T-F mask; computational auditory scene analysis; gammatone frequency cepstral coefficients; marginalization module; missing feature methods; mono popular music; probability; signal-to-accompaniment ratios; singing voice units; spectrum reconstruction; time-frequency units; Abstracts; Educational institutions; Estimation; Reliability; Singer identification; computational auditory scene analysis (CASA); missing feature;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo (ICME), 2013 IEEE International Conference on

Conference_Location :

San Jose, CA

ISSN :

1945-7871

Type :

conf

DOI :

10.1109/ICME.2013.6607641

Filename :

6607641

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=635483