• DocumentCode
    635483
  • Title

    Automatic singer identification using missing feature methods

  • Author

    Ying Hu ; Guizhong Liu

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Xi´an Jiaotong Univ., Xi´an, China
  • fYear
    2013
  • fDate
    15-19 July 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This study for singer identification of mono popular music is in two stages. In the first stage, computational auditory scene analysis (CASA) is exploited to segregate singing voice units. For each frame, the estimated binary T-F mask indicates the time-frequency (T-F) units dominated by singing voice which are considered reliable, and other units are unreliable or missing. Thus the spectrum is incomplete. In the second stage, two missing feature methods, reconstruction and marginalization are used to identify the singer by dealing with the incomplete spectrum data. In the reconstruction module, the complete spectrum is first reconstructed and then converted to obtain the Gammatone frequency cepstral coefficients (GFCCs), which are further used to identify the singer. In the marginalization module, the probabilities of the singer´s voice are computed on the basis of only the reliable components. We find that the reconstruction module outperforms the marginalization module, while both modules have significantly good performances, especially at signal-to-accompaniment ratios (SARs) of 0 dB and -3 dB, in contrast to other system.
  • Keywords
    cepstral analysis; feature extraction; music; probability; signal reconstruction; speaker recognition; time-frequency analysis; CASA; GFCC; SAR; automatic singer identification; binary T-F mask; computational auditory scene analysis; gammatone frequency cepstral coefficients; marginalization module; missing feature methods; mono popular music; probability; signal-to-accompaniment ratios; singing voice units; spectrum reconstruction; time-frequency units; Abstracts; Educational institutions; Estimation; Reliability; Singer identification; computational auditory scene analysis (CASA); missing feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2013 IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2013.6607641
  • Filename
    6607641