• DocumentCode
    3413637
  • Title

    On fusion of timbre-motivated features for singing voice detection and singer identification

  • Author

    Nwe, Tin Lay ; Li, Haizhou

  • Author_Institution
    Inst. for Infocomm Res., Singapore
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    2225
  • Lastpage
    2228
  • Abstract
    Timbre is the quality of sound which allows the ear to distinguish between musical sounds. In this paper, we study timbre effects in identification of singing voice segments in popular songs. Firstly, we identify between singing voice and instrumental segments in a song. Then, singing voice segments are further categorized according to their singer identity. Timbre-motivated effects are formulated by fusion of systems that use the features from vibrato, harmonic information and other features extracted using Mel and Log frequency scale filter banks. Statistical methods to select singing voice segments with high confidence measure are proposed for better performance in singer identification process. The experiments conducted on a database of 214 popular songs show that the proposed approach is effective.
  • Keywords
    feature extraction; music; Log frequency scale filter bank; Mel frequency scale filter bank; musical sound; singer identification process; statistical method; timbre-motivated feature; voice detection; Data mining; Ear; Feature extraction; Filter bank; Frequency; Instruments; Power harmonic filters; Spatial databases; Statistical analysis; Timbre; Singing voice; harmonic; singer identification; timbre; vibrato;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518087
  • Filename
    4518087