• DocumentCode
    1909429
  • Title

    Exploiting Semantic Content for Singing Voice Detection

  • Author

    Leonidas, Ioannidis ; Rouas, Jean-Luc

  • Author_Institution
    LaBRI, Univ. Bordeaux, Talence, France
  • fYear
    2012
  • fDate
    19-21 Sept. 2012
  • Firstpage
    134
  • Lastpage
    137
  • Abstract
    In this paper we propose a method for singing voice detection in popular music recordings. The method is based on statistical learning of spectral features extracted from the audio tracks. In our method we use Mel Frequency Cepstrum Coefficients (MFCC) to train two Gaussian Mixture Models (GMM). Special attention is brought to our novel approach for smoothing the errors produced by the automatic classification by exploiting semantic content from the songs, which will significantly boost the overall performance of the system.
  • Keywords
    Gaussian processes; audio signal processing; feature extraction; learning (artificial intelligence); music; signal classification; signal detection; speech processing; statistical analysis; GMM; Gaussian mixture models; MFCC; Mel frequency cepstrum coefficients; audio tracks; automatic classification; music recordings; semantic content exploitation; singing voice detection; spectral features extraction; statistical learning; Accuracy; Feature extraction; Hidden Markov models; Instruments; Semantics; Smoothing methods; Training; Singing Voice Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2012 IEEE Sixth International Conference on
  • Conference_Location
    Palermo
  • Print_ISBN
    978-1-4673-4433-3
  • Type

    conf

  • DOI
    10.1109/ICSC.2012.18
  • Filename
    6337096