• DocumentCode
    2148399
  • Title

    Improving melody extraction using Probabilistic Latent Component Analysis

  • Author

    Han, Jinyu ; Chen, Ching-Wei

  • Author_Institution
    Northwestern Univ., Evanston, IL, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    33
  • Lastpage
    36
  • Abstract
    We propose a new approach for automatic melody extraction from polyphonic audio, based on Probabilistic Latent Component Analysis (PLCA). An audio signal is first divided into vocal and non-vocal segments using a trained Gaussian Mixture Model (GMM) classifier. A statistical model of the non-vocal segments of the signal is then learned adaptively from this particular input music by PLCA. This model is then employed to remove the accompaniment from the mixture, leaving mainly the vocal components. The melody line is extracted from the vocal components using an auto-correlation algorithm. Quantitative evaluation shows that the new system performs significantly better than two existing melody extraction algorithms for polyphonic single-channel mixtures.
  • Keywords
    Gaussian processes; audio signal processing; statistical analysis; Gaussian mixture model classifier; audio signal; autocorrelation algorithm; automatic melody extraction; polyphonic audio; polyphonic single-channel mixtures; probabilistic latent component analysis; statistical model; Adaptation models; Estimation; Hidden Markov models; Instruments; Probabilistic logic; Spectrogram; Time frequency analysis; Melody Extraction; Probabilistic Latent Component Analysis; Singing Voice Detection and Extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946321
  • Filename
    5946321