• DocumentCode
    67888
  • Title
    On-Line Melody Extraction From Polyphonic Audio Using Harmonic Cluster Tracking
  • Author
    Arora, Vipul; Behera, Laxmidhar
  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, Kanpur, India
  • Volume
    21
  • Issue
    3
  • fYear
    2013
  • fDate
    Mar-13
  • Firstpage
    520
  • Lastpage
    530
  • Abstract
    Extraction of the predominant melody from musical performances containing various instruments is one of the most challenging tasks in the field of music information retrieval and computational musicology. This paper presents a novel framework which estimates the predominant vocal melody in real time by tracking various sources with the help of harmonic clusters (combs) and then determining the predominant vocal source by using the harmonic strength of the source. The novel on-line harmonic comb tracking approach complies with both structural and temporal constraints simultaneously. It relies upon the strong higher harmonics for robustness against distortion of the first harmonic due to low-frequency accompaniments, in contrast to the existing methods, which track the pitch values. The predominant vocal source identification depends upon the novel idea of source-dependent filtering of the recognition score, which allows the algorithm to be implemented on-line. The proposed method, although on-line, is shown to significantly outperform our implementation of a state-of-the-art offline method for vocal melody extraction. Evaluations also show the reduction in octave error and the effectiveness of the novel score filtering technique in enhancing the performance.
  • Keywords
    acoustic signal processing; harmonic distortion; identification; information filtering; music; tracking; computational musicology; first harmonic distortion; harmonic cluster tracking; harmonic clusters; low frequency accompaniments; music information retrieval; on-line harmonic comb tracking approach; online melody extraction; pitch values; polyphonic audio; predominant melody extraction; predominant vocal melody estimation; predominant vocal source; predominant vocal source identification; recognition score; source dependant filtering; source harmonic strength; state-of-the-art offline method; temporal constraints simultaneously; vocal melody extraction; Estimation; Harmonic analysis; Hidden Markov models; Instruments; Power harmonic filters; Real-time systems; Speech; Music information retrieval; pitch tracking; spectral harmonics; vocal melody estimation
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2012.2227731
  • Filename
    6353544