• DocumentCode
    1440929
  • Title

    Vocal Melody Extraction in the Presence of Pitched Accompaniment in Polyphonic Music

  • Author

    Rao, Vishweshwara ; Rao, Preeti

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
  • Volume
    18
  • Issue
    8
  • fYear
    2010
  • Firstpage
    2145
  • Lastpage
    2154
  • Abstract
    Melody extraction algorithms for single-channel polyphonic music typically rely on the salience of the lead melodic instrument, considered here to be the singing voice. However the simultaneous presence of one or more pitched instruments in the polyphony can cause such a predominant-F0 tracker to switch between tracking the pitch of the voice and that of an instrument of comparable strength, resulting in reduced voice-pitch detection accuracy. We propose a system that, in addition to biasing the salience measure in favor of singing voice characteristics, acknowledges that the voice may not dominate the polyphony at all instants and therefore tracks an additional pitch to better deal with the potential presence of locally dominant pitched accompaniment. A feature based on the temporal instability of voice harmonics is used to finally identify the voice pitch. The proposed system is evaluated on test data that is representative of polyphonic music with strong pitched accompaniment. Results show that the proposed system is indeed able to recover melodic information lost to its single-pitch tracking counterpart, and also outperforms another state-of-the-art melody extraction system designed for polyphonic music.
  • Keywords
    music; speech processing; lead melodic instrument; melody extraction algorithms; pitched accompaniment; singing voice; single-channel polyphonic music; single-pitch tracking counterpart; vocal melody extraction; voice-pitch detection; Automatic control; Data mining; Frequency estimation; Humans; Instruments; Music information retrieval; Robustness; Signal representations; Switches; System testing; Fundamental frequency estimation; music information retrieval (MIR); music transcription; predominant pitch detection;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2042124
  • Filename
    5431024