DocumentCode
1440929
Title
Vocal Melody Extraction in the Presence of Pitched Accompaniment in Polyphonic Music
Author
Rao, Vishweshwara ; Rao, Preeti
Author_Institution
Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
Volume
18
Issue
8
fYear
2010
Firstpage
2145
Lastpage
2154
Abstract
Melody extraction algorithms for single-channel polyphonic music typically rely on the salience of the lead melodic instrument, considered here to be the singing voice. However the simultaneous presence of one or more pitched instruments in the polyphony can cause such a predominant-F0 tracker to switch between tracking the pitch of the voice and that of an instrument of comparable strength, resulting in reduced voice-pitch detection accuracy. We propose a system that, in addition to biasing the salience measure in favor of singing voice characteristics, acknowledges that the voice may not dominate the polyphony at all instants and therefore tracks an additional pitch to better deal with the potential presence of locally dominant pitched accompaniment. A feature based on the temporal instability of voice harmonics is used to finally identify the voice pitch. The proposed system is evaluated on test data that is representative of polyphonic music with strong pitched accompaniment. Results show that the proposed system is indeed able to recover melodic information lost to its single-pitch tracking counterpart, and also outperforms another state-of-the-art melody extraction system designed for polyphonic music.
Keywords
music; speech processing; lead melodic instrument; melody extraction algorithms; pitched accompaniment; singing voice; single-channel polyphonic music; single-pitch tracking counterpart; vocal melody extraction; voice-pitch detection; Automatic control; Data mining; Frequency estimation; Humans; Instruments; Music information retrieval; Robustness; Signal representations; Switches; System testing; Fundamental frequency estimation; music information retrieval (MIR); music transcription; predominant pitch detection;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2042124
Filename
5431024
Link To Document