DocumentCode :
67888
Title :
On-Line Melody Extraction From Polyphonic Audio Using Harmonic Cluster Tracking
Author :
Arora, Vipul ; Behera, Laxmidhar
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, Kanpur, India
Volume :
21
Issue :
3
fYear :
2013
fDate :
Mar-13
Firstpage :
520
Lastpage :
530
Abstract :
Extraction of predominant melody from the musical performances containing various instruments is one of the most challenging task in the field of music information retrieval and computational musicology. This paper presents a novel framework which estimates predominant vocal melody in real-time by tracking various sources with the help of harmonic clusters (combs) and then determining the predominant vocal source by using the harmonic strength of the source. The novel on-line harmonic comb tracking approach complies with both structural as well as temporal constraints simultaneously. It relies upon the strong higher harmonics for robustness against distortion of the first harmonic due to low frequency accompaniments, in contrast to the existing methods which track the pitch values. The predominant vocal source identification depends upon the novel idea of source dependant filtering of recognition score, which allows the algorithm to be implemented on-line. The proposed method, although on-line, is shown to significantly outperform our implementation of a state-of-the-art offline method for vocal melody extraction. Evaluations also show the reduction in octave error and the effectiveness of novel score filtering technique in enhancing the performance.
Keywords :
acoustic signal processing; harmonic distortion; identification; information filtering; music; tracking; computational musicology; first harmonic distortion; harmonic cluster tracking; harmonic clusters; low frequency accompaniments; music information retrieval; on-line harmonic comb tracking approach; online melody extraction; pitch values; polyphonic audio; predominant melody extraction; predominant vocal melody estimation; predominant vocal source; predominant vocal source identification; recognition score; source dependant filtering; source harmonic strength; state-of-the-art offline method; temporal constraints simultaneously; vocal melody extraction; Estimation; Harmonic analysis; Hidden Markov models; Instruments; Power harmonic filters; Real-time systems; Speech; Music information retrieval; pitch tracking; spectral Harmonics; vocal melody estimation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2227731
Filename :
6353544
Link To Document :
بازگشت