DocumentCode :
1274013
Title :
Signal-Driven Window-Length Adaptation for Sinusoid Detection in Polyphonic Music
Author :
Rao, Vishweshwara ; Gaddipati, Pradeep ; Rao, Preeti
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Bombay, Mumbai, India
Volume :
20
Issue :
1
fYear :
2012
Firstpage :
342
Lastpage :
348
Abstract :
Audio processing applications that use short-time signal analysis techniques typically utilize fixed window duration single- or multi-resolution analyses. However, different real-world signal conditions such as polyphony and non-stationarity, manifested as musical accompaniment and pitch-modulations, respectively, in the context of music content analysis, require varying data window lengths for reliable processing. In this paper, we investigate the use of signal sparsity for adapting analysis window lengths. Adaptive-window analysis driven by different measures of sparsity applied to the local spectrum, such as kurtosis and Gini index, is evaluated and shown to be superior to fixed-window analysis in terms of sinusoid detection and frequency estimation for simulated and real signals. A window main-lobe matching method for sinusoid detection is also shown to be more robust to signal conditions such as polyphony and frequency modulation relative to other methods.
Keywords :
audio signal processing; frequency modulation; music; signal detection; signal resolution; Gini index; adaptive-window analysis; audio processing application; frequency estimation; frequency modulation; kurtosis index; multiresolution analysis; music content analysis; musical accompaniment; pitch modulation; polyphonic music; real-world signal condition; reliable processing; short-time signal analysis technique; signal sparsity; signal-driven window-length adaptation; sinusoid detection; window main-lobe matching method; Frequency estimation; Frequency modulation; Harmonic analysis; Interference; Noise; Time frequency analysis; Signal sparsity; sinusoid detection; window adaptation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2162319
Filename :
5955094
Link To Document :
بازگشت