DocumentCode
3403559
Title
Detecting music in ambient audio by long-window autocorrelation
Author
Lee, Keansub ; Ellis, Daniel P W
Author_Institution
Dept. of Electr. Eng., Columbia Univ., New York, NY
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
9
Lastpage
12
Abstract
We address the problem of detecting music in the background of ambient real-world audio recordings such as the sound track of consumer-shot video. Such material may contain high levels of noises, and we seek to devise features that will reveal music content in such circumstances. Sustained, steady musical pitches show significant, structured autocorrelation at when calculated over windows of hundreds of milliseconds, where autocorrelation of aperiodic noise has become negligible at higher-lag points if a signal is whitened by LPC. Using such features, further compensated by their long-term average to remove the effect of stationary periodic noise, we produce GMM and SVM based classifiers with high performance compared with previous approaches, as verified on a corpus of real consumer video.
Keywords
Gaussian processes; audio recording; audio signal processing; correlation methods; music; support vector machines; GMM; LPC; SVM; aperiodic noise; audio recording; long-window autocorrelation; music detection; Acoustic noise; Acoustic signal detection; Audio recording; Autocorrelation; Data mining; Multiple signal classification; Music; Noise shaping; Rhythm; Video recording; Acoustic signal detection; Correlation; Music; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4517533
Filename
4517533
Link To Document