DocumentCode :
3403577
Title :
A background music detection method based on robust feature extraction
Author :
Izumitani, Tomonori ; Mukai, Ryo ; Kashino, Kunio
Author_Institution :
NTT Commun. Sci. Labs., Kanagawa
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
13
Lastpage :
16
Abstract :
We propose a music segment detection method for audio signals. Unlike many existing methods, ours specifically focuses on a background-music detection task, that is, detecting music used in background of main sounds. This task is important because music is almost always overlapped by speech or other environmental sounds in visual materials such as TV programs. Our method consists of feature extraction, dimension reduction, and statistical discrimination steps. For each step, we analyzed a set of methods to maximize the detection accuracy. With a simple post processing step, we achieved a framewise error rate as low as 8 % even when the mixed speech was louder than the target music by 10dB.
Keywords :
audio signal processing; feature extraction; music; signal detection; speech processing; statistical analysis; audio signal; background music segment detection method; dimension reduction; robust feature extraction; speech-music discrimination system; statistical discrimination; Feature extraction; Frequency; Hidden Markov models; Indexing; Multimedia systems; Multiple signal classification; Music; Robustness; Speech; TV; Background music detection; Gaussian mixture model; feature selection; k-nearest neighbor method;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4517534
Filename :
4517534
Link To Document :
بازگشت