Title :
A background music detection method based on robust feature extraction
Author :
Izumitani, Tomonori ; Mukai, Ryo ; Kashino, Kunio
Author_Institution :
NTT Commun. Sci. Labs., Kanagawa
fDate :
March 31 2008-April 4 2008
Abstract :
We propose a music segment detection method for audio signals. Unlike many existing methods, ours specifically focuses on a background-music detection task, that is, detecting music used in background of main sounds. This task is important because music is almost always overlapped by speech or other environmental sounds in visual materials such as TV programs. Our method consists of feature extraction, dimension reduction, and statistical discrimination steps. For each step, we analyzed a set of methods to maximize the detection accuracy. With a simple post processing step, we achieved a framewise error rate as low as 8 % even when the mixed speech was louder than the target music by 10dB.
Keywords :
audio signal processing; feature extraction; music; signal detection; speech processing; statistical analysis; audio signal; background music segment detection method; dimension reduction; robust feature extraction; speech-music discrimination system; statistical discrimination; Feature extraction; Frequency; Hidden Markov models; Indexing; Multimedia systems; Multiple signal classification; Music; Robustness; Speech; TV; Background music detection; Gaussian mixture model; feature selection; k-nearest neighbor method;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517534