DocumentCode :
3716293
Title :
Audio phrases for audio event recognition
Author :
Huy Phan;Lars Hertel;Marco Maass;Radoslaw Mazur;Alfred Mertins
Author_Institution :
Institute for Signal Processing, University of Lü
fYear :
2015
Firstpage :
2546
Lastpage :
2550
Abstract :
The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a local feature of an audio signal is matched to a code word according to a learned codebook. The signal is then represented by frequencies of the matched code words on the whole signal. We present in this paper an improved model based on the idea of audio phrases which are sequences of multiple audio words. By using audio phrases, we are able to capture the relationship between the isolated audio words and produce more semantic descriptors. Furthermore, we also propose an efficient approach to learn a compact codebook in a discriminative manner to deal with high-dimensionality of bag-of-audio-phrases representations. Experiments on the Freiburg-106 dataset show that the recognition performance with our proposed bag-of-audio-phrases descriptor outperforms not only the baselines but also the state-of-the-art results on the dataset.
Keywords :
"Signal processing","Kernel","Europe","Training data","Clustering methods","Histograms","Training"
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN :
2076-1465
Type :
conf
DOI :
10.1109/EUSIPCO.2015.7362844
Filename :
7362844
Link To Document :
بازگشت