DocumentCode :
3153207
Title :
Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization
Author :
Lin, Kai-Hsiang ; Zhuang, Xiaodan ; Goudeseune, Camille ; King, Sarah ; Hasegawa-Johnson, Mark ; Huang, Thomas S.
Author_Institution :
Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
2277
Lastpage :
2280
Abstract :
We propose a saliency-maximized audio spectrogram as a representation that lets human analysts quickly search for and detect events in audio recordings. By rendering target events as visually salient patterns, this representation minimizes the time and effort needed to examine a recording. In particular, we propose a transformation of a conventional spectrogram that maximizes the mutual information between the spectrograms of isolated target events and the estimated saliency of the overall visual representation. When subjects are shown spectrograms that are saliency-maximized, they perform significantly better in a 1/10-real-time acoustic event detection task.
Keywords :
audio recording; audio signal processing; audio-visual systems; audio recordings; human acoustic event detection; human analyst; realtime acoustic event detection task; rendering; saliency maximized audio spectrogram; saliency maximized audio visualization; salient pattern; visual representation; Acoustics; Audio recording; Event detection; Humans; Spectrogram; Speech; Visualization; acoustic event detection; audio visualization; visual saliency;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288368
Filename :
6288368
Link To Document :
بازگشت