DocumentCode :
730365
Title :
Max-product dynamical systems and applications to audio-visual salient event detection in videos
Author :
Maragos, Petros ; Koutras, Petros
Author_Institution :
School of ECE, National Technical University of Athens, Athens, Greece
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
2284
Lastpage :
2288
Abstract :
This paper introduces a theory of max-product systems by analyzing them as discrete-time nonlinear dynamical systems that obey a superposition of weighted-maximum type and evolve on nonlinear spaces, which we call complete weighted lattices. Special cases of such systems have found applications in speech recognition, as weighted finite-state transducers, and in belief propagation on graphical models. Our theoretical approach establishes their representation in state and input-output spaces using monotone lattice operators, derives their state and output responses analytically using nonlinear convolutions, studies their stability, and provides optimal solutions to max-product matrix equations. Further, we apply these systems to extend the Viterbi algorithm in HMMs by adding control inputs, and to model cognitive processes such as detecting audio and visual salient events in multimodal video streams, where they show good performance compared with human attention.
Keywords :
audio-visual systems; convolution; hidden Markov models; matrix algebra; maximum likelihood estimation; speech recognition; transducers; video signal processing; HMM; videos; Viterbi algorithm; audio-visual salient event detection; belief propagation; cognitive process; complete weighted lattice; discrete-time nonlinear dynamical system; graphical model; input-output space; max-product dynamical system; max-product matrix equation; monotone lattice operator; nonlinear convolution; state space; weighted finite-state transducer; integrated circuits; cognitive modeling; event detection; lattices; minimax algebra; multimedia signal processing; nonlinear systems
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178378
Filename :
7178378