Title :
Decomposing the video editing structure of a talk-show using nonnegative matrix factorization
Author :
Essid, Slim ; Fevotte, Cedric
Author_Institution :
Inst. Telecom, Telecom ParisTech, Paris, France
Date :
Sept. 30 2012-Oct. 3 2012
Abstract :
We introduce a novel video structuring scheme that applies nonnegative matrix factorization (NMF) to count data (a bag-of-features representation of the visual stream) to jointly discover latent structuring patterns and their activations in time. Our NMF variant uses the Kullback-Leibler divergence as its cost function and imposes a temporal smoothness constraint on the activations. The resulting problem is solved by a majorization-minimization technique. The method is shown to successfully decompose the high-level editing structure of talk-shows. It is evaluated on a challenging database of TV political-debate programs and is found to clearly outperform a reference HMM method.
Keywords :
image representation; matrix decomposition; minimisation; video signal processing; Kullback-Leibler divergence; NMF variant; TV political-debate programs; cost function; count data; features representation; latent structuring patterns; majorization-minimization technique; nonnegative matrix factorization; reference HMM method; talk-show; temporal smoothness constraint; video editing structure decomposition; visual stream; Abstracts; Hafnium compounds; Histograms; Indexes; Video structuring; bag of features; indexing; machine learning; matrix factorization; unsupervised classification;
Conference_Title :
Image Processing (ICIP), 2012 19th IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4673-2534-9
Electronic_ISBN :
1522-4880
DOI :
10.1109/ICIP.2012.6467557
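Illustration :
The abstract describes factorizing a nonnegative count matrix under the Kullback-Leibler divergence. The sketch below is not the authors' implementation: it shows only the standard multiplicative updates for plain KL-NMF, which are themselves a majorization-minimization scheme, and it omits the temporal smoothness penalty entirely. The names `kl_nmf` and `kl_divergence`, the matrix layout (features in rows, time frames in columns), and all parameter values are assumptions made for illustration.

```python
import numpy as np


def kl_divergence(V, WH, eps=1e-12):
    """Generalized KL divergence D(V || WH), summed over all entries."""
    V = np.asarray(V, dtype=float)
    return float(np.sum(V * np.log((V + eps) / (WH + eps)) - V + WH))


def kl_nmf(V, rank, n_iter=200, seed=0, eps=1e-12):
    """Plain KL-NMF via multiplicative updates (no smoothness term).

    V    : (F, N) nonnegative matrix, e.g. bag-of-features counts
           with F visual words and N time frames (assumed layout).
    rank : number of latent patterns K.
    Returns W (F, K) patterns, H (K, N) activations, and the cost
    recorded after each iteration.
    """
    rng = np.random.default_rng(seed)
    V = np.asarray(V, dtype=float)
    F, N = V.shape
    # Strictly positive random initialization.
    W = rng.random((F, rank)) + 0.1
    H = rng.random((rank, N)) + 0.1
    costs = []
    for _ in range(n_iter):
        # Update activations H with W fixed.
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        # Update patterns W with H fixed.
        WH = W @ H + eps
        W *= ((V / WH) @ H.T) / (H.sum(axis=1)[None, :] + eps)
        costs.append(kl_divergence(V, W @ H))
    return W, H, costs


if __name__ == "__main__":
    # Synthetic count data standing in for per-frame feature histograms.
    rng = np.random.default_rng(1)
    V = rng.poisson(5.0, size=(20, 30))
    W, H, costs = kl_nmf(V, rank=3, n_iter=50)
    print(W.shape, H.shape, costs[0], costs[-1])
```

In this layout each column of `W` would play the role of a latent structuring pattern and each row of `H` its activation over time. A smoothness constraint such as the one the abstract mentions would add a penalty coupling adjacent columns of `H` and would change the `H` update accordingly.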