DocumentCode
598272
Title
Decomposing the video editing structure of a talk-show using nonnegative matrix factorization
Author
Essid, Slim ; Fevotte, Cedric
Author_Institution
Inst. Telecom, Telecom ParisTech, Paris, France
fYear
2012
fDate
Sept. 30 2012-Oct. 3 2012
Firstpage
3105
Lastpage
3108
Abstract
We introduce a novel video structuring scheme that exploits nonnegative matrix factorization (NMF) on count data (in a bag of features representation of the visual stream) to jointly discover latent structuring patterns and their activations in time. Our NMF variant employs the Kullback-Leibler divergence as a cost function and imposes a temporal smoothness constraint to the activations. It is solved by a majorization-minimization technique. Our method is shown to be successful for decomposing the high-level editing structure of talk-shows. It is evaluated using a challenging database of TV political-debate programs, and found to clearly outperform a reference HMM method.
Keywords
image representation; matrix decomposition; minimisation; video signal processing; Kullback-Leibler divergence; NMF variant; TV political-debate programs; cost function; count data; features representation; latent structuring patterns; majorization-minimization technique; nonnegative matrix factorization; reference HMM method; talk-show; temporal smoothness constraint; video editing structure decomposition; visual stream; Abstracts; Hafnium compounds; Histograms; Indexes; Video structuring; bag of features; indexing; machine learning; matrix factorization; unsupervised classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2012 19th IEEE International Conference on
Conference_Location
Orlando, FL
ISSN
1522-4880
Print_ISBN
978-1-4673-2534-9
Electronic_ISBN
1522-4880
Type
conf
DOI
10.1109/ICIP.2012.6467557
Filename
6467557
Link To Document