• DocumentCode
    598272
  • Title
    Decomposing the video editing structure of a talk-show using nonnegative matrix factorization
  • Author
    Essid, Slim ; Fevotte, Cedric
  • Author_Institution
    Inst. Telecom, Telecom ParisTech, Paris, France
  • fYear
    2012
  • fDate
    Sept. 30 2012-Oct. 3 2012
  • Firstpage
    3105
  • Lastpage
    3108
  • Abstract
    We introduce a novel video structuring scheme that exploits nonnegative matrix factorization (NMF) on count data (in a bag-of-features representation of the visual stream) to jointly discover latent structuring patterns and their activations in time. Our NMF variant employs the Kullback-Leibler divergence as a cost function and imposes a temporal smoothness constraint on the activations. It is solved by a majorization-minimization technique. Our method is shown to be successful for decomposing the high-level editing structure of talk-shows. It is evaluated using a challenging database of TV political-debate programs, and found to clearly outperform a reference HMM method.
  • Keywords
    image representation; matrix decomposition; minimisation; video signal processing; Kullback-Leibler divergence; NMF variant; TV political-debate programs; cost function; count data; features representation; latent structuring patterns; majorization-minimization technique; nonnegative matrix factorization; reference HMM method; talk-show; temporal smoothness constraint; video editing structure decomposition; visual stream; Abstracts; Hafnium compounds; Histograms; Indexes; Video structuring; bag of features; indexing; machine learning; matrix factorization; unsupervised classification;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Image Processing (ICIP), 2012 19th IEEE International Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4673-2534-9
  • Electronic_ISBN
    1522-4880
  • Type
    conf
  • DOI
    10.1109/ICIP.2012.6467557
  • Filename
    6467557
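
The abstract describes factorizing a nonnegative count matrix under the Kullback-Leibler divergence with a majorization-minimization scheme. As a rough illustration only, the sketch below runs plain KL-NMF with the standard multiplicative updates, which are themselves a majorization-minimization algorithm. It is not the authors' method: the temporal smoothness constraint on the activations is left out because the abstract does not give its form. The names `kl_nmf` and `kl_divergence`, the number of components, and the toy Poisson data are all assumptions made for this example.

```python
import numpy as np


def kl_nmf(V, n_components, n_iter=200, eps=1e-12, seed=0):
    """Plain KL-divergence NMF, V ~= W @ H, via multiplicative updates.

    Assumption: no temporal smoothness penalty is applied to H here.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    # Strictly positive random initialization of patterns and activations.
    W = rng.random((n_features, n_components)) + eps
    H = rng.random((n_components, n_frames)) + eps
    for _ in range(n_iter):
        # Update the activations H with the patterns W held fixed.
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        # Update the patterns W with the activations H held fixed.
        WH = W @ H + eps
        W *= ((V / WH) @ H.T) / (H.sum(axis=1)[None, :] + eps)
    return W, H


def kl_divergence(V, WH, eps=1e-12):
    """Generalized KL divergence D(V || WH), treating 0 * log(0) as 0."""
    V = np.asarray(V, dtype=float)
    WH = WH + eps
    mask = V > 0
    return np.sum(V[mask] * np.log(V[mask] / WH[mask])) - V.sum() + WH.sum()


# Toy count data standing in for bag-of-features histograms
# (rows: visual words, columns: time frames). Not data from the paper.
rng = np.random.default_rng(1)
V = rng.poisson(lam=5.0, size=(20, 60)).astype(float)

W, H = kl_nmf(V, n_components=3)
print("KL divergence after fitting:", kl_divergence(V, W @ H))
```

In this layout each column of `W` would play the role of a latent structuring pattern and each row of `H` its activation over time. A smoothness term on the rows of `H` would change the `H` update, so the rule shown here should not be read as the one used in the paper.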