• DocumentCode
    3340459
  • Title

    Statistical models for automatic video annotation and retrieval

  • Author

    Lavrenko, V. ; Feng, S.L. ; Manmatha, R.

  • Author_Institution
    Dept. of Comput. Sci., Massachusetts Univ., Amherst, MA, USA
  • Volume
    3
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We apply a continuous relevance model (CRM) to the problem of directly retrieving the visual content of videos using text queries. The model computes a joint probability model for image features and words using a training set of annotated images. The model may then be used to annotate unseen test images. The probabilistic annotations are used for retrieval using text queries. We also propose a modified model - the normalized CRM - which substantially improves performance on a subset of the TREC video dataset.
  • Keywords
    associative processing; content-based retrieval; feature extraction; image retrieval; query formulation; relevance feedback; statistics; video signal processing; annotated image training set; automatic video annotation; content based video retrieval; continuous relevance model; image associated words; image features; image segmentation; normalized CRM; probabilistic annotations; real-valued feature vectors; statistical models; text queries; video retrieval; video visual content; Computer science; Content based retrieval; Face recognition; Image retrieval; Indexing; Information retrieval; Manuals; Object recognition; Speech; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326727
  • Filename
    1326727