• DocumentCode
    1221848
  • Title

    A Multimodal and Multilevel Ranking Scheme for Large-Scale Video Retrieval

  • Author

    Hoi, Steven C H ; Lyu, Michael R.

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
  • Volume
    10
  • Issue
    4
  • fYear
    2008
  • fDate
    6/1/2008 12:00:00 AM
  • Firstpage
    607
  • Lastpage
    619
  • Abstract
    A critical issue of large-scale multimedia retrieval is how to develop an effective framework for ranking the search results. This problem is particularly challenging for content-based video retrieval due to some issues such as short text queries, insufficient sample learning, fusion of multimodal contents, and large-scale learning with huge media data. In this paper, we propose a novel multimodal and multilevel (MMML) ranking framework to attack the challenging ranking problem of content-based video retrieval. We represent the video retrieval task by graphs and suggest a graph based semi-supervised ranking (SSR) scheme, which can learn with small samples effectively and integrate multimodal resources for ranking smoothly. To make the semi-supervised ranking solution practical for large-scale retrieval tasks, we propose a multilevel ranking framework that unifies several different ranking approaches in a cascade fashion. We have conducted empirical evaluations of our proposed solution for automatic search tasks on the benchmark testbed of TRECVID2005. The promising empirical results show that our ranking solutions are effective and very competitive with the state-of-the-art solutions in the TRECVID evaluations.
  • Keywords
    content-based retrieval; learning (artificial intelligence); multimedia systems; video retrieval; content-based video retrieval; large-scale learning; large-scale multimedia retrieval; large-scale video retrieval; multilevel ranking scheme; multimodal ranking scheme; semisupervised ranking scheme; short text queries; Acoustical engineering; Benchmark testing; Content based retrieval; Image retrieval; Information retrieval; Large-scale systems; Optical character recognition software; Search engines; Video sharing; Video signal processing; Content-based video retrieval; graph representation; multilevel ranking; multimedia retrieval; multimodal fusion; semi-supervised ranking; support vector machines;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2008.921735
  • Filename
    4523959