• DocumentCode
    80424
  • Title

    A Robust and Scalable Visual Category and Action Recognition System Using Kernel Discriminant Analysis With Spectral Regression

  • Author

    Tahir, Muhammad Atif ; Fei Yan ; Koniusz, Peter ; Awais, Muhammad ; Barnard, Mark ; Mikolajczyk, Krystian ; Bouridane, Ahmed ; Kittler, Josef

  • Author_Institution
    Coll. of Comput. & Inf. Sci., Al-Imam Mohammad Ibn Saud Islamic Univ., Riyadh, Saudi Arabia
  • Volume
    15
  • Issue
    7
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    1653
  • Lastpage
    1664
  • Abstract
    Visual concept detection and action recognition are one of the most important tasks in content-based multimedia information retrieval (CBMIR) technology. It aims at annotating images using a vocabulary defined by a set of concepts of interest including scenes types (mountains, snow, etc.) or human actions (phoning, playing instrument). This paper describes our system in the ImageCLEF@ICPR10, Pascal VOC 08 Visual Concept Detection and Pascal VOC 10 Action Recognition Challenges. The proposed system ranked first in these large-scale tasks when evaluated independently by the organizers. The proposed system involves state-of-the-art local descriptor computation, vector quantization via clustering, structured scene or object representation via localized histograms of vector codes, similarity measure for kernel construction and classifier learning. The main novelty is the classifier-level and kernel-level fusion using Kernel Discriminant Analysis and Spectral Regression (SR-KDA) with RBF Chi-Squared kernels obtained from various image descriptors. The distinctiveness of the proposed method is also assessed experimentally using a video benchmark: the Mediamill Challenge along with benchmarks from ImageCLEF@ICPR10, Pascal VOC 10 and Pascal VOC 08. From the experimental results, it can be derived that the presented system consistently yields significant performance gains when compared with the state-of-the art methods. The other strong point is the introduction of SR-KDA in the classification stage where the time complexity scales linearly with respect to the number of concepts and the main computational complexity is independent of the number of categories.
  • Keywords
    computational complexity; content-based retrieval; gesture recognition; image classification; image reconstruction; image retrieval; learning (artificial intelligence); multimedia computing; radial basis function networks; regression analysis; vector quantisation; CBMIR technology; ImageCLEF@ICPR10; Pascal VOC 08 visual concept detection; Pascal VOC 10 action recognition challenges; RBF Chi-squared kernels; SR-KDA; action recognition system; classifier learning; classifier-level fusion; computational complexity; content-based multimedia information retrieval technology; descriptor computation; kernel construction; kernel discriminant analysis; kernel discriminant analysis and spectral regression; kernel-level fusion; localized histograms; object representation; spectral regression; structured scene; time complexity; vector quantization; visual category system; vocabulary; Action recognition from still images; SIFT; kernel discriminant analysis; visual category recognition;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2013.2264927
  • Filename
    6521396