• DocumentCode
    3207451
  • Title
    Feature tracking and object recognition on a hand-held
  • Author
    Lee, Taehee ; Soatto, Stefano
  • Author_Institution
    Univ. of California, Los Angeles, CA, USA
  • fYear
    2010
  • fDate
    13-16 Oct. 2010
  • Firstpage
    306
  • Lastpage
    306
  • Abstract
    We demonstrate a visual recognition system operating on a hand-held device, built on an efficient and robust feature tracker and an object recognition mechanism that can be used for interactive mobile applications. In our recognition system, corner features are detected from captured video frames in a multi-scale image pyramid and are tracked efficiently between consecutive frames. To perform object recognition, local descriptors are calculated on the tracked features and quantized using a vocabulary tree. For each object, a bag-of-words model is learned from multiple views. The learned objects are recognized by computing a ranking score for the set of features in a single video frame. Our feature tracking algorithm and local descriptors differ from the pyramidal Lucas-Kanade algorithm and the SIFT descriptor, respectively, while improving efficiency and accuracy. For our implementation on a mobile phone, we used an iPhone 3GS with a 600 MHz ARM CPU. Video frames are captured from the camera preview screen at 15 frames per second using the public API. Object recognition on the phone runs at around 7 frames per second, including feature tracking and descriptor calculation.
  • Keywords
    computer vision; feature extraction; image recognition; mobile handsets; object recognition; tracking; video cameras; video signal processing; vocabulary; 600MHz ARM chip CPU; Lucas-Kanade algorithm; SIFT descriptor; bag of words model; captured video frame; corner feature; descriptor calculation; feature tracking; handheld device; iPhone 3GS; interactive mobile application; learned object; mobile phone; multiscale image pyramid; object recognition; public API; visual recognition system; vocabulary tree;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Mixed and Augmented Reality (ISMAR), 2010 9th IEEE International Symposium on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4244-9343-2
  • Electronic_ISBN
    978-1-4244-9345-6
  • Type
    conf
  • DOI
    10.1109/ISMAR.2010.5643615
  • Filename
    5643615
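
The recognition step described in the abstract (quantize local descriptors into visual words, learn one bag-of-words model per object from multiple views, then rank objects for the features of a single frame) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the descriptor, the vocabulary tree layout, or the scoring function. The sketch assumes a flat vocabulary searched by brute force in place of a vocabulary tree, smoothed TF-IDF weighting, and cosine similarity as the ranking score. All function names are hypothetical.

```python
import math
from collections import Counter


def quantize(descriptor, vocabulary):
    """Return the index of the nearest visual word.

    Assumption: a flat brute-force search stands in for the vocabulary
    tree mentioned in the abstract.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(vocabulary)),
               key=lambda i: sq_dist(descriptor, vocabulary[i]))


def normalize(histogram, idf):
    """Apply IDF weights to word counts and scale to unit L2 norm."""
    weighted = {w: c * idf[w] for w, c in histogram.items()}
    norm = math.sqrt(sum(v * v for v in weighted.values())) or 1.0
    return {w: v / norm for w, v in weighted.items()}


def learn_object_models(training_views, vocabulary):
    """Build one bag-of-words model per object.

    training_views maps an object name to the descriptors pooled from
    all of its training views.
    """
    histograms = {
        name: Counter(quantize(d, vocabulary) for d in descriptors)
        for name, descriptors in training_views.items()
    }
    n_objects = len(histograms)
    idf = {}
    for word in range(len(vocabulary)):
        # Number of objects in which this visual word occurs.
        df = sum(1 for h in histograms.values() if h[word] > 0)
        # Assumption: smoothed IDF, so unseen words do not divide by zero.
        idf[word] = math.log((1 + n_objects) / (1 + df)) + 1.0
    models = {name: normalize(h, idf) for name, h in histograms.items()}
    return models, idf


def rank_objects(frame_descriptors, models, idf, vocabulary):
    """Score every learned object against one frame, best match first."""
    query = normalize(
        Counter(quantize(d, vocabulary) for d in frame_descriptors), idf)
    scores = {
        name: sum(query.get(w, 0.0) * v for w, v in model.items())
        for name, model in models.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `learn_object_models` once on the training descriptors and then `rank_objects` on each incoming frame mirrors the learn-then-recognize split in the abstract. A hierarchical vocabulary tree would replace `quantize` to keep the per-descriptor lookup cheap on a phone-class CPU.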