• DocumentCode
    1027169
  • Title

    Real-Time Computerized Annotation of Pictures

  • Author

    Li, Jia ; Wang, James Z.

  • Author_Institution
    Dept. of Stat., Pennsylvania State Univ., University Park, PA
  • Volume
    30
  • Issue
    6
  • fYear
    2008
  • fDate
    6/1/2008 12:00:00 AM
  • Firstpage
    985
  • Lastpage
    1002
  • Abstract
    Developing effective methods for automated annotation of digital pictures continues to challenge computer scientists. The capability of annotating pictures by computers can lead to breakthroughs in a wide range of applications, including Web image search, online picture-sharing communities, and scientific experiments. In this work, the authors developed new optimization and estimation techniques to address two fundamental problems in machine learning. These new techniques serve as the basis for the automatic linguistic indexing of pictures - real time (ALIPR) system of fully automatic and high-speed annotation for online pictures. In particular, the D2-clustering method, in the same spirit as K-Means for vectors, is developed to group objects represented by bags of weighted vectors. Moreover, a generalized mixture modeling technique (kernel smoothing as a special case) for nonvector data is developed using the novel concept of hypothetical local mapping (HLM). ALIPR has been tested by thousands of pictures from an Internet photo-sharing site, unrelated to the source of those pictures used in the training process. Its performance has also been studied at an online demonstration site, where arbitrary users provide pictures of their choices and indicate the correctness of each annotation word. The experimental results show that a single computer processor can suggest annotation terms in real time and with good accuracy.
  • Keywords
    Internet; image processing; indexing; learning (artificial intelligence); multimedia systems; ALIPR system; D2 clustering; Internet photo sharing; automatic linguistic indexing; digital picture annotation; generalized mixture modeling; hypothetical local mapping; kernel smoothing; machine learning; online pictures; realtime computerized picture annotation; Algorithms; Image/video retrieval; Indexing methods; Multimedia databases; Statistical computing; Algorithms; Artificial Intelligence; Computer Systems; Database Management Systems; Databases, Factual; Documentation; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.70847
  • Filename
    4420087