• DocumentCode
    3333114
  • Title

    Automatic indexing and content-based retrieval of captioned photographs

  • Author

    Srihari, Rohini K.

  • Author_Institution
    Center of Excellence for Document Anal. & Recognition, State Univ. of New York, Buffalo, NY, USA
  • Volume
    2
  • fYear
    1995
  • fDate
    14-16 Aug 1995
  • Firstpage
    1165
  • Abstract
    This research explores the interaction of textual and photographic information in an integrated text/image database environment. Specifically, We present a content-based retrieval system for captioned group photographs of people (i.e., human faces) where groups can consist of one or more members. By understanding the caption accompanying a picture, we are able to extract information useful in (i) retrieving the picture and (ii) directing an image interpretation system identify relevant objects (in this case, faces) in the picture. For the latter, we incorporate techniques from our ongoing research on photo understanding using accompanying text. Current image-based techniques have limitations; for example, similarity techniques used for retrieving faces will not perform well in group photographs where the locations of faces is not known a priori or where face sizes are small. By exploiting caption information, we assist a face locator in detecting human faces in a photograph and subsequently labelling them. Text-based similarity algorithms have principally relied on statistical techniques to index and classify documents (e.g., vector models). It is necessary to employ natural language processing techniques in order to derive deeper semantics from captions which contain far fewer words than documents. Our approach is unique since it goes beyond a superficial combination of existing text-based and image-based approaches to information retrieval
  • Keywords
    face recognition; indexing; natural languages; query processing; visual databases; automatic indexing; captioned photographs; content-based retrieval; face locator; human faces; information retrieval; natural language processing; text/image database; Content based retrieval; Data mining; Face detection; Humans; Image databases; Image retrieval; Information retrieval; Labeling; Machine assisted indexing; Natural language processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    0-8186-7128-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.1995.602129
  • Filename
    602129