• DocumentCode
    2012332
  • Title

    Learning Domain-Specific Feature Descriptors for Document Images

  • Author

    Ramakrishnan, Kandan ; Bart, Evgeniy

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Minnesota, Minneapolis, MN, USA
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    415
  • Lastpage
    418
  • Abstract
    Many machine learning algorithms rely on feature descriptors to access information about image appearance. Using an appropriate descriptor is therefore crucial for the algorithm to succeed. Although domain- and task-specific feature descriptors may result in excellent performance, they currently have to be hand-crafted, a difficult and time-consuming process. In contrast, general-purpose descriptors (such as SIFT) are easy to apply and have proved successful for a variety of tasks, including classification, segmentation, and clustering. Unfortunately, most general-purpose feature descriptors are targeted at natural images and may perform poorly in document analysis tasks. In this paper, we propose a method for automatically learning feature descriptors tuned to a given image domain. The method works by first extracting the independent components of the images, and then building a descriptor by pooling these components over multiple overlapping regions. We test the proposed method on several document analysis tasks and several datasets, and show that it outperforms existing general-purpose feature descriptors.
  • Keywords
    data analysis; document image processing; feature extraction; image classification; image segmentation; learning (artificial intelligence); pattern clustering; SIFT descriptor; classification task; clustering task; document analysis task; document image; domain-specific feature descriptor; general-purpose descriptor; image appearance; machine learning algorithm; scale invariant feature transform; segmentation task; task-specific feature descriptor; Detectors; Dictionaries; Feature extraction; Image edge detection; Optical character recognition software; Text analysis; Visualization; Feature descriptors; classification; feature learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Cost, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type

    conf

  • DOI
    10.1109/DAS.2012.49
  • Filename
    6195405