• DocumentCode
    3407110
  • Title
    Rectification of figures and photos in document images using bounding box interface
  • Author
    Koo, Hyung Il ; Cho, Nam Ik
  • Author_Institution
    Dept. of EECS, Seoul Nat. Univ., Seoul, South Korea
  • fYear
    2010
  • fDate
    13-18 June 2010
  • Firstpage
    3121
  • Lastpage
    3128
  • Abstract
    This paper proposes an algorithm for the segmentation and rectification of figures and photos in document images. The algorithm requires just a rough user-provided bounding box for the objects in a single-view image. On receiving the user's bounding box, it takes about 1-2 seconds to segment and rectify mega-pixel sized figures. The main feature of the algorithm is a novel segmentation method that exploits the properties of printed figures. Specifically, a set of boundary candidates is generated using the properties, and the optimal boundary in the set is found by using an alternating optimization scheme. This segmentation result is further refined so that it is well localized to the true boundary. In addition to our segmentation method, we also propose a new boundary interpolation method for the rectification of segmented figures. The method improves the quality of output by largely removing perspective distortions compared to conventional boundary interpolation methods. Experimental results on a variety of images show that the method is efficient, robust, and easy to use.
  • Keywords
    document image processing; image segmentation; interpolation; optimisation; alternating optimization scheme; boundary interpolation method; bounding box interface; document images; figure rectification; image segmentation; perspective distortion removal; photo rectification; Books; Digital cameras; Hardware; Image quality; Image reconstruction; Image segmentation; Interpolation; Lighting; Optical character recognition software; Robustness
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-6984-0
  • Type
    conf
  • DOI
    10.1109/CVPR.2010.5540071
  • Filename
    5540071