• DocumentCode
    3022903
  • Title

    Document image dewarping using robust estimation of curled text lines

  • Author

    Ulges, Adrian ; Lampert, Christoph H. ; Breuel, Thomas M.

  • Author_Institution
    Kaiserslautern Univ., Germany
  • fYear
    2005
  • fDate
    29 Aug.-1 Sept. 2005
  • Firstpage
    1001
  • Abstract
    Digital cameras have become almost ubiquitous and their use for fast and casual capturing of natural images is unchallenged. For making images of documents, however, they have not caught up to flatbed scanners yet, mainly because camera images tend to suffer from distortion due to the perspective and are therefore limited in their further use for archival or OCR. For images of non-planar paper surfaces like books, page curl causes additional distortion, which poses an even greater problem due to its nonlinearity. This paper presents a new algorithm for removing both perspective and page curl distortion. It requires only a single camera image as input and relies on a priori layout information instead of additional hardware. Therefore, it is much more user friendly than most previous approaches, and allows for flexible ad hoc document capture. Results are presented showing that the algorithm produces visually pleasing output and increases OCR accuracy, thus having the potential to become a general purpose preprocessing tool for camera based document capture.
  • Keywords
    document image processing; estimation theory; image enhancement; ad hoc document capture; camera based document capture; curled text lines; document image dewarping; page curl distortion removal; perspective removal; robust estimation; Books; Digital cameras; Digital photography; Hardware; Image databases; Image storage; Nonlinear distortion; Optical character recognition software; Robustness; Software packages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
  • ISSN
    1520-5263
  • Print_ISBN
    0-7695-2420-6
  • Type

    conf

  • DOI
    10.1109/ICDAR.2005.90
  • Filename
    1575694