• DocumentCode
    1544532
  • Title

    Robust detection of skew in document images

  • Author

    Chaudhuri, Arindam ; Chaudhuri, Subhasis

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol., Bombay, India
  • Volume
    6
  • Issue
    2
  • fYear
    1997
  • fDate
    2/1/1997 12:00:00 AM
  • Firstpage
    344
  • Lastpage
    349
  • Abstract
    We describe a robust yet fast algorithm for skew detection in binary document images. The method is based on interline cross-correlation in the scanned image. Instead of finding the correlation for the entire image, it is calculated over small regions selected randomly. The proposed method does not require prior segmentation of the document into text and graphics regions. The maximum median of cross-correlation is used as the criterion to obtain the skew, and a Monte Carlo sampling technique is chosen to determine the number of regions over which the correlations have to be calculated. Experimental results on detecting skews in various types of documents containing different linguistic scripts are presented here
  • Keywords
    Monte Carlo methods; correlation methods; document image processing; image sampling; image segmentation; Monte Carlo sampling technique; binary document images; cross-correlation maximum median; interline cross-correlation; linguistic scripts; robust detection; scanned image; skew; Fourier transforms; Graphics; Histograms; Image converters; Image segmentation; Monte Carlo methods; Nearest neighbor searches; Robustness; Text analysis; Text recognition;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/83.551708
  • Filename
    551708