• DocumentCode
    2190256
  • Title

    Script independent text pre-processing and segmentation for OCR

  • Author

    Sawant, Archana S. ; Chougule, D.G.

  • Author_Institution
    Department of Computer Science and Engineering, KIT´s College of Engineering, Kolhapur, India
  • fYear
    2015
  • fDate
    24-25 Jan. 2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Optical Character Recognition (OCR) systems have been numerously developed for the recognition of printed script in many languages. Multiple approaches to pre-processing and segmentation exist for various scripts where OCR accuracy mainly depends on the text pre-processing and segmentation algorithm being used for the document. When the document is scanned it can be put in any arbitrary angle which would appear in the image as skew angle. Our experimental results proposed in the paper assures the superior algorithm for correction of skew angle of the text document. Projection Profile based methods used makes segmentation easy to separate the text in document image into lines, words and characters independent of the Language in the Text.
  • Keywords
    Character recognition; Frequency-domain analysis; Image analysis; Image recognition; Image segmentation; Optical character recognition software; Text analysis; OCR; Projection Profile; Segmentation; Skew correction; Text pre-processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical, Electronics, Signals, Communication and Optimization (EESCO), 2015 International Conference on
  • Conference_Location
    Visakhapatnam, India
  • Print_ISBN
    978-1-4799-7676-8
  • Type

    conf

  • DOI
    10.1109/EESCO.2015.7253643
  • Filename
    7253643