Title :
An Efficient Algorithm for Segmenting Warped Text-Lines in Document Images
Author :
Oliveira, Daniel ; Lins, R. ; Torreao, Gabriel ; Jian Fan ; Thielo, Marcelo
Author_Institution :
Univ. Fed. de Pernambuco, Recife, Brazil
Abstract :
Warped text-lines often appear whenever one performs the digitalization of bound documents using flatbed scanners or digital cameras. Compensating such distortion is an important pre-processing step in document transcription via OCR, for instance. This paper presents an efficient algorithm for text-line segmentation for document images. A typographic study and parameter tuning are done yielding into high values for precision, recall and f-measure metrics. The method presented outperforms the competing algorithms using a public available dataset.
Keywords :
document image processing; image segmentation; image sensors; optical character recognition; text analysis; OCR; digital cameras; document images; document transcription; efficient algorithm; flatbed scanners; warped text line segmentation; Algorithm design and analysis; Conferences; Equations; Gray-scale; Image recognition; Image segmentation; Text analysis; Text-line segmentation; camera documents; document image processing; page segmentation;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.57