DocumentCode
3487576
Title
Text Line Detection in Document Images: Towards a Support System for the Blind
Author
Tomoyuki Nassu, Bogdan ; Minetto, Rodrigo ; Soares de Oliveira, Luiz Eduardo
Author_Institution
Fed. Univ. of Technol. Parana, Curitiba, Brazil
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
638
Lastpage
642
Abstract
We introduce a novel approach for text line detection in document images, keeping in mind the requirements of a portable text recognition system designed to support the blind. Challenges include shadows, cluttered backgrounds, and perspective distortion. Different from previous approaches, the proposed method does not segment the image. A text model is created by clustering SIFT features extracted from positive and negative examples. Text regions are located by matching the features extracted from the input image to the clusters in the text model. Regions around the correspondences are then analyzed, and text lines are identified based on features such as gradients and histogram distribution. Experimental results show that our approach outperforms a state-of-the-art text detector in a text/non-text classification task.
Keywords
document image processing; feature extraction; handicapped aids; image classification; image matching; object detection; SIFT feature clustering; blind support system; classification task; cluttered backgrounds; document image; feature extraction; feature matching; gradients; histogram distribution; perspective distortion; portable text recognition system; scale-invariant feature transform; shadows; text line detection; Detectors; Feature extraction; Histograms; Image segmentation; Text analysis; Text recognition; Training; Document Image Processing; Text Detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.131
Filename
6628696
Link To Document