Title :
Multi-oriented Handwritten Annotations Extraction from Scanned Documents
Author :
Benjlaiel, Mohamed ; Mullot, Remy ; Alimi, Adel M.
Author_Institution :
REGIM-Lab.: Res. Groups on Intell. Machines, Univ. of Sfax, Sfax, Tunisia
Abstract :
In this paper, we present an integrated system able to localize multi-oriented handwritten annotations in scanned documents. Unlike previous single methods which limit colors or types of annotations to be extracted, the proposed method attempts to extract annotations by fusing three feature extraction techniques based on internal and external shape analysis. Our method consists of two processes: 1) a coarse segmentation process which divides the scanned document into text and non-text regions. 2) A fine segmentation process which consists of three steps: a feature extraction process, a classification process and a majority voting process which identifies the segmented regions as machine-printed or handwritten annotations. We find that our adaptive method outperform all individual methods. Experimental results on a set of 301 annotated scanned documents are reported.
Keywords :
Fourier analysis; Gabor filters; document image processing; feature extraction; image classification; image segmentation; Fourier descriptors; Gabor filters; annotation extraction; classification process; coarse segmentation process; external shape analysis; feature extraction process; feature extraction techniques; fine segmentation process; internal shape analysis; machine-printed annotations; majority voting process; multioriented handwritten annotations extraction; nontext regions; scanned documents; segmented regions; Accuracy; Feature extraction; Gabor filters; Shape; Text analysis; Vectors; Fourier descriptors; Gabor filters; annotation extraction; document analysis; handwritten text separation;
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
DOI :
10.1109/DAS.2014.17