DocumentCode :
2145742
Title :
Minimizing User Annotations in the Generation of Layout Ground-Truthed Data
Author :
Hadjar, Karim ; Ingold, Rolf
Author_Institution :
Dept. of Multimedia, Ahlia Univ., Manama, Bahrain
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
703
Lastpage :
707
Abstract :
This paper describes the adaptation of a previously developed document recognition framework called PLANET (Physical Layout Analysis of complex structured Arabic documents using artificial neural NETs) into a ground truthing system for complex Arabic document images [8]. PLANET is a layout analysis tool for Arabic documents with complex structures allowing incremental learning in an interactive environment. Artificial neural nets drive the classification of homogeneous text blocks. We have observed that when users use PLANET for ground truthing, the number of interactive corrections is quite large. In order to reduce user intervention and to make use of PLANET as a ground truthing system we have adapted its architecture.
Keywords :
document image processing; image classification; learning (artificial intelligence); neural nets; text analysis; PLANET layout analysis tool; complex Arabic document image; document recognition framework; homogeneous text block classification; incremental learning; interactive environment; layout ground-truthed data generation; physical layout analysis of complex structured Arabic documents using artificial neural nets; user annotation minimization; Adaptation models; Artificial neural networks; Knowledge engineering; Layout; Planets; Text analysis; XML; Arabic Newspapers; Artificial Neural Networks; Datasets; Document Image; Ground Truth; Physical Layout Extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.147
Filename :
6065402
Link To Document :
بازگشت