DocumentCode :
3489671
Title :
Text Line Extraction Method Using Domain-Based Active Contour Model
Author :
Itani, Yusuke ; Hirano, Takuichi ; Ishii, Jun
Author_Institution :
Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kamakura, Japan
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
1230
Lastpage :
1234
Abstract :
In this paper, we propose a novel text line extraction method using domain-based optimization for complex documents such as engineering drawings. In the complex documents, variations of text line due to differences in size, interval, and direction, as well as the complex layout (text lines and drawing objects are mixed) make the problem of automatic text line extraction extremely challenging. The proposed method divides a document into some domains based on line objects or spaces. Then, this method generates a potential map in each domain. Finally, text lines are extracted by an active contour model based on the potential map. In this process, the potential map is generated by a potential function that has some parameters concerning interval, character size and drawing objects. By optimizing these parameters, this method extracts text lines easily in complex documents. Our experimental result shows better performance than a conventional method for engineering drawings.
Keywords :
document image processing; optimisation; text detection; complex documents; domain-based active contour model; domain-based optimization; engineering drawings; potential function; potential map; text line extraction method; text line variation; Accuracy; Active contours; Engineering drawings; Equations; Mathematical model; Optimization; Text analysis; Active Contour Model; Document Image Analysis; Text line extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.249
Filename :
6628810
Link To Document :
بازگشت