DocumentCode
3489671
Title
Text Line Extraction Method Using Domain-Based Active Contour Model
Author
Itani, Yusuke ; Hirano, Takuichi ; Ishii, Jun
Author_Institution
Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kamakura, Japan
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
1230
Lastpage
1234
Abstract
In this paper, we propose a novel text line extraction method using domain-based optimization for complex documents such as engineering drawings. In such documents, variations in text lines due to differences in size, interval, and direction, together with complex layouts in which text lines and drawing objects are mixed, make automatic text line extraction extremely challenging. The proposed method first divides a document into domains based on line objects or spaces. It then generates a potential map in each domain. Finally, text lines are extracted by an active contour model based on the potential map. The potential map is generated by a potential function whose parameters concern interval, character size, and drawing objects. By optimizing these parameters, the method extracts text lines easily from complex documents. Our experimental results show better performance than a conventional method on engineering drawings.
Keywords
document image processing; optimisation; text detection; complex documents; domain-based active contour model; domain-based optimization; engineering drawings; potential function; potential map; text line extraction method; text line variation; Accuracy; Active contours; Engineering drawings; Equations; Mathematical model; Optimization; Text analysis; Active Contour Model; Document Image Analysis; Text line extraction;
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.249
Filename
6628810
Link To Document