• DocumentCode
    3489671
  • Title

    Text Line Extraction Method Using Domain-Based Active Contour Model

  • Author

    Itani, Yusuke ; Hirano, Takuichi ; Ishii, Jun

  • Author_Institution
    Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kamakura, Japan
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    1230
  • Lastpage
    1234
  • Abstract
    In this paper, we propose a novel text line extraction method using domain-based optimization for complex documents such as engineering drawings. In the complex documents, variations of text line due to differences in size, interval, and direction, as well as the complex layout (text lines and drawing objects are mixed) make the problem of automatic text line extraction extremely challenging. The proposed method divides a document into some domains based on line objects or spaces. Then, this method generates a potential map in each domain. Finally, text lines are extracted by an active contour model based on the potential map. In this process, the potential map is generated by a potential function that has some parameters concerning interval, character size and drawing objects. By optimizing these parameters, this method extracts text lines easily in complex documents. Our experimental result shows better performance than a conventional method for engineering drawings.
  • Keywords
    document image processing; optimisation; text detection; complex documents; domain-based active contour model; domain-based optimization; engineering drawings; potential function; potential map; text line extraction method; text line variation; Accuracy; Active contours; Engineering drawings; Equations; Mathematical model; Optimization; Text analysis; Active Contour Model; Document Image Analysis; Text line extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.249
  • Filename
    6628810