• DocumentCode
    1879176
  • Title

    Automatic text area segmentation in natural images

  • Author

    Jafri, Syed Ali Raza ; Boutin, Mireille ; Delp, Edward J.

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN
  • fYear
    2008
  • fDate
    12-15 Oct. 2008
  • Firstpage
    3196
  • Lastpage
    3199
  • Abstract
    We present a hierarchical method for segmenting text areas in natural images. The method assumes that the text is written with a contrasting color on a more or less uniform background. No assumption is made regarding the language or character set used to write the text. In particular, the text can contain simple graphics or symbols. The key feature of our approach is that we first concentrate on finding the background of the text, before testing whether there is actually text on the background. Since uniform areas are easy to find in natural images, and since text backgrounds define areas which contain "holes" (where the text is written) we thus look for uniform areas containing "holes" and label them as text backgrounds candidates. Each candidate area is then further tested for the presence of text within its convex hull. We tested our method on a database of 65 images including English and Urdu text. The method correctly segmented all the text areas in 63 of these images, and in only 4 of these were areas that do not contain text also segmented.
  • Keywords
    image segmentation; natural scenes; text analysis; English text; Urdu text; automatic text area segmentation; contrasting color; hierarchical method; natural images; simple graphics; text segmenation; Computer vision; Design methodology; Graphics; Image databases; Image segmentation; Licenses; Robustness; Shape; Testing; Wire; text segmentation; uniform texture;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 2008. ICIP 2008. 15th IEEE International Conference on
  • Conference_Location
    San Diego, CA
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-1765-0
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2008.4712475
  • Filename
    4712475