• DocumentCode
    2012205
  • Title

    Text Detection in Natural Scenes with Salient Region

  • Author

    Meng, Quan ; Song, Yonghong

  • Author_Institution
    Inst. of Artificial Intell. & Robot., Xi´´an Jiaotong Univ., Xi´´an, China
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    384
  • Lastpage
    388
  • Abstract
    In this paper, we present a novel approach to detect text in natural scenes. This approach is a type of bionic method, which imitates how human beings detect text exactly and robustly. Practically, human beings follow two steps to detect text: the first step is to find salient regions in a scene and the second step is to determine whether these salient regions are text or not. Therefore, two similar steps namely salient regions computation and text localization are used in our method. In the step of salient regions computation, a set of salient features including multi-sacle contrast, modified center-surround histogram, color spatial distribution and similarity of stroke width are used to describe an image, following with computation of salient regions based on the combination of Conditional Random Fields model and above features. Because sole letter rarely appear, in the step of text localization, salient regions are segmented and the connected components are grouped into text strings based on their features such as spatial relationships, color difference and stroke width. As an elementary unit, the text string is refined by connected component analysis. We tested the effectiveness of our method on the ICDAR 2003 database. The experimental results show that the proposed method provides promising performance in comparison with existing methods.
  • Keywords
    document image processing; image colour analysis; text analysis; visual databases; ICDAR 2003 database; bionic method; center surround histogram; color spatial distribution; conditional random fields model; connected component analysis; multiscale contrast; natural scenes; salient region; text detection; text localization; text string; Computational modeling; Conferences; Histograms; Humans; Image color analysis; Image edge detection; Pattern recognition; conditional random fields; salient regions; text detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Cost, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type

    conf

  • DOI
    10.1109/DAS.2012.85
  • Filename
    6195399