• DocumentCode
    3135734
  • Title

    Text Detection and Recognition in Real World Images

  • Author

    Saabni, Raid ; Zwilling, M.

  • Author_Institution
    Triangle R&D Center, Tel-Aviv Univ., Kafr Qarea, Israel
  • fYear
    2012
  • fDate
    18-20 Sept. 2012
  • Firstpage
    443
  • Lastpage
    448
  • Abstract
    Detecting and recognizing texts in real world images such as sign boards and advertisements is an important part of computer vision applications. The complexity of the problem comes out of many factors such as nonuniform background, different languages and fonts, and non consistent text alignment and orientation. In this paper, we present a novel approach to detect characters and words in real-world images. The presented approach decompose the gray level image into sequence of images, each one includes pixels with gray level values from different disjoint ranges. This decomposition enables extracting connected components representing characters or other non textual objects separated from their neighborhood background. An interpolation of two classes of features translated to histograms is used by a support vector machine to classify and collect the textual objects generating the textual zones. The Shape Context Descriptor [1], is used by the Earth Movers Distance(EMD) method to recognize the characters within the image. The recognized characters are fed to heuristic rule based system to determine words and give final results. To optimize the speed of the system, we follow the embedding of the EMD metric presented in [22] to a normed space to enable fast approximation of the k-Nearest Neighbors using Local Sensitivity Hashing functions(LSH). Experiments show that our algorithm can detect and recognize text regions from the ICDAR 2005 datasets [17] with high rates.
  • Keywords
    character recognition; computational complexity; computer vision; feature extraction; image classification; image recognition; image representation; image sequences; interpolation; knowledge based systems; support vector machines; text detection; EMD metric; Earth movers distance method; ICDAR 2005 datasets; LSH; character detection; character representation; computer vision applications; connected components extraction; fast approximation; feature translation; gray level image composition; gray level values; heuristic rule-based system; histograms; image sequence; interpolation; k-nearest neighbors; local sensitivity hashing functions; neighborhood background; nontextual objects; problem complexity; real world image; shape context descriptor; support vector machine; text detection; text recognition; textual object generation; textual zones; words determine; Character recognition; Context; Histograms; Image recognition; Measurement; Shape; Text recognition; Earth Movers Distance; Embedding; Local Sensitivity Hashing; Text Detection; Word Searching; k-Nearest Neighbor;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
  • Conference_Location
    Bari
  • Print_ISBN
    978-1-4673-2262-1
  • Type

    conf

  • DOI
    10.1109/ICFHR.2012.279
  • Filename
    6424433