Title :
Text Detection and Recognition in Real World Images
Author :
Saabni, Raid ; Zwilling, M.
Author_Institution :
Triangle R&D Center, Tel-Aviv Univ., Kafr Qarea, Israel
Abstract :
Detecting and recognizing texts in real world images such as sign boards and advertisements is an important part of computer vision applications. The complexity of the problem comes out of many factors such as nonuniform background, different languages and fonts, and non consistent text alignment and orientation. In this paper, we present a novel approach to detect characters and words in real-world images. The presented approach decompose the gray level image into sequence of images, each one includes pixels with gray level values from different disjoint ranges. This decomposition enables extracting connected components representing characters or other non textual objects separated from their neighborhood background. An interpolation of two classes of features translated to histograms is used by a support vector machine to classify and collect the textual objects generating the textual zones. The Shape Context Descriptor [1], is used by the Earth Movers Distance(EMD) method to recognize the characters within the image. The recognized characters are fed to heuristic rule based system to determine words and give final results. To optimize the speed of the system, we follow the embedding of the EMD metric presented in [22] to a normed space to enable fast approximation of the k-Nearest Neighbors using Local Sensitivity Hashing functions(LSH). Experiments show that our algorithm can detect and recognize text regions from the ICDAR 2005 datasets [17] with high rates.
Keywords :
character recognition; computational complexity; computer vision; feature extraction; image classification; image recognition; image representation; image sequences; interpolation; knowledge based systems; support vector machines; text detection; EMD metric; Earth movers distance method; ICDAR 2005 datasets; LSH; character detection; character representation; computer vision applications; connected components extraction; fast approximation; feature translation; gray level image composition; gray level values; heuristic rule-based system; histograms; image sequence; interpolation; k-nearest neighbors; local sensitivity hashing functions; neighborhood background; nontextual objects; problem complexity; real world image; shape context descriptor; support vector machine; text detection; text recognition; textual object generation; textual zones; words determine; Character recognition; Context; Histograms; Image recognition; Measurement; Shape; Text recognition; Earth Movers Distance; Embedding; Local Sensitivity Hashing; Text Detection; Word Searching; k-Nearest Neighbor;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
DOI :
10.1109/ICFHR.2012.279