Title :
A component-tree based method for user-intention guided text extraction
Author :
Lei Sun ; Qiang Huo
Author_Institution :
Dept. of Electron. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
We propose a new component-tree based method with efficient and effective pruning strategies for user intention guided text extraction from scene images. A grayscale image is represented first as two component-trees, whose nodes represent possible candidates of character components. The non-text candidates are then pruned by using contrast, geometric and text line information as well as the constraint imposed by user intention. Bounding boxes of the intended words are then generated by grouping the survived text objects. Finally, the better result is chosen as extracted text from the two trees using an evaluation function. The proposed algorithm is evaluated on ICDAR-2003 benchmark dataset and promising results are achieved.
Keywords :
image representation; information retrieval; natural scenes; text analysis; trees (mathematics); ICDAR-2003 benchmark dataset; character component; component tree based method; contrast information; evaluation function; geometric information; grayscale image representation; nontext candidate; pruning strategy; scene image; text line information; text objects; user intention guided text extraction; Algorithm design and analysis; Buildings; Feature extraction; Gray-scale; Lighting; Robustness; Sun;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4