DocumentCode :
112705
Title :
Multi-Orientation Scene Text Detection with Adaptive Clustering
Author :
Yin, Xu-Cheng ; Pei, Wei-Yi ; Zhang, Jun ; Hao, Hong-Wei
Author_Institution :
Department of Computer Science and Technology and also with the Beijing Key Laboratory of Materials Science Knowledge Engineering, School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Volume :
37
Issue :
9
fYear :
2015
fDate :
Sept. 1 2015
Firstpage :
1930
Lastpage :
1937
Abstract :
Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks, while most current research efforts only focus on horizontal or near horizontal scene text. In this paper, first we present a unified distance metric learning framework for adaptive hierarchical clustering, which can simultaneously learn similarity weights (to adaptively combine different feature similarities) and the clustering threshold (to automatically determine the number of clusters). Then, we propose an effective multi-orientation scene text detection system, which constructs text candidates by grouping characters based on this adaptive clustering. Our text candidates construction method consists of several sequential coarse-to-fine grouping steps: morphology-based grouping via single-link clustering, orientation-based grouping via divisive hierarchical clustering, and projection-based grouping also via divisive clustering. The effectiveness of our proposed system is evaluated on several public scene text databases, e.g., ICDAR Robust Reading Competition data sets (2011 and 2013), MSRA-TD500 and NEOCR. Specifically, on the multi-orientation text data set MSRA-TD500, the f measure of our system is 71 percent, much better than the state-of-the-art performance. We also construct and release a practical challenging multi-orientation scene text data set (USTB-SV1K), which is available at http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/.
Keywords :
Clustering algorithms; Equations; Image color analysis; Mathematical model; Measurement; Morphology; Robustness; Scene text detection; adaptive hierarchical clustering; coarse-to-fine grouping; multi-orientation;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2014.2388210
Filename :
7001081
Link To Document :
بازگشت