Title :
Gradient Vector Flow and Grouping-Based Method for Arbitrarily Oriented Scene Text Detection in Video Images
Author :
Shivakumara, Palaiahnakote ; Trung Quy Phan ; Shijian Lu ; Chew Lim Tan
Author_Institution :
Dept. of Comput. Syst. & Inf. Technol., Univ. of Malaya, Kuala Lumpur, Malaysia
Abstract :
Text detection in video is challenging because of the low resolution and complex backgrounds of video frames. Arbitrarily oriented scene text lines make the problem harder still. This paper presents a new method that extracts text lines of any orientation based on gradient vector flow (GVF) and neighbor component grouping. The GVF of edge pixels in the Sobel edge map of the input frame is explored to identify the dominant edge pixels, which represent text components. The method extracts the edge components in the Sobel edge map that correspond to dominant pixels; we call these the text candidates (TC) of the text lines. We propose two grouping schemes. The first finds nearest neighbors based on geometrical properties of the TC to group broken segments and neighboring characters, which results in word patches. The end and junction points of the skeletons of the word patches are then used to eliminate false positives, yielding the candidate text components (CTC). The second scheme uses the direction and size of the CTC to extract neighboring CTC and to restore missing CTC, which enables arbitrarily oriented text line detection in video frames. Experimental results on different datasets, including arbitrarily oriented text data, nonhorizontal and horizontal text data, Hua's data and ICDAR-03 data (camera images), show that the proposed method outperforms existing methods in terms of recall, precision and f-measure.
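As a rough illustration of the GVF step named in the abstract, the sketch below computes a gradient vector flow field from an edge-strength map using the standard iterative formulation of Xu and Prince. It is not the authors' implementation: the function names, the parameters `mu` and `iterations`, and the list-of-lists grid are assumptions made for this example, and the edge map is assumed to be already computed (for instance by a Sobel operator) and scaled to [0, 1]. The abstract does not say how dominant edge pixels are selected from the field, so that step is not shown.

```python
def gradient(f):
    """Finite-difference gradient of a 2-D grid (one-sided at the borders).

    Assumes the grid has at least two rows and two columns.
    """
    h, w = len(f), len(f[0])
    fx = [[0.0] * w for _ in range(h)]
    fy = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            x0, x1 = max(x - 1, 0), min(x + 1, w - 1)
            y0, y1 = max(y - 1, 0), min(y + 1, h - 1)
            fx[y][x] = (f[y][x1] - f[y][x0]) / (x1 - x0)
            fy[y][x] = (f[y1][x] - f[y0][x]) / (y1 - y0)
    return fx, fy


def laplacian(g):
    """Five-point Laplacian with replicated borders."""
    h, w = len(g), len(g[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            up = g[max(y - 1, 0)][x]
            down = g[min(y + 1, h - 1)][x]
            left = g[y][max(x - 1, 0)]
            right = g[y][min(x + 1, w - 1)]
            out[y][x] = up + down + left + right - 4.0 * g[y][x]
    return out


def gvf(edge_map, mu=0.2, iterations=50):
    """Return the GVF field (u, v) of an edge map with values in [0, 1].

    Each iteration applies
        u += mu * laplacian(u) - |grad f|^2 * (u - f_x)
    and the same update for v with f_y. Keeping mu at or below 0.25
    keeps this explicit scheme stable for unit grid spacing.
    """
    fx, fy = gradient(edge_map)
    h, w = len(edge_map), len(edge_map[0])
    mag2 = [[fx[y][x] ** 2 + fy[y][x] ** 2 for x in range(w)] for y in range(h)]
    # Start from the plain gradient and diffuse it into flat regions.
    u = [row[:] for row in fx]
    v = [row[:] for row in fy]
    for _ in range(iterations):
        lap_u, lap_v = laplacian(u), laplacian(v)
        for y in range(h):
            for x in range(w):
                b = mag2[y][x]
                u[y][x] += mu * lap_u[y][x] - b * (u[y][x] - fx[y][x])
                v[y][x] += mu * lap_v[y][x] - b * (v[y][x] - fy[y][x])
    return u, v


if __name__ == "__main__":
    # Toy edge map: a single vertical edge in column 4 of a 9x9 grid.
    edge = [[1.0 if x == 4 else 0.0 for x in range(9)] for _ in range(9)]
    u, v = gvf(edge)
    print("u along the middle row:", [round(val, 3) for val in u[4]])
```

On the toy edge map, the horizontal component `u` is positive to the left of the edge and negative to the right, so the vectors on both sides point toward the edge. This long-range attraction toward edges is the property that makes GVF useful for locating edge pixels of interest.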
Keywords :
gradient methods; object detection; video signal processing; CTC; GVF; Hua data; ICDAR-03 data; Sobel edge map; arbitrarily oriented scene text detection; arbitrarily oriented text data; candidate text component; edge pixel; f-measure; geometrical property; gradient vector flow; grouping-based method; nearest neighbor; neighbor component grouping; nonhorizontal text data; scene text lines; text candidates; text line detection; video images; Arbitrarily oriented text detection; candidate text components (CTC); dominant text pixel; gradient vector flow (GVF); text candidates (TC); text components;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2013.2255396