DocumentCode :
1305533
Title :
A Laplacian Approach to Multi-Oriented Text Detection in Video
Author :
Shivakumara, Palaiahnakote ; Phan, Trung Quy ; Tan, Chew Lim
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
Volume :
33
Issue :
2
fYear :
2011
Firstpage :
412
Lastpage :
419
Abstract :
In this paper, we propose a method based on the Laplacian in the frequency domain for video text detection. Unlike many other approaches which assume that text is horizontally-oriented, our method is able to handle text of arbitrary orientation. The input image is first filtered with Fourier-Laplacian. K-means clustering is then used to identify candidate text regions based on the maximum difference. The skeleton of each connected component helps to separate the different text strings from each other. Finally, text string straightness and edge density are used for false positive elimination. Experimental results show that the proposed method is able to handle graphics text and scene text of both horizontal and nonhorizontal orientation.
Keywords :
Fourier transforms; Laplace transforms; character recognition; frequency-domain analysis; image thinning; object detection; pattern clustering; video signal processing; Fourier-Laplacian filtering; K-means clustering; Laplacian approach; component skeleton; edge density; false positive elimination; frequency domain; graphics text; multioriented text detection; scene text; text region identification; text string straightness; video text detection; Color; Decision support systems; Filtering; Frequency domain analysis; Graphics; Laplace equations; Skeleton; Connected component analysis; frequency domain processing; text detection; text orientation.;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2010.166
Filename :
5557889
Link To Document :
بازگشت