Title :
Video text detection and localization based on localized generalization error model
Author :
Ma, Xian-heng ; Ng, Wing W Y ; Chan, Patrick P K ; Yeung, Daniel S.
Author_Institution :
Machine Learning & Cybern. Res. Center, South China Univ. of Technol., Guangzhou, China
Abstract :
Text in videos provides rich information for video analysis tasks such as indexing, understanding, and retrieval. In this work, we propose a neural-network-based method for detecting text in video frames. The proposed method consists of three major steps: feature extraction, text region detection, and candidate region refinement. First, we extract texture features from four edge maps computed from the target video frame. Second, a Radial Basis Function Neural Network (RBFNN) optimized by the Localized Generalization Error Model (L-GEM) is applied to detect text candidates. Finally, false detections are removed to refine the result. Experimental results demonstrate that the proposed method is effective for different font colors, font sizes, and languages against complex backgrounds.
Keywords :
edge detection; feature extraction; radial basis function networks; text analysis; video signal processing; edge maps; localized generalization error model; radial basis function neural network; texture features extraction; video indexing; video retrieval; video text detection; video understanding; Classification algorithms; Computer architecture; Feature extraction; Image edge detection; Machine learning; Neurons; Training; Localized generalization error model (LGEM); Radial basis function neural network (RBFNN); Text detection;
Conference_Title :
Machine Learning and Cybernetics (ICMLC), 2010 International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-1-4244-6526-2
DOI :
10.1109/ICMLC.2010.5580484