Title :
A New Method for Text Verification Based on Random Forests
Author :
Yang Zhang ; Chunheng Wang ; Baihua Xiao ; Cunzhao Shi
Author_Institution :
State Key Lab. of Intell. Control & Manage. of Complex Syst. Inst. of Autom., Beijing, China
Abstract :
Text in image or video frames contains a lot of high-level semantics which can be useful for multimedia indexing, management. Coarse text detection results may contain many false alarms, which makes it necessary to eliminate the false alarms for further recognition. As text has distinct textural features, texture-based classifier such as SVM, MLP and Adaboost has been used to classify the detection regions as text or non-text region. In this paper, a random forests based method for text verification is proposed. The reason of choosing random forests lies in: 1) its ability of maintaining accuracy in small labeled dataset and 2) its good performance in unbalanced dataset as in the case of unbalanced text and non-text distribution. Furthermore, we propose to merge different random forests trained with different kinds of features to improve the accuracy of classification. The comprehensive experimental results show that our methods are effective.
Keywords :
decision trees; document image processing; image texture; multimedia systems; pattern classification; text detection; Adaboost; MLP; SVM; coarse text detection; high-level semantics; multimedia indexing; nontext distribution; random forest; text verification; textural feature; texture-based classifier; unbalanced text; Accuracy; Feature extraction; Radio frequency; Support vector machines; Testing; Training; Vegetation; EOH; GSC; MsLBP; RF; random forests; text verification;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
DOI :
10.1109/ICFHR.2012.163