Title :
Handwritten and machine printed text discrimination using an edge co-occurrence matrix
Author :
Zhang, Xiaofeng ; Lu, Yue
Author_Institution :
Sch. of Comput. Sci. & Technol., Nantong Univ., Nantong, China
Abstract :
We employ an edge co-occurrence matrix (ECM) to distinguish handwritten and machine printed text without resorting to line or word information. The ECM is a modified co-occurrence matrix (CM) on edge images. First, the whole image is divided into overlapping range blocks with fixed size. Then, the ECMs are abstracted from these blocks. The ECM only counts the co-occurring edges connected with each other and its up direction part is the part with most distribution. The liner Support Vector Machine (SVM) is used to classify the features. Because of the similarities of neighboring blocks, the Discriminative Random Fields (DRF) is used to further improve the classification accuracy. The experiments on document images taken from HIT and IMA databases show the effectiveness of our proposed method.
Keywords :
document image processing; edge detection; image classification; matrix algebra; random processes; support vector machines; text analysis; visual databases; DRF; ECM; HIT database; IMA database; SVM; discriminative random fields; document images; edge co-occurrence matrix; edge images; feature classification; handwritten text discrimination; liner support vector machine; machine printed text discrimination; overlapping range blocks;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376728