DocumentCode :
2149465
Title :
Connected Component Level Discrimination of Handwritten and Machine-Printed Text Using Eigenfaces
Author :
Pinson, Samuel J. ; Barrett, William A.
Author_Institution :
Pinson Linguistic Service, Sultan, WA, USA
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
1394
Lastpage :
1398
Abstract :
We employ Eigenfaces to discriminate between handwritten and machine-printed text at the connected component (CC) level. Normalized images of machine print CCs are treated as points in a high-dimensional space. PCA yields a reduced-dimensional character space. Representative machine print CCs are projected into character space and a local distance threshold for each representative is automatically determined. CCs are classified as machine print if they are within the local distance threshold of their closest machine print representative. Otherwise, they are classified as handwriting. Recursive character segmentation using min graph cut is used to address the problem of touching characters. Validation over a large NIST handwriting and machine print database demonstrates precision of 93.98% and 89.1% for machine print and handwriting respectively.
Keywords :
eigenvalues and eigenfunctions; graph theory; handwritten character recognition; image classification; image segmentation; principal component analysis; printing; text analysis; visual databases; CC classification; NIST handwriting database; PCA; connected component level; eigenfaces; handwritten text; high-dimensional space; local distance threshold; machine print database; machine printed text; min graph cut; normalized images; recursive character segmentation; reduced dimensional character space; representative machine print CC; touching characters; Accuracy; Databases; Feature extraction; Image color analysis; NIST; Text analysis; Eigenfaces; Handwriting/machine print discrimination; NIST; min graph cut; touching character segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.280
Filename :
6065539
Link To Document :
بازگشت