Title :
CSM-based feature extraction for degraded machine printed character recognition
Author :
Namane, Abderrahmane ; Maamoun, Mountassar ; Soubari, EL Houssine ; Meyrueis, Patrick
Author_Institution :
Dept. of Electron., Saad Dahleb Univ., Blida, Algeria
Abstract :
This paper presents an OCR method for degraded character recognition applied to typewritten document produced by typesetting machine. The complementary similarity measure method (CSM) is a well known classification method and widely applied in the area of character recognition. In this work the CSM method is not only used as a classifier but also introduced as a feature extractor, and applied to degraded character recognition. The resulted CSM feature vector is used to train a multi layered perceptron (MLP). The use of the CSM as a feature extractor tends to boost the MLP and makes it very powerful and very well suited for rejection. Experimental results on n typewritten A4 page documents show the ability of the model to yield relevant and robust recognition on poor quality printed document characters.
Keywords :
feature extraction; multilayer perceptrons; optical character recognition; pattern classification; text analysis; CSM based feature extraction; MLP; OCR method; complementary similarity measure method; degraded machine printed character recognition; multi layered perceptron; typesetting machine; typewritten document; Character recognition; Feature extraction; Image recognition; Neurons; Robustness; Support vector machine classification; Training; Complementary similarity measure; MLP; OCR; character recognition; degraded printed characters;
Conference_Titel :
Cybernetic Intelligent Systems (CIS), 2010 IEEE 9th International Conference on
Conference_Location :
Reading
Print_ISBN :
978-1-4244-9023-3
Electronic_ISBN :
978-1-4244-9024-0
DOI :
10.1109/UKRICIS.2010.5898105