DocumentCode :
2596449
Title :
A Bilingual Machine-Interface OCR for Printed Kannada and English Text Employing Wavelet Features
Author :
Kunte, R. Sanjeev ; Samuel, R. D Sudhaker
Author_Institution :
J S S Res. Found., Mysore
fYear :
2007
fDate :
17-20 Dec. 2007
Firstpage :
202
Lastpage :
207
Abstract :
Optical Character Recognition (OCR) is an important research area in the field of human-machine interfaces. This paper presents a bilingual OCR system for printed Kannada and English text. Gabor-filter-based features are used to separate Kannada and English words in a bilingual document. Wavelets, which have been increasingly used in pattern recognition, are used to extract the features for classifying both Kannada and English characters. Multilayer feed-forward neural classifiers, known for their good generalization and approximation properties, are used for classification. An overall recognition rate of 90.5% is obtained at the character level.
Keywords :
Gabor filters; feature extraction; human computer interaction; multilayers; natural language processing; optical character recognition; pattern classification; recurrent neural nets; text analysis; wavelet transforms; English text; Gabor filter; bilingual OCR system; bilingual document; bilingual machine-interface OCR; feature extraction; human-machine interface; multilayer feed forward neural classifiers; optical character recognition system; pattern recognition; printed Kannada; wavelet features; Character recognition; Databases; Feature extraction; Gabor filters; Information technology; Natural languages; Optical character recognition software; Optical filters; Pattern analysis; Pattern recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology, (ICIT 2007). 10th International Conference on
Conference_Location :
Orissa
Print_ISBN :
0-7695-3068-0
Type :
conf
DOI :
10.1109/ICIT.2007.12
Filename :
4418296