DocumentCode
2596449
Title
A Bilingual Machine-Interface OCR for Printed Kannada and English Text Employing Wavelet Features
Author
Kunte, R. Sanjeev ; Samuel, R. D Sudhaker
Author_Institution
J S S Res. Found., Mysore
fYear
2007
fDate
17-20 Dec. 2007
Firstpage
202
Lastpage
207
Abstract
An Optical Character Recognition (OCR) system is one of the important research areas in the field of Human- machine interface. This paper presents a bilingual OCR system for printed Kannada and English text. Gabor filter based features are used for separating the Kannada and English words from the bilingual document. Wavelets that have been progressively used in pattern recognition are used in the system to extract the features for classifying both the Kannada and English characters. Multilayer feed forward Neural classifiers known for their good generalization and approximation property have been effectively used in the system for the classification. An overall recognition rate of 90.5% is obtained at character level.
Keywords
Gabor filters; feature extraction; human computer interaction; multilayers; natural language processing; optical character recognition; pattern classification; recurrent neural nets; text analysis; wavelet transforms; English text; Gabor filter; bilingual OCR system; bilingual document; bilingual machine-interface OCR; feature extraction; human- machine interface; multilayer feed forward neural classifiers; optical character recognition system; pattern recognition; printed Kannada; wavelet features; Character recognition; Databases; Feature extraction; Gabor filters; Information technology; Natural languages; Optical character recognition software; Optical filters; Pattern analysis; Pattern recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology, (ICIT 2007). 10th International Conference on
Conference_Location
Orissa
Print_ISBN
0-7695-3068-0
Type
conf
DOI
10.1109/ICIT.2007.12
Filename
4418296
Link To Document