Title :
Integrating word level knowledge in text recognition
Author :
Harmalkar, Subodh ; Sinha, R.M.K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Kanpur, India
Abstract :
A text recognition system capable of correcting character-level confusions is developed. It incorporates word-level knowledge sources in the form of a dictionary and an envelope of the words. This speeds up the word recognition process by recognizing only crucial characters in the word. Also, the same knowledge sources are used to generate possible pairs of characters in a merged blob of characters, so that the blob can be segmented at proper positions. The system is capable of segmenting and classifying merged pairs of characters in the input text. A dictionary of the 9800 most frequently used words in English (taken from Brown Corpus) has been used
Keywords :
knowledge based systems; optical character recognition; Brown Corpus; character-level confusions; text recognition; word level knowledge; Character generation; Character recognition; Computer science; Dictionaries; Humans; Knowledge engineering; Retina; Text recognition; Viterbi algorithm; Writing;
Conference_Titel :
Pattern Recognition, 1990. Proceedings., 10th International Conference on
Conference_Location :
Atlantic City, NJ
Print_ISBN :
0-8186-2062-5
DOI :
10.1109/ICPR.1990.118211