مرکز منطقه ای اطلاع رساني علوم و فناوري - Recognition of poor quality words without segmentation

DocumentCode :

305629

Title :

Recognition of poor quality words without segmentation

Author :

Raza, G. ; Sherkat, N. ; Whitrow, R.J.

Author_Institution :

Dept. of Comput., Nottingham Trent Univ., UK

Volume :

fYear :

1996

fDate :

14-17 Oct 1996

Firstpage :

Abstract :

A method for recognition of poor quality documents is presented. The method is based on extraction of independent and robust features of `objects´ within a word without segmenting touching objects. A number of alternatives for each object are found. A lexical lookup method is used to verify the alternatives. The method has been applied to seven different poor quality facsimile messages in order to observe its effectiveness. The facsimile messages were first processed using a commercial OCR software and only the unrecognized words were selected and processed using the developed method. Different improvement rates were observed for the facsimile messages. The improvement ranged from 21.57% to 100%. The results obtained from these facsimile messages suggest that the developed method is capable of recognizing poor quality documents whilst avoiding segmentation of touching characters

Keywords :

document image processing; feature extraction; optical character recognition; commercial OCR software; facsimile messages; improvement rates; lexical lookup method; poor quality documents; poor quality words; touching objects; unrecognized words; Character recognition; Degradation; Educational institutions; Facsimile; Feature extraction; Image segmentation; Optical character recognition software; Robustness; Telephony; Text recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems, Man, and Cybernetics, 1996., IEEE International Conference on

Conference_Location :

Beijing

ISSN :

1062-922X

Print_ISBN :

0-7803-3280-6

Type :

conf

DOI :

10.1109/ICSMC.1996.569741

Filename :

569741

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=305629