Title :
Handling artifacts in digitally reproduced documents
Author :
Cinque, L. ; Levialdi, S. ; Lombardi, L. ; Tanimoto, S.
Author_Institution :
Dept. of Inf. Sci., Rome Univ., Italy
Abstract :
The analysis of scanned documents is important in the construction of digital libraries and paperless offices. One significant challenge is coping with artifacts introduced by photocopying and scanning. We present a series of simple techniques for handling these difficulties. Using 125 images from the University of Washington scanned documents database, we demonstrate the effectiveness of these methods in preparing the images for segmentation by a multiresolution algorithm.
Keywords :
digital libraries; document image processing; image segmentation; artifacts; multiresolution algorithm; paperless offices; photocopying; scanned documents; scanned documents database; scanning; segmentation; Document handling; Image databases; Image resolution; Image segmentation; Information analysis; Information science; Printing; Remuneration; Software libraries; Text analysis
Conference_Title :
Proceedings of the Fifth IEEE International Workshop on Computer Architectures for Machine Perception, 2000
Conference_Location :
Padova
Print_ISBN :
0-7695-0740-9
DOI :
10.1109/CAMP.2000.875993