Title :
Recovery of distorted document images from bound volumes
Author :
Zhang, Zheng ; Tan, Chew Lim
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore
fDate :
2001
Abstract :
Recovery of document images scanned from thick bound volumes is necessary for human reading and text retrieval. The main problem with scanning bound volumes is that perspective distortion always occurs. Such distortion causes two sources of degradation in the scanned images: 1) shadow at the book spine area, and 2) warping of the words in the shadow. In this paper, we develop a restoration system to solve these two problems. First, the boundary between the shadow and the clean area is detected. Then the system applies a modified Niblack's method to remove the shadow. The system uses connected component analysis to help improve the noise reduction and to adjust the location and orientation of the warped words in the shadow area, i.e., the words within the boundary detected earlier. The implementation results for each step are presented. Our system will be used in the text retrieval projects for the National Archives of Singapore and the NUS Digital Library.
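The shadow-removal step mentioned in the abstract builds on Niblack's local thresholding. The abstract does not say how the method was modified, so the block below is only a minimal sketch of the classic, unmodified Niblack rule, T = mean + k * std over a local window. The function name `niblack_binarize`, the window size, and the value k = -0.2 are illustrative assumptions, not details taken from the paper.

```python
import math


def niblack_binarize(image, window=3, k=-0.2):
    """Classic Niblack thresholding (not the paper's modified variant).

    `image` is a list of rows of grayscale values (0 = black, 255 = white).
    Returns a same-sized list of rows: 1 for dark (text) pixels, 0 otherwise.
    `window` and `k` are assumed defaults chosen for illustration.
    """
    height, width = len(image), len(image[0])
    half = window // 2
    result = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # Gather the local neighbourhood, clipped at the image border.
            values = [
                image[j][i]
                for j in range(max(0, y - half), min(height, y + half + 1))
                for i in range(max(0, x - half), min(width, x + half + 1))
            ]
            mean = sum(values) / len(values)
            variance = sum((v - mean) ** 2 for v in values) / len(values)
            # Local threshold: mean shifted by k standard deviations.
            threshold = mean + k * math.sqrt(variance)
            result[y][x] = 1 if image[y][x] < threshold else 0
    return result


if __name__ == "__main__":
    # A single dark pixel on a bright background.
    page = [[200, 200, 200], [200, 50, 200], [200, 200, 200]]
    for row in niblack_binarize(page):
        print(row)
```

Because the threshold follows the local mean, a gradual darkening such as a spine shadow shifts the threshold along with the background, which is presumably why a Niblack-style rule suits this problem. A perfectly uniform region has zero standard deviation, so no pixel falls strictly below its threshold and the region is left empty.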
Keywords :
document image processing; image restoration; noise; connected component analysis; distorted document image recovery; human reading; modified Niblack method; noise reduction; restoration system; scanned image degradation; shadow; text retrieval; thick bound volumes; warping; Books; Degradation; Filtration; Glass; Humans; Image restoration; Image retrieval; Noise reduction; Pixel; Software libraries;
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953826