DocumentCode
771013
Title
Document binarisation using Kohonen SOM
Author
Badekas, E. ; Papamarkos, N.
Author_Institution
Dept. of Electr. & Comput. Eng., Democritus Univ. of Thrace, Xanthi
Volume
1
Issue
1
fYear
2007
fDate
3/1/2007 12:00:00 AM
Firstpage
67
Lastpage
84
Abstract
An integrated system for the binarisation of normal and degraded printed documents for the purpose of visualisation and recognition of text characters is proposed. In degraded documents, where considerable background noise or variation in contrast and illumination exists, there are many pixels that cannot be easily classified as foreground or background pixels. For this reason, it is necessary to perform document binarisation by combining and taking into account the results of a set of binarisation techniques, especially for document pixels that have high vagueness. The proposed binarisation technique takes advantages of the benefits of a set of selected binarisation algorithms by combining their results using a Kohonen self-organising map neural network. In order to improve further the binarisation results, significant improvements are proposed for two of the most powerful document binarisation techniques used, that is for the adaptive logical level technique and for the improvement of integrated function algorithm. The proposed binarisation technique is extensively tested with a variety of degraded documents. Several experimental and comparative results, demonstrating the performance of the proposed technique, are presented
Keywords
character recognition; document image processing; image denoising; self-organising feature maps; Kohonen SOM; Kohonen self-organising map neural network; adaptive logical level technique; background noise; binarisation techniques; degraded printed documents; document binarisation; document pixels; integrated function algorithm; text character recognition; text character visualisation;
fLanguage
English
Journal_Title
Image Processing, IET
Publisher
iet
ISSN
1751-9659
Type
jour
DOI
10.1049/iet-ipr:2005031
Filename
4149698
Link To Document