DocumentCode
2628601
Title
Segmentation and classification for mixed text/image documents using neural network
Author
Imade, Shinichi ; Tatsuta, Seiji ; Wada, Toshiaki
fYear
1993
fDate
20-22 Oct 1993
Firstpage
930
Lastpage
934
Abstract
A segmentation and classification method for separating a document image into printed character, handwritten character, photograph, and painted image regions is presented. A document image is segmented into rectangular areas. Each of which contains a cluster of image elements. A layered feed-forward neural network is then used to classify each segmented area using the histograms of gradient vector directions and luminance levels. A high classification performance was obtained, even with a small number of training samples. It is confirmed that the histograms of gradient vector directions and luminance levels are significantly effective features for the classification of the four kinds of image regions. Increasing the number of the discrimination areas improves the classification performance sufficiently even using a small number of training samples for the neural network
Keywords
Data compression; High speed optical techniques; Histograms; Image coding; Image converters; Image segmentation; Image storage; Neural networks; Optical imaging; Pixel;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location
Tsukuba Science City
Print_ISBN
0-8186-4960-7
Type
conf
DOI
10.1109/ICDAR.1993.395584
Filename
395584
Link To Document