DocumentCode :
183371
Title :
Page Segmentation for Historical Handwritten Document Images Using Color and Texture Features
Author :
Kai Chen ; Hao Wei ; Hennebert, Jean ; Ingold, Rolf ; Liwicki, Marcus
Author_Institution :
Dept. of Inf., Univ. of Fribourg, Fribourg, Switzerland
fYear :
2014
fDate :
1-4 Sept. 2014
Firstpage :
488
Lastpage :
493
Abstract :
In this paper we present a physical structure detection method for historical handwritten document images. We considered layout analysis as a pixel labeling problem. By classifying each pixel as either periphery, background, text block, or decoration, we achieve high quality segmentation without any assumption of specific topologies and shapes. Various color and texture features such as color variance, smoothness, Laplacian, Local Binary Patterns, and Gabor Dominant Orientation Histogram are used for classification. Some of these features have so far not got many attentions for document image layout analysis. By applying an Improved Fast Correlation-Based Filter feature selection algorithm, the redundant and irrelevant features are removed. Finally, the segmentation results are refined by a smoothing post-processing procedure. The proposed method is demonstrated by experiments conducted on three different historical handwritten document image datasets. Experiments show the benefit of combining various color and texture features for classification. The results also show the advantage of using a feature selection method to choose optimal feature subset. By applying the proposed method we achieve superior accuracy compared with earlier work on several datasets, e.g., We achieved 93% accuracy compared with 91% of the previous method on the Parzival dataset which contains about 100 million pixels.
Keywords :
document image processing; feature selection; handwritten character recognition; image segmentation; image texture; Gabor dominant orientation histogram; Parzival dataset; color variance; document image layout analysis; feature selection method; high quality segmentation; historical handwritten document image datasets; historical handwritten document images; improved fast correlation-based filter feature selection algorithm; local binary patterns; page segmentation; pixel labeling problem; texture features; Accuracy; Color; Histograms; Image color analysis; Image segmentation; Layout; Text analysis; feature selection; historical document; layout analysis; page segmentation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
ISSN :
2167-6445
Print_ISBN :
978-1-4799-4335-7
Type :
conf
DOI :
10.1109/ICFHR.2014.88
Filename :
6981067
Link To Document :
بازگشت