Title :
Continuous CRF with Multi-scale Quantization Feature Functions Application to Structure Extraction in Old Newspaper
Author :
Hebert, David ; Paquet, Thierry ; Nicolas, Stephane
Author_Institution :
Lab. LITIS EA 4108, Univ. de Rouen, Rouen, France
Abstract :
We introduce quantization feature functions to represent continuous or large range discrete data into the symbolic CRF data representation. We show that doing this convertion in a simple way allows the CRF to automaticaly select discriminative features to achieve best performance. This system is evaluated on a segmentation task of degraded newspapers archives. The results obtained show the ability of the CRF model to deal with numerical features similarly as for symbolic representation thanks to the use of quantization feature functions. The segmentation task is achieved by the definition of a horizontal CRF model dedicated to pixel labelling.
Keywords :
data structures; document image processing; feature extraction; image resolution; image segmentation; random processes; conditional random fields; continuous CRF; discriminative feature selection; horizontal CRF model; multiscale quantization feature functions; numerical features; old newspaper; pixel labelling; segmentation task; structure extraction; symbolic CRF data representation; Decoding; Feature extraction; Hidden Markov models; Image segmentation; Particle separators; Quantization; Training; L-CRF; document images labelling; quantization feature functions;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.105