DocumentCode :
2144699
Title :
Continuous CRF with Multi-scale Quantization Feature Functions Application to Structure Extraction in Old Newspaper
Author :
Hebert, David ; Paquet, Thierry ; Nicolas, Stephane
Author_Institution :
Lab. LITIS EA 4108, Univ. de Rouen, Rouen, France
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
493
Lastpage :
497
Abstract :
We introduce quantization feature functions to represent continuous or large range discrete data into the symbolic CRF data representation. We show that doing this convertion in a simple way allows the CRF to automaticaly select discriminative features to achieve best performance. This system is evaluated on a segmentation task of degraded newspapers archives. The results obtained show the ability of the CRF model to deal with numerical features similarly as for symbolic representation thanks to the use of quantization feature functions. The segmentation task is achieved by the definition of a horizontal CRF model dedicated to pixel labelling.
Keywords :
data structures; document image processing; feature extraction; image resolution; image segmentation; random processes; conditional random fields; continuous CRF; discriminative feature selection; horizontal CRF model; multiscale quantization feature functions; numerical features; old newspaper; pixel labelling; segmentation task; structure extraction; symbolic CRF data representation; Decoding; Feature extraction; Hidden Markov models; Image segmentation; Particle separators; Quantization; Training; L-CRF; document images labelling; quantization feature functions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.105
Filename :
6065360
Link To Document :
بازگشت