DocumentCode
700116
Title
Authorial manuscript image analysis using markovian models: the Bovary project
Author
Nicolas, Stephane ; Heutte, Laurent ; Paquet, Thierry
Author_Institution
Univ. de Rouen, St. Etienne du Rouvray, France
fYear
2008
fDate
25-29 Aug. 2008
Firstpage
1
Lastpage
5
Abstract
In this paper, we present our recent work on the Bovary project, a manuscript digitization project of the famous French writer Gustave Flaubert, which aims at providing an online access to a hyper textual edition of “Madame Bovary” draft sets. We first describe the global context of this project, its main objectives, and then focus on the document image analysis problem for which we have developed effective and powerful page layout extraction algorithms based on Markovian generative and discriminative models such as Hidden Markov Random Fields and Conditional Random Fields. We show with some experiments that these stochastic and contextual models are able to cope with local spatial variability while taking into account some prior knowledge about the global structure of the document image, thus being well suited to the segmentation of very complex document images like authorial manuscripts or historical documents.
Keywords
Markov processes; document image processing; history; image recognition; literature; text analysis; Bovary project; French writer Gustave Flaubert; Madame Bovary draft set; Markovian discriminative model; Markovian generative model; Markovian model; authorial manuscript image analysis; conditional random fields; document image analysis; hidden Markov random fields; historical documents; hyper textual edition; local spatial variability; manuscript digitization project; page layout extraction algorithms; Context modeling; Feature extraction; Handwriting recognition; Hidden Markov models; Image analysis; Labeling; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2008 16th European
Conference_Location
Lausanne
ISSN
2219-5491
Type
conf
Filename
7080648
Link To Document