DocumentCode :
3511871
Title :
Handwritten text segmentation using average longest path algorithm
Author :
Salvi, Dario ; Jun Zhou ; Waggoner, Jarrell ; Song Wang
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Carolina, Columbia, SC, USA
fYear :
2013
fDate :
15-17 Jan. 2013
Firstpage :
505
Lastpage :
512
Abstract :
Offline handwritten text recognition is a very challenging problem. Aside from the large variation of different handwriting styles, neighboring characters within a word are usually connected, and we may need to segment a word into individual characters for accurate character recognition. Many existing methods achieve text segmentation by evaluating the local stroke geometry and imposing constraints on the size of each resulting character, such as the character width, height and aspect ratio. These constraints are well suited for printed texts, but may not hold for handwritten texts. Other methods apply holistic approach by using a set of lexicons to guide and correct the segmentation and recognition. This approach may fail when the lexicon domain is insufficient. In this paper, we present a new global non-holistic method for handwritten text segmentation, which does not make any limiting assumptions on the character size and the number of characters in a word. Specifically, the proposed method finds the text segmentation with the maximum average likeliness for the resulting characters. For this purpose, we use a graph model that describes the possible locations for segmenting neighboring characters, and we then develop an average longest path algorithm to identify the globally optimal segmentation. We conduct experiments on real images of handwritten texts taken from the IAM handwriting database and compare the performance of the proposed method against an existing text segmentation algorithm that uses dynamic programming.
Keywords :
computational geometry; document image processing; handwriting recognition; handwritten character recognition; image segmentation; text analysis; IAM handwriting database; aspect ratio; average longest path algorithm; character height; character recognition; character width; constraint imposition; global nonholistic method; handwriting styles; handwritten text segmentation; local stroke geometry evaluation; neighboring character segmentation; offline handwritten text recognition; Character recognition; Dynamic programming; Image edge detection; Image segmentation; Support vector machines; Text recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Computer Vision (WACV), 2013 IEEE Workshop on
Conference_Location :
Tampa, FL
ISSN :
1550-5790
Print_ISBN :
978-1-4673-5053-2
Electronic_ISBN :
1550-5790
Type :
conf
DOI :
10.1109/WACV.2013.6475061
Filename :
6475061
Link To Document :
بازگشت