DocumentCode
9671
Title
Word Segmentation Method for Handwritten Documents based on Structured Learning
Author
Jewoong Ryu ; Hyung Il Koo ; Nam Ik Cho
Author_Institution
Dept. of Electr. & Comput. Eng., Seoul Nat. Univ., Seoul, South Korea
Volume
22
Issue
8
fYear
2015
fDate
Aug. 2015
Firstpage
1161
Lastpage
1165
Abstract
Segmentation of handwritten document images into text-lines and words is an essential task for optical character recognition. However, since the features of handwritten document are irregular and diverse depending on the person, it is considered a challenging problem. In order to address the problem, we formulate the word segmentation problem as a binary quadratic assignment problem that considers pairwise correlations between the gaps as well as the likelihoods of individual gaps. Even though many parameters are involved in our formulation, we estimate all parameters based on the Structured SVM framework so that the proposed method works well regardless of writing styles and written languages without user-defined parameters. Experimental results on ICDAR 2009/2013 handwriting segmentation databases show that proposed method achieves the state-of-the-art performance on Latin-based and Indian languages.
Keywords
handwritten character recognition; image segmentation; natural language processing; optical character recognition; support vector machines; text analysis; ICDAR 2009/2013 handwriting segmentation databases; Indian languages; Latin-based languages; binary quadratic assignment problem; handwritten document features; handwritten document image segmentation; optical character recognition; pairwise correlations; parameter estimation; structured SVM framework; structured learning; text-lines; user-defined parameters; word segmentation method; word segmentation problem; writing styles; written languages; Correlation; Cost function; Databases; Image segmentation; Signal processing algorithms; Writing; Handwritten documents; structured SVM; word segmentation;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2015.2389852
Filename
7004865
Link To Document