DocumentCode
3021749
Title
Word separation of unconstrained handwritten text lines in PCR forms
Author
Nwogu, Ifeoma ; Kim, Gyeonghwan
Author_Institution
Dept. of CSE, New York State Univ., Buffalo, NY, USA
fYear
2005
fDate
29 Aug.-1 Sept. 2005
Firstpage
715
Abstract
An approach for segmenting handwritten text in a pre-hospital care report (PCR) is presented. Segmentation of lines and words in a PCR is extremely challenging due to the nature of the environment in which the reports are created, giving rise to low quality, poorly written, loosely constrained data. Stroke analyses are performed and image primitives are extracted for word detection. A heuristics-based approach, involving gap spacing, height transitions, and the average stroke width of the writer is used in detecting word boundaries. Carbon copies of live PCRs are used for testing. Experiments show perfect segmentation of 69%, outperforming the more tested and proven algorithms by as much as 15%.
Keywords
document image processing; handwritten character recognition; image segmentation; medical information systems; text analysis; PCR forms; gap spacing; handwritten text segmentation; height transition; heuristics-based approach; prehospital care report; stroke analysis; unconstrained handwritten text lines; word boundary detection; word detection; word separation; Data analysis; Data mining; Data preprocessing; Image analysis; Image segmentation; Medical services; Performance analysis; Personnel; Testing; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN
1520-5263
Print_ISBN
0-7695-2420-6
Type
conf
DOI
10.1109/ICDAR.2005.255
Filename
1575638
Link To Document