DocumentCode :
3021749
Title :
Word separation of unconstrained handwritten text lines in PCR forms
Author :
Nwogu, Ifeoma ; Kim, Gyeonghwan
Author_Institution :
Dept. of CSE, New York State Univ., Buffalo, NY, USA
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
715
Abstract :
An approach for segmenting handwritten text in a pre-hospital care report (PCR) is presented. Segmentation of lines and words in a PCR is extremely challenging due to the nature of the environment in which the reports are created, giving rise to low quality, poorly written, loosely constrained data. Stroke analyses are performed and image primitives are extracted for word detection. A heuristics-based approach, involving gap spacing, height transitions, and the average stroke width of the writer is used in detecting word boundaries. Carbon copies of live PCRs are used for testing. Experiments show perfect segmentation of 69%, outperforming the more tested and proven algorithms by as much as 15%.
Keywords :
document image processing; handwritten character recognition; image segmentation; medical information systems; text analysis; PCR forms; gap spacing; handwritten text segmentation; height transition; heuristics-based approach; prehospital care report; stroke analysis; unconstrained handwritten text lines; word boundary detection; word detection; word separation; Data analysis; Data mining; Data preprocessing; Image analysis; Image segmentation; Medical services; Performance analysis; Personnel; Testing; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.255
Filename :
1575638
Link To Document :
بازگشت