Title :
Word separation of unconstrained handwritten text lines in PCR forms
Author :
Nwogu, Ifeoma ; Kim, Gyeonghwan
Author_Institution :
Dept. of CSE, New York State Univ., Buffalo, NY, USA
fDate :
29 Aug.-1 Sept. 2005
Abstract :
An approach for segmenting handwritten text in a pre-hospital care report (PCR) is presented. Segmentation of lines and words in a PCR is extremely challenging due to the nature of the environment in which the reports are created, giving rise to low quality, poorly written, loosely constrained data. Stroke analyses are performed and image primitives are extracted for word detection. A heuristics-based approach, involving gap spacing, height transitions, and the average stroke width of the writer is used in detecting word boundaries. Carbon copies of live PCRs are used for testing. Experiments show perfect segmentation of 69%, outperforming the more tested and proven algorithms by as much as 15%.
Keywords :
document image processing; handwritten character recognition; image segmentation; medical information systems; text analysis; PCR forms; gap spacing; handwritten text segmentation; height transition; heuristics-based approach; prehospital care report; stroke analysis; unconstrained handwritten text lines; word boundary detection; word detection; word separation; Data analysis; Data mining; Data preprocessing; Image analysis; Image segmentation; Medical services; Performance analysis; Personnel; Testing; Writing;
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
Print_ISBN :
0-7695-2420-6
DOI :
10.1109/ICDAR.2005.255