DocumentCode :
3023534
Title :
A lexicon reduction strategy in the context of handwritten medical forms
Author :
Milewski, Robert ; Setlur, Srirangaraj ; Govindaraju, Venu
Author_Institution :
Center of Excellence for Document Anal. & Recognition, State Univ. of New York, Buffalo, NY, USA
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
1146
Abstract :
Traditional handwriting recognition algorithms rely heavily on small lexicons and clean word images. Unfortunately, emergency medical documents do not satisfy either of these conditions. This is a significant road-block that is hampering efforts to rapidly convert valuable offline healthcare handwriting data into digital content that can be efficiently mined for information. This paper describes a strategy whereby given an image representing a noisy handwritten word from a medical document, and a large lexicon consisting of English, medical and pharmacological words, symbols, abbreviations and acronyms, significantly reduces the size of the lexicon while keeping the unknown desired entry within the lexicon. The approach combines geometric interpretations of the word image along with contextual inference of concepts to reduce lexicons for word recognition. The data extracted can then be efficiently and securely disseminated for epidemiological and outbreak detection/analysis. Experimental results on NY State PCR forms are reported.
Keywords :
handwriting recognition; health care; medical administrative data processing; medical computing; medical image processing; English lexicon; contextual inference; data extraction; digital content; emergency medical documents; epidemiological detection; geometric interpretations; handwriting recognition; handwritten medical forms; lexicon reduction; medical words; noisy handwritten word; offline healthcare handwriting data; outbreak analysis; outbreak detection; pharmacological abbreviations; pharmacological acronyms; pharmacological symbols; pharmacological words; word recognition; Algorithm design and analysis; Artificial neural networks; Biomedical imaging; Data mining; Databases; Handwriting recognition; Image recognition; Medical services; Text analysis; Venus;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.20
Filename :
1575723
Link To Document :
بازگشت