DocumentCode
3141373
Title
Automating data entry for an online biomedical database: a document image analysis application
Author
Thoma, George R.
Author_Institution
U.S. Nat. Libr. of Med., Bethesda, MD, USA
fYear
1999
fDate
20-22 Sep 1999
Firstpage
370
Lastpage
373
Abstract
Creating online bibliographic databases from paper-based journal articles continues to be heavily manual. An R&D center at the United States National Library of Medicine (NLM), is developing systems for automating the extraction of information from biomedical journals to create bibliographic records in MEDLINE(R), NLM´s premier online database used worldwide. The first phase of this project has resulted in a system that involves scanning and converting (by OCR) the abstracts that appear in journal articles, while keyboarding the remaining fields. A second generation system is being designed to scan/OCR other fields such as author names, institutional affiliations, page numbers, article titles and others. This system will employ scanning and OCR as well as modules that automatically zone the scanned pages, identify the zones as particular fields, and reformat the field syntax to adhere to conventional practice in MEDLINE
Keywords
bibliographic systems; document image processing; information services; medical information systems; optical character recognition; MEDLINE; OCR; abstract conversion; abstract scanning; article titles; author names; automated data entry; automated information extraction; automatic scanned page zoning; biomedical journals; document image analysis; field syntax reformatting; institutional affiliations; online bibliographic databases; online biomedical database; page numbers; paper-based journal articles; zone identification; Automation; Biomedical imaging; File servers; Image analysis; Image databases; Libraries; Network servers; Optical character recognition software; Text analysis; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location
Bangalore
Print_ISBN
0-7695-0318-7
Type
conf
DOI
10.1109/ICDAR.1999.791801
Filename
791801
Link To Document