Title :
Influence of a priori Knowledge on Medical Document Categorization
Author :
Itert, Lukasz ; Duch, Wlodzislaw ; Pestian, John
Author_Institution :
Div. of Rheumatology, Children´´s Hosp. Res. Found., Cincinnati, OH
fDate :
March 1 2007-April 5 2007
Abstract :
A significant part of medical data remains stored as unstructured texts. Semantic search requires introduction of markup tags. Medical concepts discovered in hospital discharge summaries are used to create several feature spaces. Experts use their background knowledge to categorize new documents, and knowing category of the document disambiguate words and acronyms. A model of document similarity to reference sources that captures some intuitions of an expert is introduced. Parameters of the model are evaluated using linear programming techniques. This approach is applied to categorization of the medical discharge summaries providing simpler and more accurate model than alternative text categorization approaches
Keywords :
document handling; linear programming; medical administrative data processing; a priori knowledge; document similarity; feature spaces; hospital discharge summaries; linear programming; markup tags; medical data; medical document categorization; semantic search; Biomedical informatics; Computational intelligence; Data mining; Fault location; Hospitals; Linear programming; Medical diagnostic imaging; Natural language processing; Pediatrics; Unified modeling language;
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
DOI :
10.1109/CIDM.2007.368868