Title :
Mining Association Rules on Qing Court Medical Records: Semantic Abstraction and Standardization
Author :
Cong Cao ; Weimin Wang ; Cungen Cao ; Liangjun Zang
Author_Institution :
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
Abstract :
To explore the association relations among disease, pathogenesis, physician, symptoms and drug, we adapt a variational Apriori algorithm for discovering association rules on a dataset of the Qing Court Medical Records. There are five types of semantic associations we intend to discover, including Disease-Pathogenesis-Drug set(DPaD), Disease-Symptoms-Drug set (DSyD), Disease-Drug set (DD), Disease-Physician-Drug set (DPhD) and Disease-Drug Category Set (DDC). To solve the synonymity problem and the data sparseness problem, we give a mapping strategy which maps pathogenesis to standardized forms and maps drugs to drug categories. With the mapping strategy the number of frequent drug sets rises from 287 to 1184. The experimental results indicate that our method with the mapping strategy is an effective way to acquire valuable semantic association rules.
Keywords :
data mining; diseases; drugs; medical information systems; DDC; DPaD; DPhD; DSyD; Qing Court Medical Records; Qing court medical records; association relations; association rules mining; data sparseness problem; disease-drug category set; disease-drug set; disease-pathogenesis-drug set; disease-physician-drug set; disease-symptoms-drug set; drug categories; mapping strategy; semantic abstraction; semantic association rules; semantic associations; standardization; standardized forms; synonymity problem; variational Apriori algorithm; Association rules; Databases; Diseases; Drugs; Semantics; Apriori; Traditional Chinese Medicine; association rule; data sparseness; frequent drugset; knowledge base; synonymity problem;
Conference_Titel :
Computer and Information Technology (CIT), 2012 IEEE 12th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4673-4873-7
DOI :
10.1109/CIT.2012.102