• DocumentCode
    2684132
  • Title

    Mining Association Rules on Qing Court Medical Records: Semantic Abstraction and Standardization

  • Author

    Cong Cao ; Weimin Wang ; Cungen Cao ; Liangjun Zang

  • Author_Institution
    Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
  • fYear
    2012
  • fDate
    27-29 Oct. 2012
  • Firstpage
    442
  • Lastpage
    447
  • Abstract
    To explore the associations among diseases, pathogenesis, physicians, symptoms, and drugs, we adapt a variational Apriori algorithm to discover association rules in a dataset of the Qing Court Medical Records. We aim to discover five types of semantic associations: Disease-Pathogenesis-Drug set (DPaD), Disease-Symptoms-Drug set (DSyD), Disease-Drug set (DD), Disease-Physician-Drug set (DPhD), and Disease-Drug Category set (DDC). To address the synonymity problem and the data sparseness problem, we propose a mapping strategy that maps pathogenesis to standardized forms and maps drugs to drug categories. With the mapping strategy, the number of frequent drug sets rises from 287 to 1184. The experimental results indicate that our method with the mapping strategy is an effective way to acquire valuable semantic association rules.
  • Keywords
    data mining; diseases; drugs; medical information systems; DDC; DPaD; DPhD; DSyD; Qing Court Medical Records; association relations; association rules mining; data sparseness problem; disease-drug category set; disease-drug set; disease-pathogenesis-drug set; disease-physician-drug set; disease-symptoms-drug set; drug categories; mapping strategy; semantic abstraction; semantic association rules; semantic associations; standardization; standardized forms; synonymity problem; variational Apriori algorithm; Association rules; Databases; Semantics; Apriori; Traditional Chinese Medicine; association rule; data sparseness; frequent drugset; knowledge base
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Information Technology (CIT), 2012 IEEE 12th International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4673-4873-7
  • Type

    conf

  • DOI
    10.1109/CIT.2012.102
  • Filename
    6391940