• DocumentCode
    3129728
  • Title

    Identifying HotSpots in Lung Cancer Data Using Association Rule Mining

  • Author

    Agrawal, Ankit ; Choudhary, Alok

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
  • fYear
    2011
  • fDate
    11-11 Dec. 2011
  • Firstpage
    995
  • Lastpage
    1002
  • Abstract
    We analyze the lung cancer data available from the SEER program with the aim of identifying hotspots using association rule mining techniques. A subset of 13 patient attributes from the SEER data were recently linked with the survival outcome using prediction models, which is used in this study for segmentation. The goal here is to identify characteristics of patient segments where average survival is significantly higher/lower than average survival across the entire dataset. Automated association rule mining techniques resulted in hundreds of rules, from which many redundant rules were manually removed based on domain knowledge. The resulting rules conform with existing biomedical knowledge and provide interesting insights into lung cancer survival.
  • Keywords
    cancer; data mining; lung; medical information systems; HotSpots identification; SEER program; association rule mining; automated association rule mining techniques; biomedical knowledge; lung cancer data; patient segments; Association rules; Cancer; Lungs; Lymph nodes; Surgery; Tumors; Association rule mining; hotspots; lung cancer;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4673-0005-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2011.93
  • Filename
    6137489