• DocumentCode
    2131371
  • Title

    A novel rule infusion technique for generating simulated binary data to validate data mining methods

  • Author

    Siadat, Mohammad-Reza ; Craig, Douglas ; Hickman, Gregory F. ; Ogunyemi, Theophilus ; Diokno, Ananias

  • Author_Institution
    Dept. of Comp Sci., Oakland Univ., Rochester, MI, USA
  • fYear
    2012
  • fDate
    16-18 Oct. 2012
  • Firstpage
    945
  • Lastpage
    948
  • Abstract
    Mathematical analysis of existing data mining methods is not straightforward and in many cases it is not possible. Therefore, simulated data plays a central role in validation of data mining results in a given situation, i.e., noise, missing value and multicollinearity levels. This paper proposes a longitudinal binary data simulation focusing on presentation of the major challenge of infusing user-defined rules. Results of applying Apriori, PRAT, Prism, and JRip rule extraction methods on these simulated data in several missing value levels are presented in this paper. This simulation proved to be essential in verifying data mining results that we have generated on Medical Epidemiological and Social Aspects of Aging (MESA) data set.
  • Keywords
    data mining; mathematical analysis; Apriori; JRip rule extraction methods; MESA data set; Medical Epidemiological and Social Aspects of Aging; PRAT; Prism; data mining methods; infusing user-defined rules; mathematical analysis; missing value levels; multicollinearity levels; rule infusion technique; simulated binary data generation; data mining; rule extraction; rule infusion; simulation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Biomedical Engineering and Informatics (BMEI), 2012 5th International Conference on
  • Conference_Location
    Chongqing
  • Print_ISBN
    978-1-4673-1183-0
  • Type

    conf

  • DOI
    10.1109/BMEI.2012.6512924
  • Filename
    6512924