• DocumentCode
    243756
  • Title

    Integration and Automation of Data Preparation and Data Mining

  • Author

    Narayanan, Shrikanth ; Jaiswal, Ayush ; Chiang, Yao-Yi ; Geng, Yanhui ; Knoblock, Craig A. ; Szekely, Pedro

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    1076
  • Lastpage
    1085
  • Abstract
    Data mining tasks typically require significant effort in data preparation to find, transform, integrate and prepare the data for the relevant data mining tools. In addition, the work performed in data preparation is often not recorded and is difficult to reproduce from the raw data. In this paper, we present an integrated approach to data preparation and data mining that combines the two steps into a single process and maintains detailed metadata about the data sources, the steps in the process, and the resulting learned classifier produced by the data mining algorithms. We present results on an example scenario, which show that our approach significantly reduces the time it takes to perform a data mining task.
  • Keywords
    data integration; data mining; data preparation; meta data; pattern classification; data mining algorithms; data mining automation; data mining tools; data preparation automation; data sources; detailed metadata; learned classifier; raw data; Data mining; Data models; Discrete Fourier transforms; Global Positioning System; Ontologies; Semantics; Sensors;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.44
  • Filename
    7022716