• DocumentCode
    2456193
  • Title
    Boolean Factor Analysis for Data Preprocessing in Machine Learning
  • Author
    Outrata, Jan
  • Author_Institution
    Dept. of Comput. Sci., Palacky Univ., Olomouc, Czech Republic
  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    899
  • Lastpage
    902
  • Abstract
    We present two methods for preprocessing input data in machine learning (ML). The first extends the set of attributes describing objects in the input data table with new attributes; the second replaces the original attributes with new ones. Both methods use formal concept analysis (FCA) and Boolean factor analysis, which has recently been described in terms of FCA: the new attributes are defined by so-called factor concepts computed from the input data table. The methods are demonstrated on decision tree induction. Decision trees induced from the original and from the preprocessed input data are evaluated and compared experimentally, using the standard decision tree induction algorithms ID3 and C4.5 on several benchmark datasets.
  • Keywords
    Boolean functions; data handling; decision trees; formal concept analysis; learning (artificial intelligence); Boolean factor analysis; decision tree induction algorithms C4.5; decision tree induction algorithms ID3; factor concepts; input data preprocessing methods; input data table; machine learning; Bismuth; Data mining; Data preprocessing; Learning systems; Matrix decomposition; formal concept
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type
    conf
  • DOI
    10.1109/ICMLA.2010.141
  • Filename
    5708964