• DocumentCode
    3074579
  • Title

    Techniques for Missing Value Recovering in Imbalanced Databases: Application in a Marketing Database with Massive Missing Data

  • Author

    Zarate, Luis E. ; Nogueira, Bruno M. ; Santos, Tadeu R A ; Song, Mark A J

  • Author_Institution
    Pontifical Catholic Univ. of Minas Gerais, Belo Horizonte
  • Volume
    3
  • fYear
    2006
  • fDate
    8-11 Oct. 2006
  • Firstpage
    2658
  • Lastpage
    2664
  • Abstract
    Missing data in databases are considered to be one of the biggest problems faced on data mining application. This problem can be aggravated when there is massive missing data in the presence of imbalanced databases. Several techniques as imputation, classifiers and approximation of patterns have been proposed and compared, but these comparisons do not consider adverse conditions found in real databases. In this work, it is presented a comparison of techniques used to classify records from a real imbalanced database with massive missing data, where the main objective is the database pre-processing to recover and select records completely filled for a further application of the techniques. It was compared algorithms such as clustering, decision tree, artificial neural networks and Bayesian classifier. Through the results, it can be verified that the problem characterization and database understanding are essential steps for a correct techniques comparison in a real problem. It was observed that artificial neural networks are an interesting alternative for this kind of problem since it is capable to obtain satisfactory results even when dealing with real-world problems.
  • Keywords
    data mining; database management systems; marketing data processing; neural nets; artificial neural network; data mining; imbalanced database; marketing database; missing value recovery; Artificial neural networks; Bayesian methods; Classification tree analysis; Clustering algorithms; Communication industry; Cybernetics; Data mining; Databases; Decision trees; Finance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    1-4244-0099-6
  • Electronic_ISBN
    1-4244-0100-3
  • Type

    conf

  • DOI
    10.1109/ICSMC.2006.385265
  • Filename
    4274271