• DocumentCode
    2833174
  • Title

    A Multi-Approaches-Guided Preprocess Algorithm with Application to Chance Discovery

  • Author

    Xu, Yuezhu ; Daxin Liu ; Sun, Xiaohua ; Zhang, Jin

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin
  • fYear
    2008
  • fDate
    Aug. 29 2008-Sept. 2 2008
  • Firstpage
    182
  • Lastpage
    185
  • Abstract
    Unrepresentative data samples are likely to reduce the utility of data classifiers in practical application. This study presents a multi-approaches-guided preprocess algorithm in the design of an effective chance discovery model, which bases on data crystallization, clustering and neural network techniques. We used data crystallization to discover unobservable events of the input samples with the objective of indicating unrepresentative samples, used clustering techniques to process the samples into isolated and inconsistent clusters, and neural networks to construct the chance discovery data set model. The aim of this paper is to develop a combined method for data preprocess by using different methods to preprocess different data features in order for exerting their unique characteristics. The results show its effect to industrial decision making.
  • Keywords
    data mining; neural nets; pattern classification; pattern clustering; text analysis; chance discovery data set model; data classifier; data clustering; data crystallization; industrial decision making; multiapproach-guided data preprocess algorithm; neural network; text analysis; unobservable event discovery; unrepresentative data sample; Algorithm design and analysis; Application software; Clustering algorithms; Computer science; Crystallization; Data engineering; Decision making; Information technology; Neural networks; Sun;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-0-7695-3308-7
  • Type

    conf

  • DOI
    10.1109/ICCSIT.2008.132
  • Filename
    4624857