Title :
A Multi-Approaches-Guided Preprocess Algorithm with Application to Chance Discovery
Author :
Xu, Yuezhu ; Daxin Liu ; Sun, Xiaohua ; Zhang, Jin
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Eng. Univ., Harbin
fDate :
Aug. 29 2008-Sept. 2 2008
Abstract :
Unrepresentative data samples are likely to reduce the utility of data classifiers in practical application. This study presents a multi-approaches-guided preprocess algorithm in the design of an effective chance discovery model, which bases on data crystallization, clustering and neural network techniques. We used data crystallization to discover unobservable events of the input samples with the objective of indicating unrepresentative samples, used clustering techniques to process the samples into isolated and inconsistent clusters, and neural networks to construct the chance discovery data set model. The aim of this paper is to develop a combined method for data preprocess by using different methods to preprocess different data features in order for exerting their unique characteristics. The results show its effect to industrial decision making.
Keywords :
data mining; neural nets; pattern classification; pattern clustering; text analysis; chance discovery data set model; data classifier; data clustering; data crystallization; industrial decision making; multiapproach-guided data preprocess algorithm; neural network; text analysis; unobservable event discovery; unrepresentative data sample; Algorithm design and analysis; Application software; Clustering algorithms; Computer science; Crystallization; Data engineering; Decision making; Information technology; Neural networks; Sun;
Conference_Titel :
Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-0-7695-3308-7
DOI :
10.1109/ICCSIT.2008.132