Title :
A New Filter Method of Specific Sample Points Based on Partial Least-Squares Analysis
Author :
Jianxiao, Guo ; Yarong, Gao ; Jinling, Li ; Fang, Zhao
Author_Institution :
Sch. of Int. Bus., Tianjin Foreign Studies Univ., Tianjin, China
Abstract :
Specific sample point was a kind of noise that must be excluded from original data in the process of data mining and machine learning. A new filter method of specific sample points based on partial least-squares analysis was introduced in this paper. Two conceptions of true and false specific sample points were given and their relationship was elaborated in detail. The paper defined the critical value distinguishing true and false specific sample points and presented the critical formula. A novel method for identifying true and false specific sample points was described by using ellipse T2 diagram, ellipsoid T2 and scatter diagram of the principal component. Discrimination and filter method of specific sample points took great effect on eliminating samples created by random factors and purifying ultimate model.
Keywords :
data mining; learning (artificial intelligence); principal component analysis; data mining; ellipse T2 diagram; ellipsoid T2 and scatter diagram; filter method; machine learning; partial least-squares analysis; specific sample point; Conference management; Data mining; Education; Electronic mail; Engineering management; Filters; Information analysis; Information management; Information technology; Mathematical model; critical value; data mining; machine learning; partial least-squares; specific sample points;
Conference_Titel :
Future Information Technology and Management Engineering, 2009. FITME '09. Second International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-5339-9
DOI :
10.1109/FITME.2009.73