Title :
How to organize data with measurement errors?
Author :
Qian, Yuhua ; Liang, Jiye
Author_Institution :
Sch. of Comput. & Inf. Technol., Shanxi Univ., Taiyuan, China
Abstract :
Clustering analysis is an outstanding contribution to data mining and knowledge discovery from large-scale data, which has become an important research direction. Many excellent researches have been developed, whereas there is a blind point which is not addressed. How to organize data with measurement errors? What is needed for this purpose is the concept of error number- an error number which is defined as a(x)±αa(x), where a(x) is the measurement value of an object x under an attribute a, and αa(x) is the value of its measurement error. In this paper, we will explore tentatively data clustering with measurement errors through employing the framework of k-means clustering algorithm, which can be regarded as an introductory work.
Keywords :
data mining; measurement errors; pattern clustering; clustering analysis; data clustering; data mining; data organization; k-means clustering algorithm; knowledge discovery; large-scale data; measurement errors; Algorithm design and analysis; Clustering algorithms; Educational institutions; Gaussian distribution; Measurement errors; Measurement uncertainty; Vectors; Clustering Analysis; Distance; Measurement Error; k-means Algorithm;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4577-0652-3
DOI :
10.1109/ICSMC.2011.6084135