DocumentCode :
688294
Title :
Measuring and Discovering Correlations in Large Data Sets
Author :
Lijue Liu ; Ming Li ; Sha Wen
Author_Institution :
Coll. of Inf. Sci. & Eng., Central South Univ., Changsha, China
fYear :
2013
fDate :
13-15 Nov. 2013
Firstpage :
1302
Lastpage :
1307
Abstract :
In this paper, a class of statistics named ART (the alternant recursive topology statistics) is proposed to measure the properties of correlation between two variables. A wide range of bi-variable correlations both linear and nonlinear can be evaluated by ART efficiently and equitably even if nothing is known about the specific types of those relationships. ART compensates the disadvantages of Reshef\´s model in which no polynomial time precise algorithm exists and the "local random" phenomenon can not be identified. As a class of nonparametric exploration statistics, ART is applied for analyzing a dataset of 10 American classical indexes, as a result, lots of bi-variable correlations are discovered.
Keywords :
data analysis; statistics; ART; American classical indexes; alternant recursive topology statistics; bi-variable correlations; correlation properties; dataset analysis; large data set correlation discovery; linear correlations; nonlinear correlations; nonparametric exploration statistics; Correlation; Histograms; Indexes; Microwave integrated circuits; Polynomials; Subspace constraints; Topology; ART Statistics; Association Mining; Correlation Mining; Non-Linear Correlation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
Conference_Location :
Zhangjiajie
Type :
conf
DOI :
10.1109/HPCC.and.EUC.2013.185
Filename :
6832067
Link To Document :
بازگشت