DocumentCode
3664325
Title
Detecting novel multi-variable associations in big data based on MIC
Author
Fubo Shao;Keping Li
Author_Institution
State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing, China
fYear
2015
fDate
5/1/2015 12:00:00 AM
Firstpage
39
Lastpage
42
Abstract
It is meaningful to discover valuable relationships in big data. The maximal information coefficient (MIC), a new measure of dependence of relationships, was proposed by Reshef et al. in 2011, and an approximate algorithm was designed. But the algorithm designed by Reshef et al. (2011) can only calculate the MIC of two-variable relationships. In this paper, an algorithm (BKM-MIC) is proposed. To our best knowledge, the BKM-MIC algorithm is the first algorithm calculating the MIC of multi-variable relationships. And based on the BKM-MIC algorithm, a matrix iteration algorithm with pruning (MIP) is designed. A simple example shows that MIP algorithm can not only reduce computation workload, but also can precisely identify dependent and independent multi-variable relationships.
Keywords
"Microwave integrated circuits","Algorithm design and analysis","Approximation algorithms","Big data","Clustering algorithms","Partitioning algorithms","Mutual information"
Publisher
ieee
Conference_Titel
Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
Print_ISBN
978-1-4799-7283-8
Type
conf
DOI
10.1109/ICEIEC.2015.7284482
Filename
7284482
Link To Document