• DocumentCode
    3664325
  • Title

    Detecting novel multi-variable associations in big data based on MIC

  • Author

    Fubo Shao;Keping Li

  • Author_Institution
    State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing, China
  • fYear
    2015
  • fDate
    5/1/2015 12:00:00 AM
  • Firstpage
    39
  • Lastpage
    42
  • Abstract
    It is meaningful to discover valuable relationships in big data. The maximal information coefficient (MIC), a new measure of dependence of relationships, was proposed by Reshef et al. in 2011, and an approximate algorithm was designed. But the algorithm designed by Reshef et al. (2011) can only calculate the MIC of two-variable relationships. In this paper, an algorithm (BKM-MIC) is proposed. To our best knowledge, the BKM-MIC algorithm is the first algorithm calculating the MIC of multi-variable relationships. And based on the BKM-MIC algorithm, a matrix iteration algorithm with pruning (MIP) is designed. A simple example shows that MIP algorithm can not only reduce computation workload, but also can precisely identify dependent and independent multi-variable relationships.
  • Keywords
    "Microwave integrated circuits","Algorithm design and analysis","Approximation algorithms","Big data","Clustering algorithms","Partitioning algorithms","Mutual information"
  • Publisher
    ieee
  • Conference_Titel
    Electronics Information and Emergency Communication (ICEIEC), 2015 5th International Conference on
  • Print_ISBN
    978-1-4799-7283-8
  • Type

    conf

  • DOI
    10.1109/ICEIEC.2015.7284482
  • Filename
    7284482