• DocumentCode
    633109
  • Title

    Interpretable models from distributed data via merging of decision trees

  • Author

    Andrzejak, Artur ; Langner, Felix ; Zabala, Silvestre

  • Author_Institution
    PVS, Heidelberg Univ., Heidelberg, Germany
  • fYear
    2013
  • fDate
    16-19 April 2013
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    Learning from distributed data becomes increasingly important. Factors contributing to this trend include emergence of data sets exceeding RAM sizes and inherently distributed scenarios such as mobile environments. Also in these cases interpretable models are favored: they facilitate identifying artifacts and understanding the impact of individual variables. Given the distributed environment, even if the individual learner on each site is interpretable, the overall model usually is not (as e.g. in case of voting schemes). To overcome this problem we propose an approach for efficient merging of decision trees (each learned independently) into a single decision tree. The method complements the existing distributed decision trees algorithms by providing interpretable intermediate models and tolerating constraints on bandwidth and RAM size. The latter properties are achieved by trading RAM and communication constraints for accuracy. Our method and the mentioned trade-offs are validated in experiments on real-world data sets.
  • Keywords
    data analysis; decision trees; distributed databases; learning (artificial intelligence); RAM sizes; communication constraints; data sets; distributed data; distributed decision trees algorithms; interpretable intermediate models; learning; mobile environments; voting schemes; Accuracy; Decision trees; Distributed databases; Handheld computers; Merging; Program processors; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining (CIDM), 2013 IEEE Symposium on
  • Conference_Location
    Singapore
  • Type

    conf

  • DOI
    10.1109/CIDM.2013.6597210
  • Filename
    6597210