• DocumentCode
    3704172
  • Title

    Cost Models for Distributed Pattern Mining in the Cloud

  • Author

    Sabeur Aridhi;Laurent dOrazio;Mondher Maddouri;Engelbert Mephu Nguifo

  • Author_Institution
    LIMOS, Clermont Univ., Clermont-Ferrand, France
  • Volume
    2
  • fYear
    2015
  • Firstpage
    112
  • Lastpage
    119
  • Abstract
    Recently, distributed pattern mining approaches have become very popular, especially in certain domains such as bioinformatics, chemoinformatics and social networks. In most cases, the distribution of the pattern mining process generates a loss of information in the output results. Reducing this loss may affect the performance of the distributed approach and thus, the monetary cost when using cloud environments. In this context, cost models are needed to help selecting the best parameters of the used approach in order to achieve a better performance especially in the cloud. In this paper, we address the multi-criteria optimization problem of tuning thresholds related to distributed frequent pattern mining in cloud computing environment while optimizing the global monetary cost of storing and querying data in the cloud. To achieve this goal, we design cost models for managing and mining graph data with large scale pattern mining framework over a cloud architecture. Furthermore, we define four objective functions, with respect to the needs of customers. We present an experimental validation of the proposed cost models in the case of distributed subgraph mining in the cloud.
  • Keywords
    "Data mining","Cloud computing","Data transfer","Data models","Pricing","Distributed databases","Bandwidth"
  • Publisher
    ieee
  • Conference_Titel
    Trustcom/BigDataSE/ISPA, 2015 IEEE
  • Type

    conf

  • DOI
    10.1109/Trustcom.2015.569
  • Filename
    7345482