• DocumentCode
    168620
  • Title

    Evaluating Streaming Strategies for Event Processing Across Infrastructure Clouds

  • Author

    Tudoran, Radu ; Keahey, Kate ; Riteau, Pierre ; Panitkin, Sergey ; Antoniu, Gabriel

  • Author_Institution
    IRISA/ENS Cachan Rennes, Cachan, France
  • fYear
    2014
  • fDate
    26-29 May 2014
  • Firstpage
    151
  • Lastpage
    159
  • Abstract
    Infrastructure clouds revolutionized the way in which we approach resource procurement by providing an easy way to lease compute and storage resources on short notice, for a short amount of time, and on a pay-as-you-go basis. This new opportunity, however, introduces new performance trade-offs. Making the right choices in leveraging different types of storage available in the cloud is particularly important for applications that depend on managing large amounts of data within and across clouds. An increasing number of such applications conform to a pattern in which data processing relies on streaming the data to a compute platform where a set of similar operations is repeatedly applied to independent chunks of data. This pattern is evident in virtual observatories such as the Ocean Observatory Initiative, in cases when new data is evaluated against existing features in geospatial computations or when experimental data is processed as a series of time events. In this paper, we propose two strategies for efficiently implementing such streaming in the cloud and evaluate them in the context of an ATLAS application processing experimental data. Our results show that choosing the right cloud configuration can improve overall application performance by as much as three times.
  • Keywords
    cloud computing; resource allocation; storage management; ATLAS application processing; Ocean Observatory Initiative; application performance; cloud configuration; data processing; event processing; geospatial computations; infrastructure clouds; resource procurement; storage resources; streaming strategies; virtual observatories; Bandwidth; Cloud computing; Data transfer; Measurement; Observatories; Oceans; Throughput; event processing; geographically distributed computing; infrastructure clouds; multiple data centers; stream processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/CCGrid.2014.89
  • Filename
    6846450