• DocumentCode
    3063089
  • Title

    Scalable and Low-Latency Data Processing with Stream MapReduce

  • Author

    Brito, Andrey ; Martin, André ; Knauth, Thomas ; Creutz, Stephan ; Becker, Diogo ; Weigert, Stefan ; Fetzer, Christof

  • Author_Institution
    Univ. Fed. de Campina Grande, Campina Grande, Brazil
  • fYear
    2011
  • fDate
    Nov. 29 2011-Dec. 1 2011
  • Firstpage
    48
  • Lastpage
    58
  • Abstract
    We present StreamMapReduce, a data processing approach that combines ideas from the popular MapReduce paradigm and recent developments in Event Stream Processing. We adopted the simple and scalable programming model of MapReduce and added continuous, low-latency data processing capabilities previously found only in Event Stream Processing systems. This combination leads to a system that is efficient and scalable, but at the same time, simple from the user´s point of view. For latency-critical applications, our system allows a hundred-fold improvement in response time. Notwithstanding, when throughput is considered, our system offers a ten-fold per node throughput increase in comparison to Hadoop. As a result, we show that our approach addresses classes of applications that are not supported by any other existing system and that the MapReduce paradigm is indeed suitable for scalable processing of real-time data streams.
  • Keywords
    data handling; distributed processing; StreamMapReduce paradigm; event stream processing; latency-critical application; real-time data stream; scalable low latency data processing; scalable programming model; Databases; Fault tolerance; Fault tolerant systems; Instruction sets; Programming; Real time systems; Scalability; Complex Event Processing; Distributed Computing; Event Stream Processing; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4673-0090-2
  • Type

    conf

  • DOI
    10.1109/CloudCom.2011.17
  • Filename
    6133126