DocumentCode :
3707083
Title :
On Streaming Consistency of Big Data Stream Processing in Heterogenous Clutsers
Author :
Fatos Xhafa;Victor Naranjo;Leonard Barolli;Makoto Takizawa
Author_Institution :
Univ. Politec. de Catalunya, Barcelona, Spain
fYear :
2015
Firstpage :
476
Lastpage :
482
Abstract :
There is currently a growing interest on Big Data Stream processing. With the increasing capabilities of Internet-based computing systems to generate, store and process Big Data Streams, various applications are benefiting from the information extracted in real time from data streams. Such families of applications comprise IoT-based monitoring systems, decision making systems, recommendation systems, security, etc. In all of them, the common ingredient is that of using real time data analytics for the purpose at hand. The foremost challenge of processing Big Data Streams is that of throughput performance, i.e., the computing system should be able to compute at high throughput to accommodate the high data stream rate generation in input. Additionally to this challenge, the processing should be such that the consistency of streaming is achieved. The consistency of streaming requires that the order of events in the original stream (in input) is preserved also in the output. In this paper we show that using a heterogeneous cluster for Big Data Stream processing could indeed incur into streaming inconsistency and one would indeed have to carefully tune the system so that streaming consistency is achieved. We exemplify the approach using the Yahoo!S4 for processing the Big Data Stream from FlightRadar24 global flight monitoring system.
Keywords :
"Big data","Data mining","Real-time systems","Monitoring","Airports","Aircraft","Data models"
Publisher :
ieee
Conference_Titel :
Network-Based Information Systems (NBiS), 2015 18th International Conference on
Type :
conf
DOI :
10.1109/NBiS.2015.122
Filename :
7350662
Link To Document :
بازگشت