DocumentCode :
659533
Title :
Real-time data analysis in ClowdFlows
Author :
Kranjc, Janez ; Podpecan, Vid ; Lavrac, Nada
Author_Institution :
Jozef Stefan Inst., Ljubljana, Slovenia
fYear :
2013
fDate :
6-9 Oct. 2013
Firstpage :
15
Lastpage :
22
Abstract :
ClowdFlows is an open cloud-based platform for composition, execution, and sharing of interactive data mining workflows. In this paper we extend the ClowdFlows platform with the ability to mine real-time data streams. This functionality was implemented by creating a specialized type of workflow component and a stream mining daemon that delegates the execution of workflows in real time. In this way, we have transformed a batch data processing platform into a real-time stream mining platform with an intuitive user interface. The real-time analytics aspect of the platform is demonstrated in a Twitter sentiment analysis use case in which the sentiment of tweets about whistleblower Edward Snowden was monitored for approximately one month.
Keywords :
batch processing (computers); cloud computing; data mining; public domain software; user interfaces; ClowdFlows platform; Edward Snowden; Twitter sentiment analysis; batch data processing platform; interactive data mining workflows; intuitive user interface; open cloud based platform; real-time analytics; real-time data analysis; real-time data streams; real-time stream mining platform; stream mining daemon; whistleblower; workflow component; Data mining; Engines; Graphical user interfaces; Real-time systems; Servers; Twitter; Visualization; data mining platform; sentiment analysis; stream mining; web application; workflows
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Big Data, 2013 IEEE International Conference on
Conference_Location :
Silicon Valley, CA
Type :
conf
DOI :
10.1109/BigData.2013.6691682
Filename :
6691682