DocumentCode
659533
Title
Real-time data analysis in ClowdFlows
Author
Kranjc, Janez ; Podpecan, Vid ; Lavrac, Nada
Author_Institution
Jozef Stefan Inst., Ljubljana, Slovenia
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
15
Lastpage
22
Abstract
ClowdFlows is an open cloud-based platform for composition, execution, and sharing of interactive data mining workflows. In this paper we extend the ClowdFlows platform with the ability to mine real-time data streams. This functionality was implemented by creating a specialized type of workflow component and a stream mining daemon that delegates the execution of workflows in real time. In this way, we have transformed a batch data processing platform into a real-time stream mining platform with an intuitive user interface. The real-time analytics aspect of the platform is demonstrated in a Twitter sentiment analysis use case where the sentiment of tweets about whistleblower Edward Snowden was monitored for approximately one month.
Keywords
batch processing (computers); cloud computing; data mining; public domain software; user interfaces; ClowdFlows platform; Edward Snowden; Twitter sentiment analysis; batch data processing platform; interactive data mining workflows; intuitive user interface; open cloud based platform; real-time analytics; real-time data analysis; real-time data streams; real-time stream mining platform; stream mining daemon; whistleblower; workflow component; Data mining; Engines; Graphical user interfaces; Real-time systems; Servers; Twitter; Visualization; data mining platform; real-time data analysis; sentiment analysis; stream mining; web application; workflows;
fLanguage
English
Publisher
IEEE
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691682
Filename
6691682