• DocumentCode
    659533
  • Title

    Real-time data analysis in ClowdFlows

  • Author

    Kranjc, Janez ; Podpecan, Vid ; Lavrac, Nada

  • Author_Institution
    Jozef Stefan Inst., Ljubljana, Slovenia
  • fYear
    2013
  • fDate
    6-9 Oct. 2013
  • Firstpage
    15
  • Lastpage
    22
  • Abstract
    ClowdFlows is an open cloud based platform for composition, execution, and sharing of interactive data mining workflows. In this paper we extend the ClowdFlows platform with the ability to mine real-time data streams. This functionality was implemented by creating a specialized type of workflow component and a stream mining daemon that delegates the execution of workflows in real-time. In this way, we have transformed a batch data processing platform into a real-time stream mining platform with an intuitive user interface. The real-time analytics aspect of the platform is demonstrated in a Twitter sentiment analysis use case where the sentiment of tweets about whistleblower Edward Snowden was monitored for approximately one month.
  • Keywords
    batch processing (computers); cloud computing; data mining; public domain software; user interfaces; ClowdFlows platform; Edward Snowden; Twitter sentiment analysis; batch data processing platform; interactive data mining workflows; intuitive user interface; open cloud based platform; real-time analytics; real-time data analysis; real-time data streams; real-time stream mining platform; stream mining daemon; whistleblower; workflow component; Data mining; Engines; Graphical user interfaces; Real-time systems; Servers; Twitter; Visualization; data mining platform; real-time data analysis; sentiment analysis; stream mining; web application; workflows;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data, 2013 IEEE International Conference on
  • Conference_Location
    Silicon Valley, CA
  • Type

    conf

  • DOI
    10.1109/BigData.2013.6691682
  • Filename
    6691682