• DocumentCode
    3065074
  • Title

    A Framework for Cloud-Based Large-Scale Data Analytics and Visualization: Case Study on Multiscale Climate Data

  • Author

    Lu, Sifei ; Li, Reuben Mingguang ; Tjhi, William Chandra ; Lee, Kee Khoon ; Wang, Long ; Li, Xiarong ; Ma, Di

  • Author_Institution
    Inst. of High Performance Comput., Agency for Sci., Technol. & Res., Singapore, Singapore
  • fYear
    2011
  • fDate
    Nov. 29 2011-Dec. 1 2011
  • Firstpage
    618
  • Lastpage
    622
  • Abstract
    In this paper, we present a cloud framework to provide cloud clustering, workflow scheduling and management, fault tolerance and distributed data storage, data analytics and visualisation services. Using a practical case study, we show that in the process of analyzing multiscale climate data, typical problems plaguing data analysts are faced. These include large datasets and limited computational resources, data complexity and limited knowledge, and varying data structures/formats and the need to integrate different tools. The implementation of our framework to climate studies was a success. This can be seen in its ability to perform spatio-temporal data analysis and visualization of a large multi-dimensional climate dataset with reduced processing time. The framework demonstrates great flexibility and simplicity for end users intending to perform data analysis by aiding the integration of data and tools and enabling interactive visualization on-the-fly. This is coupled with effective utilization of computational resources and data storage systems.
  • Keywords
    cloud computing; computational complexity; data analysis; data integration; data visualisation; fault tolerance; geographic information systems; spatiotemporal phenomena; cloud clustering; cloud-based large-scale data analytics; computational resource; data complexity; data integration; distributed data storage; fault tolerance; multiscale climate dataset visualization; spatio-temporal data analysis; workflow management; workflow scheduling; Cloud computing; Data mining; Data visualization; Engines; Meteorology; Processor scheduling; Trajectory; climate data analysis; cloud computing; multiscale data; spatio-temporal visualization; workflow scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing Technology and Science (CloudCom), 2011 IEEE Third International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4673-0090-2
  • Type

    conf

  • DOI
    10.1109/CloudCom.2011.95
  • Filename
    6133204