DocumentCode :
2458867
Title :
Project Daytona: Data Analytics as a Cloud Service
Author :
Barga, Roger S. ; Ekanayake, Jaliya ; Lu, Wei
Author_Institution :
eXtreme Comput. Group, Microsoft Res. Microsoft Corp., Redmond, WA, USA
fYear :
2012
fDate :
1-5 April 2012
Firstpage :
1317
Lastpage :
1320
Abstract :
Spreadsheets are established data collection and analysis tools in business, technical computing and academic research. Excel, for example, offers an attractive user interface, provides an easy to use data entry model, and offers substantial interactivity for what-if analysis. However, spreadsheets and other common client applications do not offer scalable computation for large scale data analytics and exploration. Increasingly researchers in domains ranging from the social sciences to environmental sciences are faced with a deluge of data, often sitting in spreadsheets such as Excel or other client applications, and they lack a convenient way to explore the data, to find related data sets, or to invoke scalable analytical models over the data. To address these limitations, we have developed a cloud data analytics service based on Daytona, which is an iterative MapReduce runtime optimized for data analytics. In our model, Excel and other existing client applications provide the data entry and user interaction surfaces, Daytona provides a scalable runtime on the cloud for data analytics, and our service seamlessly bridges the gap between the client and cloud. Any analyst can use our data analytics service to discover and import data from the cloud, invoke cloud scale data analytics algorithms to extract information from large datasets, invoke data visualization, and then store the data back to the cloud all through a spreadsheet or other client application they are already familiar with.
Keywords :
business data processing; cloud computing; data analysis; data visualisation; environmental factors; social sciences; spreadsheet programs; user interfaces; Excel; Project Daytona; academic research; business; cloud service; data analytics; data collection; data visualization; environmental sciences; information extraction; iterative MapReduce; social sciences; spreadsheets; technical computing; user interface; Algorithm design and analysis; Analytical models; Cloud computing; Computational modeling; Data models; Distributed databases; Runtime;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
Conference_Location :
Washington, DC
ISSN :
1063-6382
Print_ISBN :
978-1-4673-0042-1
Type :
conf
DOI :
10.1109/ICDE.2012.136
Filename :
6228197
Link To Document :
بازگشت