DocumentCode
659407
Title
Storing and manipulating environmental big data with JASMIN
Author
Lawrence, B.N. ; Bennett, V.L. ; Churchill, J. ; Juckes, M. ; Kershaw, P. ; Pascoe, S. ; Pepler, S. ; Pritchard, M. ; Stephens, Adrian
Author_Institution
Dept. of Meteorol., Univ. of Reading, Reading, UK
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
68
Lastpage
75
Abstract
JASMIN is a super-data-cluster designed to provide a high-performance high-volume data analysis environment for the UK environmental science community. Thus far JASMIN has been used primarily by the atmospheric science and earth observation communities, both to support their direct scientific workflow, and the curation of data products in the STFC Centre for Environmental Data Archival (CEDA). Initial JASMIN configuration and first experiences are reported here. Useful improvements in scientific workflow are presented. It is clear from the explosive growth in stored data and use that there was a pent up demand for a suitable big-data analysis environment. This demand is not yet satisfied, in part because JASMIN does not yet have enough compute, the storage is fully allocated, and not all software needs are met. Plans to address these constraints are introduced.
Keywords
Big Data; data analysis; environmental science computing; pattern clustering; storage management; JASMIN super-data-cluster; STFC Centre for Environmental Data Archival; UK environmental science community; United Kingdom; big data analysis; big data manipulation; big data storage; data products; environmental big data; high-performance high-volume data analysis; scientific workflow; software needs; Communities; Data analysis; Data handling; Data models; Earth; Meteorology; Virtual machining; Big Data; Climate; Cloud Computing; Curation; Earth Observation;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691556
Filename
6691556
Link To Document