DocumentCode :
623991
Title :
Comparative performance analysis of a Big Data NORA problem on a variety of architectures
Author :
Kogge, Peter M. ; Bayliss, David A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Notre Dame, Notre Dame, IN, USA
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
22
Lastpage :
34
Abstract :
Non Obvious Relationship Analysis (NORA) is one of the most stressing classes of Big Data Analytics problems. This paper proposes a reference NORA problem that is representative of real problems, and can rationally scale to very large sizes. It then develops a highly concurrent implementation that can run on large systems. Each step of this implementation is sized in terms of how much of four different resources (CPU, memory, disk, and network) might be used. From this, a parameterized model projecting both execution time and utilizations is used to identify the “tall poles” in performance. The parameters are then modified to represent several different target systems, from a large cluster typical of today to variations in an advanced architecture where processing has been moved into memory. A “thought experiment” then uses this model to discover the parameters of a system that would provide both a near 100X speedup, but with a balanced design where no resource is badly over or under utilized.
Keywords :
data analysis; data mining; parallel architectures; advanced architecture; big data NORA problem; big data analytics problem; concurrent implementation; execution time; nonobvious relationship analysis; parameterized model; tall pole identification; thought experiment; Blades; Computer architecture; Data handling; Data storage systems; Information management; Random access memory; Sockets; Algorithms; ECL; NORA; Performance; association mining; link discovery; non-obvious relationship analysis; parallel big data systems; performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Collaboration Technologies and Systems (CTS), 2013 International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-6403-4
Type :
conf
DOI :
10.1109/CTS.2013.6567199
Filename :
6567199
Link To Document :
بازگشت