Title :
A network-aware distributed storage cache for data intensive environments
Author :
Tierney, Brian L. ; Lee, Jason ; Crowley, Brian ; Holding, Mason ; Hylton, Jeremy ; Drake, Fred L., Jr.
Author_Institution :
California Univ., Berkeley, CA, USA
Abstract :
Modern scientific computing involves organizing, moving, visualizing, and analyzing massive amounts of data at multiple sites around the world. The technologies, middleware services, and architectures used to build useful high-speed, wide area distributed systems constitute the field of data intensive computing. We describe an architecture for data intensive applications in which a high-speed distributed data cache serves as a common element for all of the sources and sinks of data. This cache-based approach provides standard interfaces to a large, application-oriented, distributed, on-line, transient storage system. We describe our implementation of this cache, how we have made it “network aware”, and how we perform dynamic load balancing based on current network conditions. We also show large increases in application throughput obtained through access to knowledge of the network conditions.
Keywords :
cache storage; client-server systems; data analysis; distributed processing; resource allocation; application throughput; data intensive applications; data intensive computing; data intensive environments; dynamic load balancing; middleware; network-aware distributed storage cache; online transient storage; scientific computing; wide area distributed systems; Cache storage; Computer architecture; Distributed computing; Grid computing; High-speed networks; Laboratories; Medical services; Middleware; Research initiatives; Streaming media
Conference_Title :
High Performance Distributed Computing, 1999. Proceedings. The Eighth International Symposium on
Conference_Location :
Redondo Beach, CA
Print_ISBN :
0-7803-5681-0
DOI :
10.1109/HPDC.1999.805297