DocumentCode :
2422471
Title :
Architecture of Request Distributor for GPU Clusters
Author :
Doost, Mani Shafaat ; Sadjadi, S. Masoud ; da Silva, Juliano Ricardo ; Zamith, Marcelo ; Joselli, Mark ; Clua, Esteban
Author_Institution :
Sch. of Comput. & Inf. Sci., Florida Int. Univ., Miami, FL, USA
fYear :
2012
fDate :
24-25 Oct. 2012
Firstpage :
13
Lastpage :
18
Abstract :
The advent of GPU computing has enabled development of many strategies for accelerating different kinds of simulations. Even further, instead of processing an application by just using one GPU, it is a common to use a collection of GPUs as a solution. These GPUs can be located in the same machine, network, or even across a wide area network. Unfortunately, distribution and management of GPUs requires additional efforts by the user such as deal with data transfer, connection and processing among GPUs. Request distributor for GPU clusters (RDGPUC) is a software architecture which allows companies, institutes and other users to share their GPU resources. By using this architecture, each cluster can have its own software to manage internal resources and they only need to develop small code to interact with RDGPUC. This novel design brings flexibility to the system and allows everyone to share their resources without need to change their GPU cluster tool. Another interesting part of system is to allow users to submit requests from all kind of devices and platforms. Admin of this system is able to specify resource groups and special schedules for using resources. On the other hand, end-users can just use a simple interface to submit their requests on RDGPUC without knowing about internal design and current status of GPU clusters.
Keywords :
electronic engineering computing; graphics processing units; software architecture; GPU computing; RDGPUC; request distributor; request distributor for GPU clusters; software architecture; wide area network; Computational modeling; Computer architecture; Databases; Graphics processing units; Kernel; Schedules; GPU; cluster computing; load balance; resource sharing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications for Multi-Core Architectures (WAMCA), 2012 Third Workshop on
Conference_Location :
New York, NY
Print_ISBN :
978-1-4673-5025-9
Type :
conf
DOI :
10.1109/WAMCA.2012.15
Filename :
6374746
Link To Document :
بازگشت