Title :
A dynamic load balance on GPU cluster for fork-join search
Author :
Ou, Yanlin ; Chen, Hu ; Lai, Lushuang
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
Abstract :
As a result of that every computer can have different CPUs, memory size, GPU devices and so on, they are heterogeneous and unreliable, dynamic load balancing is a difficult problem for a GPU cluster system needs to solve. In this paper, we discuss a method that can dispatch the appropriate tasks to each node to achieve load balancing. We assume that each node has an initial capability of hyper-computing, according to number of completed tasks in each cycle; this capability of each node will be updated dynamically. We will also show that how the tasks resend when some nodes disconnect to improve the system´s reliability. In our experiments, the load of each computing node can be balanced within a few minutes, and if some nodes disconnect, the computing tasks can be completed normally.
Keywords :
computer graphic equipment; coprocessors; resource allocation; GPU cluster system; computing tasks; dynamic load balancing; fork-join search; hyper-computing; system reliability; Acceleration; Computer architecture; Educational institutions; Graphics processing unit; High performance computing; Instruction sets; Silicon; Cluster; Dynamic Load Balance; GPU; High Performance Computing;
Conference_Titel :
Cloud Computing and Intelligence Systems (CCIS), 2011 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-61284-203-5
DOI :
10.1109/CCIS.2011.6045138