Title :
Wide-area transposition-driven scheduling
Author :
Romein, John W. ; Bal, Henri E.
Author_Institution :
Dept. of Math. & Comput. Sci., Vrije Univ., Amsterdam, Netherlands
Abstract :
The distributed searching of state spaces containing cycles is a challenging task and has been studied for several years. Traditional parallel search algorithms either ignore the cyclic nature of the state space and waste much time in duplicated search effort, or they rely on heavy communication to reduce duplicate work, resulting in a large communication overhead. Both methods perform poorly, even when using a fast, local interconnection. A recently-developed task distribution scheme, called transposition-driven scheduling (TDS), performs much better, since it communicates asynchronously and efficiently suppresses duplicate search effort. TDS, however, requires bandwidths of megabytes per second per processor. In this paper, we investigate how cyclic state spaces can be searched efficiently on a meta-computing system containing multiple clusters, connected by high-latency, low-bandwidth wide-area links. This is quite a challenge, because the wide-area links provide neither the bandwidth required for TDS nor the latency required for traditional distributed search algorithms. We propose a scheme that strongly reduces communication between clusters at the expense of some duplicate search effort. Performance measurements for several applications show that the new scheme outperforms traditional schemes by a wide margin
Keywords :
distributed algorithms; scheduling; search problems; state-space methods; wide area networks; asynchronous communication; bandwidth; communication overhead; cyclic state spaces; distributed search algorithms; distributed state-space searching; distributed supercomputing; duplicated search effort; fast local interconnection; high-latency low-bandwidth wide-area links; inter-cluster communication; meta-computing system; multiple clusters; parallel search algorithms; performance; task distribution scheme; transposition-driven scheduling; Application software; Bandwidth; Concurrent computing; Delay; Distributed computing; Grid computing; Mathematics; Processor scheduling; State-space methods; Table lookup;
Conference_Titel :
High Performance Distributed Computing, 2001. Proceedings. 10th IEEE International Symposium on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7695-1296-8
DOI :
10.1109/HPDC.2001.945202