Title :
Parallel efficient hierarchical algorithms for module placement of large chips on distributed memory architectures
Author :
Yang, Laurence Tianruo
Author_Institution :
Dept. of Comput. Sci., Saint Francis Xavier Univ., Antigonish, NS, Canada
Abstract :
The PROUD module placement algorithm mainly uses a hierarchical decomposition technique and the solution of sparse linear systems based on a resistive network analogy. It has been shown that the PROUD algorithm can achieve a comparable design of the placement problems for very large circuits with the best placement algorithm based on simulated annealing, but with several order of magnitude faster. The modified PROUD, namely MPROUD algorithm by perturbing the coefficient matrices performs much faster that the original PROUD algorithm. Due to the instability and unguaranteed convergence of MPROUD algorithm, we have proposed a new convergent and numerically stable PROUD, namely Improved PROUD algorithm, denoted as IPROUD with attractive computational costs to solve the module placement problems by making use of the SYMMLQ and MINRES methods based on Lanczos process (Yang, 1997). We subsequently propose parallel versions of the improved PROUD algorithms. The parallel algorithm is derived such that all inner products and matrix-vector multiplications of a single iteration step are independent. Therefore, the cost of global communication which represents the bottleneck of the parallel performance on parallel distributed memory computers can be significantly reduced, therefore, to obtain another order of magnitude improvement in the runtime without loss of the quality of the layout.
Keywords :
circuit layout CAD; distributed memory systems; matrix multiplication; numerical stability; parallel algorithms; parallel architectures; sparse matrices; IPROUD; Improved PROUD algorithm; MINRES; MPROUD algorithm; PROUD; SYMMLQ; bottleneck; coefficient matrices; computational costs; distributed memory architectures; hierarchical decomposition technique; large chip module placement; matrix-vector multiplications; numerical stability; parallel algorithm; parallel distributed memory computers; parallel efficient hierarchical algorithms; resistive network analogy; simulated annealing; sparse linear systems; Algorithm design and analysis; Circuit simulation; Computational efficiency; Convergence of numerical methods; Costs; Global communication; Linear systems; Memory architecture; Parallel algorithms; Simulated annealing;
Conference_Titel :
Parallel Computing in Electrical Engineering, 2002. PARELEC '02. Proceedings. International Conference on
Print_ISBN :
0-7695-1730-7
DOI :
10.1109/PCEE.2002.1115310