Parallel efficient hierarchical algorithms for module placement of large chips on distributed memory architectures

Author

Yang, Laurence Tianruo

Author_Institution

Dept. of Comput. Sci., Saint Francis Xavier Univ., Antigonish, NS, Canada

fYear

2002

fDate

2002

Firstpage

405

Lastpage

408

Abstract

The PROUD module placement algorithm mainly uses a hierarchical decomposition technique and the solution of sparse linear systems based on a resistive network analogy. It has been shown that the PROUD algorithm can achieve a comparable design of the placement problems for very large circuits with the best placement algorithm based on simulated annealing, but with several order of magnitude faster. The modified PROUD, namely MPROUD algorithm by perturbing the coefficient matrices performs much faster that the original PROUD algorithm. Due to the instability and unguaranteed convergence of MPROUD algorithm, we have proposed a new convergent and numerically stable PROUD, namely Improved PROUD algorithm, denoted as IPROUD with attractive computational costs to solve the module placement problems by making use of the SYMMLQ and MINRES methods based on Lanczos process (Yang, 1997). We subsequently propose parallel versions of the improved PROUD algorithms. The parallel algorithm is derived such that all inner products and matrix-vector multiplications of a single iteration step are independent. Therefore, the cost of global communication which represents the bottleneck of the parallel performance on parallel distributed memory computers can be significantly reduced, therefore, to obtain another order of magnitude improvement in the runtime without loss of the quality of the layout.

Keywords

circuit layout CAD; distributed memory systems; matrix multiplication; numerical stability; parallel algorithms; parallel architectures; sparse matrices; IPROUD; Improved PROUD algorithm; MINRES; MPROUD algorithm; PROUD; SYMMLQ; bottleneck; coefficient matrices; computational costs; distributed memory architectures; hierarchical decomposition technique; large chip module placement; matrix-vector multiplications; numerical stability; parallel algorithm; parallel distributed memory computers; parallel efficient hierarchical algorithms; resistive network analogy; simulated annealing; sparse linear systems; Algorithm design and analysis; Circuit simulation; Computational efficiency; Convergence of numerical methods; Costs; Global communication; Linear systems; Memory architecture; Parallel algorithms; Simulated annealing;

fLanguage

English

Publisher

ieee

Conference_Titel

Parallel Computing in Electrical Engineering, 2002. PARELEC '02. Proceedings. International Conference on

Print_ISBN

0-7695-1730-7

Type

conf

DOI

10.1109/PCEE.2002.1115310

Filename

1115310