• DocumentCode
    1783378
  • Title

    Petascale General Solver for Semidefinite Programming Problems with Over Two Million Constraints

  • Author

    Fujisawa, Katsuki ; Endo, T. ; Yasui, Yuichiro ; Sato, Hikaru ; Matsuzawa, Naoki ; Matsuoka, Shingo ; Waki, Hayato

  • Author_Institution
    JST CREST, Chuo Univ., Tokyo, Japan
  • fYear
    2014
  • fDate
    19-23 May 2014
  • Firstpage
    1171
  • Lastpage
    1180
  • Abstract
    The semi definite programming (SDP) problem is one of the central problems in mathematical optimization. The primal-dual interior-point method (PDIPM) is one of the most powerful algorithms for solving SDP problems, and many research groups have employed it for developing software packages. However, two well-known major bottlenecks, i.e., the generation of the Schur complement matrix (SCM) and its Cholesky factorization, exist in the algorithmic framework of the PDIPM. We have developed a new version of the semi definite programming algorithm parallel version (SDPARA), which is a parallel implementation on multiple CPUs and GPUs for solving extremely large-scale SDP problems with over a million constraints. SDPARA can automatically extract the unique characteristics from an SDP problem and identify the bottleneck. When the generation of the SCM becomes a bottleneck, SDPARA can attain high scalability using a large quantity of CPU cores and some processor affinity and memory interleaving techniques. SDPARA can also perform parallel Cholesky factorization using thousands of GPUs and techniques for overlapping computation and communication if an SDP problem has over two million constraints and Cholesky factorization constitutes a bottleneck. We demonstrate that SDPARA is a high-performance general solver for SDPs in various application fields through numerical experiments conducted on the TSUBAME 2.5 supercomputer, and we solved the largest SDP problem (which has over 2.33 million constraints), thereby creating a new world record. Our implementation also achieved 1.713 PFlops in double precision for large-scale Cholesky factorization using 2,720 CPUs and 4,080 GPUs.
  • Keywords
    mathematical programming; mathematics computing; matrix decomposition; multiprocessing systems; parallel algorithms; software packages; CPU cores; GPUs; PDIPM algorithmic framework; SCM; SDPARA; Schur complement matrix generation; TSUBAME 2.5 supercomputer; high-performance general solver; large-scale SDP problems; mathematical optimization; memory interleaving techniques; parallel Cholesky factorization; petascale general solver; primal-dual interior-point method; processor affinity; semidefinite programming algorithm parallel version; software packages; two million constraints; Educational institutions; Matrices; Optimization; Programming; Software algorithms; Software packages; Supercomputers; Dense Matrix Algebra; GPGPU; General Optimization Solver; Semidefinite Programming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2014 IEEE 28th International
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4799-3799-8
  • Type

    conf

  • DOI
    10.1109/IPDPS.2014.121
  • Filename
    6877345