• DocumentCode
    3199952
  • Title

    An Algebraic Parallel Treecode in Arbitrary Dimensions

  • Author

    March, William B. ; Bo Xiao ; Yu, Chenhan D. ; Biros, George

  • Author_Institution
    Inst. for Comput. Eng. & Sci., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2015
  • fDate
    25-29 May 2015
  • Firstpage
    571
  • Lastpage
    580
  • Abstract
    We present a parallel tree code for fast kernel summation in high dimensions -- a common problem in data analysis and computational statistics. Fast kernel summations can be viewed as approximation schemes for dense kernel matrices. Tree code algorithms (or simply tree codes) construct low-rank approximations of certain off-diagonal blocks of the kernel matrix. These blocks are identified with the help of spatial data structures, typically trees. There is extensive work on tree codes and their parallelization for kernel summations in three dimensions, but there is little work on high-dimensional problems. Recently, we introduced a novel tree code, ASKIT, which resolves most of the shortcomings of existing methods. We introduce novel parallel algorithms for ASKIT, derive complexity estimates, and demonstrate scalability on synthetic, scientific, and image datasets. In particular, we introduce a local essential tree construction that extends to arbitrary dimensions in a scalable manner. We introduce data transformations for memory locality and use GPU acceleration. We report results on the "Maverick" and "Stampede" systems at the Texas Advanced Computing Centre. Our largest computations involve two billion points in 64 dimensions on 32, 768 x86 cores and 8 million points in 784 dimensions on 16, 384 x86 cores.
  • Keywords
    matrix algebra; parallel algorithms; tree data structures; ASKIT; GPU acceleration; Maverick-Stampede systems; algebraic parallel treecode algorithms; arbitrary dimensions; computational statistics; data analysis; data transformations; dense kernel matrices; fast kernel summation; low-rank approximations; memory locality; off-diagonal blocks; spatial data structures; Algorithm design and analysis; Approximation algorithms; Approximation methods; Complexity theory; Kernel; Scalability; Skeleton; GPUs; kernel summation; randomized linear algebra; treecodes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
  • Conference_Location
    Hyderabad
  • ISSN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2015.86
  • Filename
    7161545