• DocumentCode
    25425
  • Title

    Efficient Acceleration of Mutual Information Computation for Nonrigid Registration Using CUDA

  • Author

    Ikeda, Ken-ichi ; Ino, Fumihiko ; Hagihara, Kohei

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
  • Volume
    18
  • Issue
    3
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    956
  • Lastpage
    968
  • Abstract
    In this paper, we propose an efficient acceleration method for the nonrigid registration of multimodal images that uses a graphics processing unit. The key contribution of our method is efficient utilization of on-chip memory for both normalized mutual information (NMI) computation and hierarchical B-spline deformation, which compose a well-known registration algorithm. We implement this registration algorithm as a compute unified device architecture program with an efficient parallel scheme and several optimization techniques such as hierarchical data organization, data reuse, and multiresolution representation. We experimentally evaluate our method with four clinical datasets consisting of up to 512 × 512 × 296 voxels. We find that exploitation of on-chip memory achieves a 12-fold increase in speed over an off-chip memory version and, therefore, it increases the efficiency of parallel execution from 4% to 46%. We also find that our method running on a GeForce GTX 580 card is approximately 14 times faster than a fully optimized CPU-based implementation running on four cores. Some multimodal registration results are also provided to understand the limitation of our method. We believe that our highly efficient method, which completes an alignment task within a few tens of seconds, will be useful to realize rapid nonrigid registration.
  • Keywords
    digital storage; graphics processing units; image registration; information theory; medical image processing; parallel architectures; splines (mathematics); CUDA; GeForce GTX 580 card; NMI computation; alignment task; clinical datasets; compute unified device architecture program; data reuse; efficient mutual information computation acceleration; efficient on-chip memory utilization; fully optimized CPU-based implementation; graphics processing unit; hierarchical B-spline deformation; hierarchical data organization; multiresolution representation; nonrigid multimodal image registration; normalized mutual information computation; off-chip memory; optimization techniques; parallel execution efficiency; rapid nonrigid registration; registration algorithm; Acceleration; Graphics processing units; Histograms; Instruction sets; Joints; Memory management; Splines (mathematics); Acceleration; compute unified device architecture (CUDA); graphics processing unit (GPU); mutual information (MI); nonrigid registration;
  • fLanguage
    English
  • Journal_Title
    Biomedical and Health Informatics, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    2168-2194
  • Type

    jour

  • DOI
    10.1109/JBHI.2014.2310745
  • Filename
    6762836