DocumentCode :
1557511
Title :
Forward-Projection Architecture for Fast Iterative Image Reconstruction in X-Ray CT
Author :
Kim, Jung Kuk ; Fessler, Jeffrey A. ; Zhang, Zhengya
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Michigan, Ann Arbor, MI, USA
Volume :
60
Issue :
10
fYear :
2012
Firstpage :
5508
Lastpage :
5518
Abstract :
Iterative image reconstruction can dramatically improve the image quality in X-ray computed tomography (CT), but the computation involves iterative steps of 3D forward- and back-projection, which impedes routine clinical use. To accelerate forward-projection, we analyze the CT geometry to identify the intrinsic parallelism and data access sequence for a highly parallel hardware architecture. To improve the efficiency of this architecture, we propose a water-filling buffer to remove pipeline stalls, and an out-of-order sectored processing to reduce the off-chip memory access by up to three orders of magnitude. We make a floating-point to fixed-point conversion based on numerical simulations and demonstrate comparable image quality at a much lower implementation cost. As a proof of concept, a 5-stage fully pipelined, 55-way parallel separable-footprint forward-projector is prototyped on a Xilinx Virtex-5 FPGA for a throughput of 925.8 million voxel projections/s at 200 MHz clock frequency, 4.6 times higher than an optimized 16-threaded program running on an 8-core 2.8-GHz CPU. A similar architecture can be applied to back-projection for a complete iterative image reconstruction system. The proposed algorithm and architecture can also be applied to hardware platforms such as graphics processing unit and digital signal processor to achieve significant accelerations.
Keywords :
X-ray imaging; computerised tomography; field programmable gate arrays; geometry; image reconstruction; information retrieval; iterative methods; medical image processing; numerical analysis; ίoating-point to fixed-point conversion; 16-threaded program optimization; 3D back-projection; 3D forward-projection; 5-stage fully pipelining; 55-way parallel separable-footprint forward-projector; CPU; CT geometry analysis; X-ray CT; X-ray computed tomography; Xilinx Virtex-5 FPGA; clock frequency; data access sequence; digital signal processor; fast iterative image reconstruction; frequency 2.8 GHz; frequency 200 MHz; graphics processing unit; highly parallel hardware architecture; image quality; intrinsic parallelism identification; numerical simulation; off-chip memory access reduction; out-of-order sectored processing; pipeline stall removal; routine clinical use; water-filling buffer; Computed tomography; Computer architecture; Detectors; Image reconstruction; Quantization; System-on-a-chip; X-ray imaging; Algorithm and architecture co-optimization; X-ray computed tomography; hardware acceleration; iterative image reconstruction; separable footprint projection;
fLanguage :
English
Journal_Title :
Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1053-587X
Type :
jour
DOI :
10.1109/TSP.2012.2208636
Filename :
6239609
Link To Document :
بازگشت