DocumentCode
3143344
Title
CRT-Based DSP Decryption Using Montgomery Modular Multiplication on the FPGA
Author
Song, Bo ; Ito, Yasuaki ; Nakano, Koji
Author_Institution
Dept. of Inf. Eng., Hiroshima Univ., Hiroshima, Japan
fYear
2011
fDate
16-20 May 2011
Firstpage
532
Lastpage
541
Abstract
The main contribution of this paper is to present an efficient hardware algorithm for Chinese Remainder Theorem (CRT) based RSA decryption using Montgomery multiplication algorithm. Our hardware algorithm supporting up-to 2048-bit RSA decryption is designed to be implemented using one DSP48E1 block, one Block RAM and few logic blocks in the Xilinx Virtex-6 FPGA. The implementation results show that our RSA core for 1024-bit RSA decryption runs in 11.263ms. Quite surprisingly, the multiplier in DSP block used to compute Montgomery multiplication works in more than 95% clock cycles during the processing. Hence, our implementation is close to optimal in the sense that it has only less than 5% overhead in multiplication and no further improvement is possible as long as CRT-based Montgomery multiplication based algorithm is applied. We have also succeeded in implementing 320 RSA cores in one Xilinx Virtex-6 FPGA XC6VLX240T-1 which work in parallel. The implemented parallel 320 RSA cores achieve 26.2 Mbit/s throughput for 1024-bit RSA decryption.
Keywords
cryptography; digital signal processing chips; field programmable gate arrays; logic circuits; multiplying circuits; random-access storage; CRT-based DSP decryption; Chinese remainder theorem; Montgomery modular multiplication; Xilinx Virtex-6 FPGA; block RAM; hardware algorithm; logic blocks; word length 2048 bit; Distributed processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location
Shanghai
ISSN
1530-2075
Print_ISBN
978-1-61284-425-1
Electronic_ISBN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2011.208
Filename
6008874
Link To Document