DocumentCode :
264078
Title :
Parallel Random Access Memory in a shared memory architecture
Author :
Tran Duc Linh ; de Souza-Daw, Tony ; Thang Manh Hoang ; Nguyen Tien Dzung
Author_Institution :
Centre of Technol., RMIT Univ., Ho Chi Minh City, Vietnam
fYear :
2014
fDate :
July 30 2014-Aug. 1 2014
Firstpage :
364
Lastpage :
369
Abstract :
Parallel algorithms can significantly speed up computing performance. However, parallel architecture often needs shared-memories for concurrent access. Conventionally, parallel memories are constructed as space-multiplexed memories with many memory chips connected in parallel. This architecture normally requires a large number of interconnects with potentially large routing delay and consumes massive area. This proposal develops a new memory component called Parallel Random Access Memory (P-RAM) with four identical parallel ports. This component is designed using VHDL hardware description language and emulated on Cyclone II FPGA. The P-RAM is not a conventional RAM memory since its four ports can be read and write concurrently. It can be used for many purposes such as shared memory for multiple processors in a parallel model. The design of the P-RAM component has been fully tested in both simulation and hardware integration with processors. Five simulation test cases were used to test all possible access cases of P-RAM and they all passed. The P-RAM is synthesized as combinations of logic elements (flip-flops and logic gates) instead of normal memory bits since it is a new component to the synthesis tool. lKB P-RAM take about 11,630 logic elements. The proposed P-RAM Parallel System has two instances (distinct copy) of the P-RAM component. One is used as shared instruction memory and the another is used as shared data memory. To demonstrate P-RAM operations, one port is used to observe data memory content and 3 CPUs are connected to the other ports of P-RAM component. Due to the FPGA chip resource limitation, the P-RAM instruction memory is built with only lKB size (32-bit data 8-bit address buses). The instruction code of the parallel algorithm is hard-coded in this memory by direct memory initialization. From this memory, the parallel algorithm of Finding the Maximum is run on all processors. The P-RAM data memory has 4-bit data and 4-bit address buses.
Keywords :
field programmable gate arrays; hardware description languages; parallel algorithms; random-access storage; Cyclone II FPGA; P-RAM; P-RAM component design; VHDL hardware description language; direct memory initialization; flip-flops; logic elements; logic gates; memory chips; parallel algorithms; parallel architecture; parallel random access memory; routing delay; shared memory architecture; space multiplexed memories; speed up computing performance; Computational modeling; Computer architecture; Field programmable gate arrays; Random access memory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Electronics (ICCE), 2014 IEEE Fifth International Conference on
Conference_Location :
Danang
Print_ISBN :
978-1-4799-5049-2
Type :
conf
DOI :
10.1109/CCE.2014.6916731
Filename :
6916731
Link To Document :
بازگشت