DocumentCode :
3385027
Title :
Parallel Implementation of 2D-DWT by Purging Read after Write Dependency for High Speed Applications
Author :
Ashraf, M. ; Baig, Mirza Sami ; Khan, L.A. ; Hassan, Asif
Author_Institution :
Center for Cyber Technol. & Spectrum Manage., Nat. Univ. of Sci. & Technol., Islamabad
fYear :
2007
fDate :
12-13 Nov. 2007
Firstpage :
263
Lastpage :
268
Abstract :
This paper proposes an efficient implementation of multistage, multiple-level DSP algorithms suitable for parallel and distributed processing. To describe our method we selected Mallat\´s algorithm for two dimensional wavelet transforms (2D-DWT) coefficient computation which has multistage and multilevel processing requirements. We have selected field programmable gate arrays (FPGA) as a processing unit because of its inherited parallel processing capabilities but our method is not limited to FPGAs only. Our method directly computes 2D-DWT coefficients without computing and storing intermediate results; which makes it faster; resource saving and removes read after Write (RAW) dependencies. We discuss multistage and single level implementation but ideally it can be extended to n-level implementation. We also proposed method for generation of "mutually scaled filter coefficients (MSFC)" and computation of maximum number of parallel processors for optimized performance in this particular case. Both lookup tables (LUTs) and multipliers along with addition/subtraction architecture can be used. However, LUTs have advantage of high processing speed. Two computational stages are combined into a single stage to remove read after write (RAW) dependency. Our implementation takes N2/4+L2/4-2 time units to compute 2D- DWT of N x N input data, with filter length of L without intermediate storage. This method can be used for other multistage, multilevel DSP problems. Quartusreg II IDE and Altera Stratix device is used for implementation.
Keywords :
discrete wavelet transforms; field programmable gate arrays; image processing; parallel processing; table lookup; 2D-DWT; field programmable gate arrays; lookup tables; multistage multiple-level DSP algorithms; mutually scaled filter coefficients; parallel processing; read after write dependencies; two dimensional wavelet transforms; Computer architecture; Concurrent computing; Digital signal processing; Distributed processing; Field programmable gate arrays; Filters; Optimization methods; Parallel processing; Table lookup; Wavelet transforms; Concurrent Readable Memory; Image processing; Parallel Processing; Pipelining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Technologies, 2007. ICET 2007. International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4244-1493-2
Electronic_ISBN :
978-1-4244-1494-9
Type :
conf
DOI :
10.1109/ICET.2007.4516355
Filename :
4516355
Link To Document :
بازگشت