Title :
Reduced computational redundancy implementation of DSP algorithms using computation sharing vector scaling
Author :
Muhammad, Khurram ; Roy, Kaushik
Author_Institution :
Texas Instrum. Inc., Dallas, TX, USA
fDate :
6/1/2002 12:00:00 AM
Abstract :
In this paper, we present a general approach which specifically targets reduction of redundant computation in common digital-signal processing (DSP) tasks such as filtering and matrix multiplication. We show that such tasks can be expressed as multiplication of vectors by scalars and this allows fast multiplication by sharing computation. Vector scaling operation is decomposed to find the most effective precomputations which yield a fast multiplier implementation. Two decomposition approaches are presented, one based on a greedy decomposition and the other based on fixed-size lookup and this leads to two multiplier architectures for vector-scalar products. Analog simulation of an example multiplier shows a speed advantage by a factor of about 1.85 over a conventional carry save array multiplier. Further simulations using 0.18 /spl mu/ technology show up to 20% speed advantage over Booth encoded Wallace tree multipliers.
Keywords :
filtering theory; matrix multiplication; redundancy; signal processing; Booth encoded Wallace tree multipliers; DSP algorithms; DSP tasks; carry save array multiplier; computation sharing vector scaling; digital signal processing tasks; filtering; fixed-size lookup; greedy decomposition; matrix multiplication; reduced computational redundancy implementation; redundant computation reduction; vector multiplication; vector scaling decomposition; Arithmetic; Computational modeling; Computer architecture; Digital filters; Digital signal processing; Filtering algorithms; High performance computing; Matrix decomposition; Read only memory;
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
DOI :
10.1109/TVLSI.2002.1043332