• DocumentCode
    669761
  • Title

    Precision-energy-throughput scaling of generic matrix multiplication and discrete convolution kernels via linear projections

  • Author

    Anam, Mohammad Ashraful ; Whatmough, Paul ; Andreopoulos, Yiannis

  • Author_Institution
    Electron. & Electr. Eng. Dept., Univ. Coll. London, London, UK
  • fYear
    2013
  • fDate
    3-4 Oct. 2013
  • Firstpage
    21
  • Lastpage
    30
  • Abstract
    Generic matrix multiplication (GEMM) and one-dimensional discrete convolution/cross-correlation (CONV) kernels perform the bulk of the compute- and memory-intensive processing within image/audio recognition and matching systems. We propose a novel method to scale the energy and processing throughput of GEMM and CONV kernels for such error-tolerant multimedia applications by adjusting the precision of computation. Our technique employs linear projections to the input matrix or signal data during the top-level GEMM and CONV blocking and reordering. The GEMM and CONV kernel processing then uses the projected inputs and the results are accumulated to form the final outputs. Throughput and energy scaling takes place by decreasing the number of projections computed by each kernel, which in turn produces approximate results, i.e. lowers the precision of the performed computation. Existing realizations of error-tolerant multimedia applications can opt to utilize a small number of the input projections (typically just one) in order to save energy and processing cycles, while all error-intolerant systems can compute all input projections and obtain full-precision outputs. Results derived from a voltage- and frequency-scaled ARM Cortex A15 processor running face recognition demonstrate that the proposed approach allows for 5-fold to 10-fold increase of processing throughput and more than 80% decrease of energy consumption against optimized GEMM and CONV kernels without any impact in the expected recognition and matching precision.
  • Keywords
    convolution; face recognition; fault tolerant computing; image matching; matrix multiplication; multimedia computing; operating system kernels; power aware computing; 1D CONV kernels; 1D discrete convolution/cross-correlation; ARM Cortex A15 processor; CONV blocking; CONV reordering; GEMM kernels; audio recognition; compute-intensive processing; discrete convolution kernels; energy consumption; energy saving; error-tolerant multimedia applications; face recognition; frequency scaling; generic matrix multiplication kernels; image matching system; image recognition; input matrix; linear projections; matching precision; memory-intensive processing; precision-energy-throughput scaling; processing cycles; processing throughput; recognition precision; signal data; top-level GEMM blocking; voltage scaling; Acceleration; Convolution; Kernel; Multimedia communication; Signal processing algorithms; Throughput; Vectors; discrete convolution; embedded systems; energy and throughput scaling; generic matrix multiplication; multimedia recognition and matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Embedded Systems for Real-time Multimedia (ESTIMedia), 2013 IEEE 11th Symposium on
  • Conference_Location
    Montreal, QC
  • Type

    conf

  • DOI
    10.1109/ESTIMedia.2013.6704499
  • Filename
    6704499