Title :
SIMD-efficient loop unrolling design for embedded multimedia applications
Author :
Dai, Yunyang ; Li, Qing ; Zhang, Qi ; Kuo, C.-C. Jay
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Due to the rising complexity of modern embedded media applications (EMAs), compilers must be able to exploit superword-level parallelism (SLP). This work analyzes the memory access patterns found in EMAs and presents a scheme for calculating a loop unrolling factor that fully exploits these patterns to generate efficient single instruction multiple data (SIMD) instructions. The loop nest is also taken into account when determining the actual memory access patterns, which can be used to improve the efficiency of the compiler. We observe an average 12-fold performance improvement in manual experiments conducted on the TriMedia TM-1300 processor for the H.264 encoding application.
Keywords :
embedded systems; multimedia computing; parallel processing; program compilers; video coding; H.264 encoding; SIMD instructions; TriMedia TM-1300 processor; compilers; embedded multimedia applications; loop nest; loop unrolling design; memory access patterns; single instruction multiple data instructions; superword level parallelism; video decoding; video encoding; Application software; Computer applications; Concurrent computing; Embedded system; Encoding; Medical services; Modems; Multimedia systems; Parallel processing; Pattern analysis
Conference_Title :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8603-5
DOI :
10.1109/ICME.2004.1394618