Title :
Implicit Data Permutation for SIMD Devices
Author :
Shen, Li ; Huang, Libo ; Xiao, Nong ; Wang, Zhiying
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
SIMD extensions are among the most effective ways to exploit data-level parallelism in current microprocessor designs. However, because of constraints such as memory address alignment and non-consecutive memory access, data permutation operations are usually needed before SIMD calculations, which limits the parallelism that can be exploited. This paper proposes an implicit data permutation mechanism. With this approach, the original explicit data permutation is split into two stages: explicit pattern setting and implicit data reorganization. The first stage is performed by scalar instructions, and the second is triggered implicitly when a vector register is read. This split creates new opportunities for further optimization. To make the mechanism programmable, several new scalar instructions are added and corresponding compilation strategies are proposed. Experimental results show that, on multimedia benchmarks, the mechanism achieves an average speedup of 1.18x over current SIMD optimization techniques.
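The two-stage split described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the paper's instruction set or hardware design: the names `VReg`, `vload`, `set_pattern`, `vread`, and `vadd`, the 4-lane width, the 8-entry register file, and the rule that a load resets the pattern to identity are all hypothetical choices made for illustration.

```c
#include <assert.h>
#include <stdint.h>

#define LANES 4      /* assumed vector width (hypothetical) */
#define NUM_VREGS 8  /* assumed register-file size (hypothetical) */

/* Hypothetical vector register: raw data plus a per-register pattern. */
typedef struct {
    int32_t data[LANES];     /* contents exactly as written */
    uint8_t pattern[LANES];  /* pattern[i] = source lane delivered at position i */
} VReg;

static VReg vregs[NUM_VREGS];

/* Write a register; in this sketch a write resets the pattern to identity. */
void vload(int r, const int32_t *src) {
    assert(r >= 0 && r < NUM_VREGS);
    for (int i = 0; i < LANES; i++) {
        vregs[r].data[i] = src[i];
        vregs[r].pattern[i] = (uint8_t)i;
    }
}

/* Stage 1, explicit pattern setting: models a scalar instruction that
   records how the register should be permuted. No data moves here. */
void set_pattern(int r, const uint8_t *pattern) {
    assert(r >= 0 && r < NUM_VREGS);
    for (int i = 0; i < LANES; i++) {
        assert(pattern[i] < LANES);
        vregs[r].pattern[i] = pattern[i];
    }
}

/* Stage 2, implicit data reorganization: every read of the register
   delivers the lanes in the order given by the stored pattern. */
void vread(int r, int32_t *out) {
    assert(r >= 0 && r < NUM_VREGS);
    for (int i = 0; i < LANES; i++) {
        out[i] = vregs[r].data[vregs[r].pattern[i]];
    }
}

/* A SIMD add that reads its operands through vread, so any permutation
   is applied as a side effect of the read, with no separate shuffle step. */
void vadd(int dst, int a, int b) {
    int32_t x[LANES], y[LANES];
    assert(dst >= 0 && dst < NUM_VREGS);
    vread(a, x);
    vread(b, y);
    for (int i = 0; i < LANES; i++) {
        vregs[dst].data[i] = x[i] + y[i];
        vregs[dst].pattern[i] = (uint8_t)i;
    }
}
```

In this model, loading `{1, 2, 3, 4}` into register 0 and `{10, 20, 30, 40}` into register 1, setting the pattern of register 1 to `{3, 2, 1, 0}`, and then calling `vadd(2, 0, 1)` adds the first operand to the reversed second operand without issuing an explicit permutation on the vector data.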
Keywords :
parallel processing; program compilers; storage allocation; SIMD calculations; SIMD devices; SIMD extension; SIMD optimization techniques; compilation strategy; consecutive memory access; data level parallelism; data permutation mechanism; data permutation operations; data reorganization; implicit data permutation; memory address alignment; microprocessor design; multimedia benchmarks; scalar instructions; vector register; Assembly; Concurrent computing; Flow graphs; Impedance; Memory management; Microprocessors; Parallel processing; Registers; Virtual colonoscopy;
Conference_Title :
Embedded and Multimedia Computing, 2009. EM-Com 2009. 4th International Conference on
Conference_Location :
Jeju
Print_ISBN :
978-1-4244-4995-8
DOI :
10.1109/EM-COM.2009.5403000