DocumentCode
729451
Title
SIMD vectorization of nested loop based on strip mining
Author
Jinlong Xu ; Huihui Sun ; Rongcai Zhao
Author_Institution
State Key Lab. of Math. Eng. & Adv. Comput., Zhengzhou, China
fYear
2015
fDate
1-3 June 2015
Firstpage
1
Lastpage
7
Abstract
The difference between vector machine and SIMD extension is analyzed at the very start. The multilevel loop vector code generation algorithm termed Codegen put forward by Kennedy and other fellows can´t be directly applied to SIMD extension as it is oriented to vector machine. The vectorization algorithm in state-of-the-art compilers can only process one level of nested loop. In order to vectorize the entire nested loop, a vectorization algorithm based on strip mining called simdcodegen is proposed. Firstly, the formation reason of dependence circles is analyzed and the role of strip mining played in elimination of dependence circles is discussed. Then on the basis of codegen, strip mining is applied on each level of the loop recursively to explore the SIMD parallelism in the nested loop. Although strip mining is always legitimate, the executing cost increases after strip mining. To assure that strip mining is beneficial, that is to say, strip mining can break some dependence circles, cycle broken test is applied before implementation of strip mining. Effectiveness of this method is verified by the experimental results.
Keywords
data mining; parallel processing; support vector machines; Codegen; SIMD extension; SIMD vectorization; multilevel loop vector code generation algorithm; nested loop; strip mining; vector machine; Data mining; Data processing; Fellows; Parallel processing; Registers; Strips; Support vector machines; Dependence cycle; Dependence distance; Local parallel; SIMD extension; Strip mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on
Conference_Location
Takamatsu
Type
conf
DOI
10.1109/SNPD.2015.7176176
Filename
7176176
Link To Document