• DocumentCode
    729451
  • Title

    SIMD vectorization of nested loop based on strip mining

  • Author

    Jinlong Xu ; Huihui Sun ; Rongcai Zhao

  • Author_Institution
    State Key Lab. of Math. Eng. & Adv. Comput., Zhengzhou, China
  • fYear
    2015
  • fDate
    1-3 June 2015
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    The difference between vector machine and SIMD extension is analyzed at the very start. The multilevel loop vector code generation algorithm termed Codegen put forward by Kennedy and other fellows can´t be directly applied to SIMD extension as it is oriented to vector machine. The vectorization algorithm in state-of-the-art compilers can only process one level of nested loop. In order to vectorize the entire nested loop, a vectorization algorithm based on strip mining called simdcodegen is proposed. Firstly, the formation reason of dependence circles is analyzed and the role of strip mining played in elimination of dependence circles is discussed. Then on the basis of codegen, strip mining is applied on each level of the loop recursively to explore the SIMD parallelism in the nested loop. Although strip mining is always legitimate, the executing cost increases after strip mining. To assure that strip mining is beneficial, that is to say, strip mining can break some dependence circles, cycle broken test is applied before implementation of strip mining. Effectiveness of this method is verified by the experimental results.
  • Keywords
    data mining; parallel processing; support vector machines; Codegen; SIMD extension; SIMD vectorization; multilevel loop vector code generation algorithm; nested loop; strip mining; vector machine; Data mining; Data processing; Fellows; Parallel processing; Registers; Strips; Support vector machines; Dependence cycle; Dependence distance; Local parallel; SIMD extension; Strip mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on
  • Conference_Location
    Takamatsu
  • Type

    conf

  • DOI
    10.1109/SNPD.2015.7176176
  • Filename
    7176176