Title :
Quantitative Analysis of Inter-block Dependence in Speculative Execution
Author :
Junrui Zhou ; Hong An ; Yanyan Zou ; Tao Sun
Author_Institution :
Sch. of Comput. Sci. & Technol., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
Given increasingly abundant resources on substrate, speculative execution has been proposed to seek opportunities for improving performance of sequential applications restricted by control dependence and data dependence. In spite of benefits obtained from speculative execution, performance and power suffer severely from pipeline flushes due to failure of aggressive branch speculation. Unfortunately, without knowing behavior of control dependence at runtime, a compiler is unable to split sequential application for speculative execution to achieve optimal performance. Therefore, control dependence which is source of performance loss should be identified and properly optimized. In this paper, we propose an efficient two level quantitative performance analysis, called QAD, to identify dependence needed to be optimized. The breakdown in performance analysis of QAD helps performance optimization get rid of heavy reliance on instruction scheduling that is a NP problem. Thus, compared to prior performance analysis based on critical path model, more room for optimizing sequential application can be identified by QAD. Experimental results are presented in the context of a TRIPS processor which supports speculative execution at the granularity of block. Analysis time of QAD gains a 2.6x speedup over traditional critical path analysis on average. With code optimizations guided by result of quantitative performance analysis, a geometric mean of 18.93% performance improvement is achieved.
Keywords :
computational complexity; multiprocessing systems; parallel processing; processor scheduling; NP problem; QAD; TRIPS processor; aggressive branch speculation failure; code optimizations; control dependence; critical path analysis; data dependence; instruction scheduling; inter-block dependence; performance loss; performance optimization; sequential application optimization; speculative execution; two level quantitative performance analysis; Algorithm design and analysis; Benchmark testing; Delays; Optimization; Performance analysis; Radio frequency; Registers; inter-block dependences; intermediate code optimization; performance analysis; speculative execution;
Conference_Titel :
Trust, Security and Privacy in Computing and Communications (TrustCom), 2013 12th IEEE International Conference on
Conference_Location :
Melbourne, VIC
DOI :
10.1109/TrustCom.2013.121