مرکز منطقه ای اطلاع رساني علوم و فناوري - High Performance by Exploiting Information Locality through Reverse Computing

DocumentCode :

2688137

Title :

High Performance by Exploiting Information Locality through Reverse Computing

Author :

Bahi, Mouad ; Eisenbeis, Christine

Author_Institution :

INRIA Saclay - Ile-de-France, Orsay, France

fYear :

2011

fDate :

26-29 Oct. 2011

Firstpage :

Lastpage :

Abstract :

In this paper we present performance results for our register rematerialization technique based on reverse recomputing. Rematerialization adds instructions and we show on one specifically designed example that reverse computing alleviates the impact of these additional instructions on performance. We also show how thread parallelism may be optimized on GPUs by performing register allocation with reverse recomputing that increases the number of threads per Streaming Multiprocessor (SM). This is done on the main kernel of Lattice Quantum Chromo Dynamics (LQCD) simulation program where we gain a 10.84% speedup.

Keywords :

graphics processing units; multiprocessing programs; reverse engineering; GPU; LQCD simulation program; information locality; lattice quantum chromo dynamics; register rematerialization technique; reverse computing; streaming multiprocessor; Assembly; Frequency modulation; Instruction sets; Optimization; Parallel processing; Pipelines; Registers; rematerialization; reversible computing; thread-level parallelism;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Architecture and High Performance Computing (SBAC-PAD), 2011 23rd International Symposium on

Conference_Location :

Vitoria, Espirito Santo

ISSN :

1550-6533

Print_ISBN :

978-1-4577-2050-5

Type :

conf

DOI :

10.1109/SBAC-PAD.2011.10

Filename :

6106002

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2688137