Title :
MicroTools: Automating Program Generation and Performance Measurement
Author :
Beyler, Jean Christophe ; Triquenaux, Nicolas ; Palomares, Vincent ; Chabane, Farid ; Fighiera, Thibault ; Halimi, Jean-Philippe ; Jalby, William
Abstract :
Tuning an application to a given architecture has become a complex procedure. Sophisticated hardware obfuscates the path to easily writing peak-performance applications. During the optimization process, before utilizing the hardware correctly, the user must understand out-of-order execution and micro-operations. Understanding the software´s performance on a given target architecture is the goal of the Micro Creator and Micro Launcher tools. Micro Creator automatically generates a set of benchmark programs from a XML file, whereas Micro Launcher executes them in a stable and closed environment. With these tools, the user has a better understanding of the underlying architecture. The two programs, through the execution of hundreds of micro-programs and from a single-core execution to the parallel world using OpenMP, give insight on performance issues. Looking into unrolling, strided memory accesses, vectorized programs, and parallel programs allow quick and efficient calculations of the latencies and bottlenecks of the architecture.
Keywords :
XML; software performance evaluation; Micro Creator; Micro Launcher tools; MicroTools; OpenMP; XML file; benchmark programs; micro operation; microprograms; optimization process; parallel programs; parallel world; peak performance application; performance measurement; program generation; single core execution; software performance; sophisticated hardware; strided memory; target architecture; vectorized programs; Assembly; Benchmark testing; Computer architecture; Hardware; Kernel; Optimization; Registers; Micro-benchmarks; Performance tools; Program generation;
Conference_Titel :
Parallel Processing Workshops (ICPPW), 2012 41st International Conference on
Conference_Location :
Pittsburgh, PA
Print_ISBN :
978-1-4673-2509-7
DOI :
10.1109/ICPPW.2012.60