• DocumentCode
    2049172
  • Title

    An overview of the ECO project

  • Author

    Chame, Jacqueline ; Chen, Chun ; Diniz, Pedro ; Hall, Mary ; Lee, Yoon-Ju ; Lucas, Robert F.

  • Author_Institution
    Inf. Sci. Inst., Southern California Univ., Marina del Rey, CA
  • fYear
    2006
  • fDate
    25-29 April 2006
  • Abstract
    In this paper, we describe a compilation system that automates much of the process of performance tuning that is currently done manually by application programmers interested in high performance. Our approach combines compiler models and heuristics with guided empirical search to take advantage of their complementary strengths. The models and heuristics limit the search to a small number of candidate implementations, and the empirical results provide the most accurate information to the compiler to select among candidates and tune optimization parameter values. The overall approach can be employed to alleviate some of the performance problems that lead to inefficiencies in key applications today: register pressure, cache conflict misses, and the trade-off between synchronization, parallelism and locality in SMPs. The main focus of the paper is an algorithm for simultaneously optimizing across multiple levels of the memory hierarchy for dense-matrix computations. We have developed an initial compiler implementation, and present automatically-generated results on matrix multiply. Results on two architectures, SGI R10000 and Sun UltraSparc IIe, outperform the native compiler, and either outperform or achieve comparable performance as the ATLAS self-tuning library and the hand-tuned vendor BLAS library. This paper describes other components of the ECO system, including supporting tools and experiments with programmer-guided performance tuning. This approach has provided a foundation for a general framework for systematic optimization of domain-specific applications. Specifically, we are developing an optimization system for signal and image processing that exploits signal properties, and we are using machine learning and a knowledge-rich representation can be exploited to optimize molecular dynamics simulation
  • Keywords
    image processing; matrix multiplication; optimising compilers; ECO project; SMP; compilation system; compiler implementation; dense-matrix computations; domain-specific applacations; empirical search; heuristics; image processing; matrix multiplication; optimization parameter; performance tuning; signal processing; systematic optimization; Computer architecture; Focusing; Libraries; Optimizing compilers; Parallel processing; Programming profession; Registers; Signal processing; Sun; Tuning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
  • Conference_Location
    Rhodes Island
  • Print_ISBN
    1-4244-0054-6
  • Type

    conf

  • DOI
    10.1109/IPDPS.2006.1639571
  • Filename
    1639571