Title : 
Performance Analysis of a Family of WHT Algorithms
         
        
            Author : 
Andrews, Michael ; Johnson, Jeremy
         
        
            Author_Institution : 
Dept. of Comput. Sci., Drexel Univ., Philadelphia, PA
         
        
        
        
        
        
            Abstract : 
This paper explores the correlation of instruction counts and cache misses to runtime performance for a large family of divide and conquer algorithms to compute the Walsh-Hadamard transform (WHT). Previous work showed how to compute instruction counts and cache misses from a high-level description of the algorithm and proved theoretical results about their minimum, maximum, mean, and distribution. While the models themselves do not accurately predict performance, it is shown that they are statistically correlated to performance and thus can be used to prune the search space for fast implementations. When the size of the transform fits in cache the instruction count itself is used; however, when the transform no longer fits in cache, a linear combination of instruction counts and cache misses is used. Thus for small transforms it is safe to ignore algorithms which have a high instruction count and for large transforms it is safe to ignore algorithms with a high value in the combined instruction count/cache miss model. Since the models can be computed from a high-level description of the algorithms, they can be obtained without runtime measurement and the previous theoretical results on the models can be applied to limit empirical search.
         
        
            Keywords : 
Hadamard transforms; cache storage; divide and conquer methods; mathematics computing; Walsh-Hadamard transform algorithm; cache miss model; divide and conquer algorithm; instruction count model; performance analysis; Computer aided instruction; Computer science; Distributed computing; Fast Fourier transforms; High performance computing; Machine learning algorithms; Packaging; Performance analysis; Predictive models; Runtime;
         
        
        
        
            Conference_Titel : 
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
         
        
            Conference_Location : 
Long Beach, CA
         
        
            Print_ISBN : 
1-4244-0910-1
         
        
            Electronic_ISBN : 
1-4244-0910-1
         
        
        
            DOI : 
10.1109/IPDPS.2007.370640