Title :
Diagnostics for causes of packet loss in a high performance data transfer system
Author :
Dickens, Phillip M. ; Larson, Jay W. ; Nicol, David M.
Abstract :
Summary form only given. As computational grids become an increasingly dominant force in the high-performance computing arena, the problem of efficiently transferring very large data sets, across geographically distributed computing resources, becomes increasingly difficult and important. Current approaches view the problem largely, if not exclusively, as a network-level problem. Thus all packet loss is interpreted and treated as a network congestion event, limiting the ability to detect or react to changes in the end-to-end system. We believe that a new approach to this problem is worth pursuing, and we are investigating techniques that can differentiate between data loss caused by contention in the network and loss caused by contention for shared CPU resources at the communication endpoints. The approach is to collect and analyze what we term packet-loss signatures that describe the patterns of packet-loss in the current transmission window. We analyze these signatures using Fourier analysis and symbolic dynamics, and present a simple set of experiments demonstrating the effectiveness of this approach.
Keywords :
Fourier analysis; electronic data interchange; grid computing; packet switching; telecommunication congestion control; Fourier analysis; computational grid; data transfer system; end-to-end system; geographically distributed computing resources; high-performance computing arena; network congestion event; network-level problem; packet-loss signatures; symbolic dynamics; very large data set; Computer networks; Computer science; Control systems; Data engineering; Distributed computing; Grid computing; Laboratories; Large-scale systems; Pattern analysis; Performance loss;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
DOI :
10.1109/IPDPS.2004.1302978