• DocumentCode
    1614866
  • Title

    Impact of high performance sockets on data intensive applications

  • Author

    Balaji, Pavan ; Wu, Jiesheng ; Kurc, Tahsin ; Catalyurek, Umit ; Panda, Dhabaleswar K. ; Saltz, Joel

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
  • fYear
    2003
  • Firstpage
    24
  • Lastpage
    33
  • Abstract
    The challenging issues in supporting data intensive applications on clusters include efficient movement of large volumes of data between processor memories and efficient coordination of data movement and processing by runtime support to achieve high performance. Such applications have several requirements, such as guarantees in performance, scalability with these guarantees, and adaptability to heterogeneous environments. With the advent of user-level protocols like the Virtual Interface Architecture (VIA) and the modern InfiniBand Architecture, the latency and bandwidth experienced by applications have approached those of the physical network on clusters. To enable applications written on top of TCP/IP to take advantage of the high performance of these user-level protocols, researchers have come up with a number of techniques, including User Level Sockets Layers over high performance protocols. In this paper, we study the performance and limitations of such a substrate, referred to here as SocketVIA, using a component framework designed to provide runtime support for data intensive applications. The experimental results show that by reorganizing certain components of an application (in our case, the partitioning of a dataset into smaller data chunks), we can make significant improvements in application performance. This leads to higher scalability of applications with performance guarantees. It also allows fine-grained load balancing, hence making applications more adaptable to heterogeneity in resource availability. The experimental results also show that the different performance characteristics of SocketVIA allow a more efficient partitioning of data at the source nodes, thus improving the performance of the application by up to an order of magnitude in some cases.
  • Keywords
    packet switching; processor scheduling; resource allocation; transport protocols; IP; InfiniBand Architecture; PC clusters; SocketVIA; TCP; User Level Sockets Layers; VIA; Virtual Interface Architecture; application performance; data intensive applications; data intensive computing; data movement; data processing; dataset partitioning; high-performance networking; load balancing; performance guarantees; resource availability; user-level protocols; Application software; Availability; Biomedical computing; Computer networks; Pipeline processing; Protocols; Runtime; Scalability; Sockets; Subcontracting;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on
  • ISSN
    1082-8907
  • Print_ISBN
    0-7695-1965-2
  • Type

    conf

  • DOI
    10.1109/HPDC.2003.1210013
  • Filename
    1210013