Title :
PaScal - a new parallel and scalable server IO networking infrastructure for supporting global storage/file systems in large-size Linux clusters
Author :
Grider, Gary ; Chen, Hsing-bung ; Nunez, James ; Poole, Steve ; Wacha, Rosie ; Fields, Parks ; Martinez, Robert ; Martinez, Paul ; Khalsa, Satsangat ; Matthews, Abbie ; Gibson, Garth
Author_Institution :
Los Alamos Nat. Lab., NM
Abstract :
This paper presents the design and implementation of a new I/O networking infrastructure, named PaScal (parallel and scalable I/O networking framework). PaScal is used to support high data bandwidth IP based global storage systems for large scale Linux clusters. PaScal has several unique properties. It employs (1) Multi-level switch-fabric interconnection network by combining high speed interconnects for computing inter-process communication (IPC) requirements and low-cost Gigabit Ethernet interconnect for global IP based storage/file access, (2) A bandwidth on demand scaling I/O networking architecture, (3) open-standard IP networks (routing and switching), (4) multipath routing for load balancing and failover, (5) open shortest path first (OSPF) routing software, and (6) Supporting a global file system in multi-cluster and multi-platform environments. We describe both the hardware and software components of our proposed PaScal. We have implemented the PaScal I/O infrastructure on several large-size Linux clusters at LANL. We have conducted a sequence of parallel MPI-IO assessment benchmarks on LANL´s Pink 1024 node Linux cluster and the Panasas global parallel file system. Performance results from our parallel MPI-IO benchmarks on the Pink cluster demonstrate that the PaScal I/O Infrastructure is robust and capable of scaling in bandwidth on large-size Linux clusters
Keywords :
IP networks; Linux; file servers; resource allocation; routing protocols; workstation clusters; IO networking infrastructure; IP based global storage system; IPC computing; Linux cluster; PaScal; gigabit Ethernet; global storage-file system; inter-process communication; load balancing; multilevel switch-fabric interconnection network; multipath routing; open-standard IP network; parallel and scalable server; Bandwidth; Communication switching; Computer networks; File servers; File systems; Large-scale systems; Linux; Multiprocessor interconnection networks; Network servers; Routing;
Conference_Titel :
Performance, Computing, and Communications Conference, 2006. IPCCC 2006. 25th IEEE International
Conference_Location :
Phoenix, AZ
Print_ISBN :
1-4244-0198-4
DOI :
10.1109/.2006.1629424