Title :
A practical external sort for shared disk MPPs
Author :
Li, Xiqing ; Linoff, Gordon ; Smith, Stephen J. ; Stanfill, Craig ; Thearling, Kurt
Author_Institution :
Thinking Machines Corp., Cambridge, MA, USA
Abstract :
An external sort has been implemented and analyzed for a shared disk MPP computer system. In this implementation, we have considered many real world constraints. Decision support functionality in database systems, for instance, often requires that external sorting be done in place on disk, support variable length records, and be restartable from any point of interruption with no loss of data. These three constraints, along with the more standard requirements of speed and stability, affect the choice and implementation of the external sorting algorithm. The implementation of the sample sort algorithm described here meets these requirements. Although written using high level file processing directives, the implementation sorts a 10 GB file in 1.5 h on a 64 processor Connection Machine CM-5 with a DataVault disk system.
Keywords :
parallel programming; shared memory systems; sorting; 1.5 h; Connection Machine CM-5; DataVault disk system; computer speed; computer stability; database systems; decision support functionality; external sorting algorithm; high level file processing directives; interruption; massively parallel processing; practical external sort; sample sort algorithm; shared disk MPP computer system; variable length records; Algorithm design and analysis; Computer aided instruction; Concurrent computing; Data mining; Database systems; Libraries; Message passing; Sorting; Spatial databases; Stability;
Conference_Titel :
Supercomputing '93. Proceedings
Print_ISBN :
0-8186-4340-4
DOI :
10.1109/SUPERC.1993.1263521