DocumentCode :
2772134
Title :
Large Scale Kronecker Product on Supercomputers
Author :
Tadonki, Claude
Author_Institution :
MINES ParisTech - CRI (Centre de Rech. en Inf.) - Math. et Syst., Fontainebleau, France
fYear :
2011
fDate :
26-27 Oct. 2011
Firstpage :
1
Lastpage :
4
Abstract :
The Kronecker product, also called tensor product, is a fundamental matrix algebra operation, which is widely used as a natural formalism to express a convolution of many interactions or representations. Given a set of matrices, we need to multiply their Kronecker product by a vector. This operation is a critical kernel for iterative algorithms, thus needs to be computed efficiently. In a previous work, we have proposed a cost optimal parallel algorithm for the problem, both in terms of floating point computation time and interprocessor communication steps. However, the lower bound of data transfers can only be achieved if we really consider (local) logarithmic broadcasts. In practice, we consider a communication loop instead. Thus, it becomes important to care about the real cost of each broadcast. As this local broadcast is performed simultaneously by each processor, the situation is getting worse on a large number of processors (supercomputers). We address the problem in this paper in two points. In one hand, we propose a way to build a virtual topology which has the lowest gap to the theoretical lower bound. In the other hand, we consider a hybrid implementation, which has the advantage of reducing the number of communicating nodes. We illustrate our work with some benchmarks on a large SMP 8-Core supercomputer.
Keywords :
iterative methods; mathematics computing; matrix algebra; parallel machines; tensors; vectors; Kronecker product; SMP 8-Core supercomputer; fundamental matrix algebra operation; iterative algorithm; natural formalism; tensor product; vector; virtual topology; Benchmark testing; Matrices; Parallel algorithms; Quantum computing; Supercomputers; Topology; Vectors; MPI; kronecker; openMP; parallel; tensor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Architecture and Multi-Core Applications (WAMCA), 2011 Second Workshop on
Conference_Location :
Vitoria, Espirito Santo
Print_ISBN :
978-1-4673-0221-0
Type :
conf
DOI :
10.1109/WAMCA.2011.10
Filename :
6112992
Link To Document :
بازگشت