TY - GEN

T1 - Communication optimal parallel multiplication of sparse random matrices

AU - Ballard, Grey

AU - Buluç, Aydin

AU - Demmel, James

AU - Grigori, Laura

AU - Lipshitz, Benjamin

AU - Schwartz, Oded

AU - Toledo, Sivan

PY - 2013

Y1 - 2013

N2 - Parallel algorithms for sparse matrix-matrix multiplication typically spend most of their time on inter-processor communication rather than on computation, and hardware trends predict the relative cost of communication will only increase. Thus, sparse matrix multiplication algorithms must minimize communication costs in order to scale to large processor counts. In this paper, we consider multiplying sparse matrices corresponding to Erdo″s- Rényi random graphs on distributedmemory parallel machines. We prove a new lower bound on the expected communication cost for a wide class of algorithms. Our analysis of existing algorithms shows that, while some are optimal for a limited range of matrix density and number of processors, none is optimal in general. We obtain two new parallel algorithms and prove that they match the expected communication cost lower bound, and hence they are optimal.

AB - Parallel algorithms for sparse matrix-matrix multiplication typically spend most of their time on inter-processor communication rather than on computation, and hardware trends predict the relative cost of communication will only increase. Thus, sparse matrix multiplication algorithms must minimize communication costs in order to scale to large processor counts. In this paper, we consider multiplying sparse matrices corresponding to Erdo″s- Rényi random graphs on distributedmemory parallel machines. We prove a new lower bound on the expected communication cost for a wide class of algorithms. Our analysis of existing algorithms shows that, while some are optimal for a limited range of matrix density and number of processors, none is optimal in general. We obtain two new parallel algorithms and prove that they match the expected communication cost lower bound, and hence they are optimal.

KW - Communication-avoiding algorithms

KW - Communication-cost lower bounds

KW - Random graphs

KW - Sparse matrix multiplication

UR - http://www.scopus.com/inward/record.url?scp=84883515454&partnerID=8YFLogxK

U2 - 10.1145/2486159.2486196

DO - 10.1145/2486159.2486196

M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???

AN - SCOPUS:84883515454

SN - 9781450315722

T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures

SP - 222

EP - 231

BT - SPAA 2013 - Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures

PB - Association for Computing Machinery

T2 - 25th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2013

Y2 - 23 July 2013 through 25 July 2013

ER -