Scalability of parallel algorithms for matrix multiplication

Anshul Gupta, Vipin Kumar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

23 Scopus citations

Abstract

A number of parallel formulations of dense matrix multiplication algorithm have been developed. For arbitrarily large number of processors, any of these algorithms or their variants can provide near linear speedup for sufficiently large matrix sizes and none of the algorithms can be clearly claimed to be superior than the others. In this paper we analyze the performance and scalability of a number of parallel formulations of the matrix multiplication algorithm and predict the conditions under which each formulation is better than the others. We present a parallel formulation for hypt.rcube and related architectures thai performs better than any of the schemes described in the literature so far for a wide range of matrtx sizes and number of processors. The superior performance and the analytical scalability expressions for this algorithm are verified through experiments on the Thinking Machines Corporation's CM-5TM' parallel computer for up to 512 processors.

Original languageEnglish (US)
Title of host publicationAlgorithms and Applications
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages115-123
Number of pages9
ISBN (Electronic)0849389836
DOIs
StatePublished - Jan 1 1993
Event1993 International Conference on Parallel Processing, ICPP 1993 - Syracuse, United States
Duration: Aug 16 1993Aug 20 1993

Publication series

NameProceedings of the International Conference on Parallel Processing
Volume3
ISSN (Print)0190-3918

Conference

Conference1993 International Conference on Parallel Processing, ICPP 1993
CountryUnited States
CitySyracuse
Period8/16/938/20/93

Fingerprint Dive into the research topics of 'Scalability of parallel algorithms for matrix multiplication'. Together they form a unique fingerprint.

Cite this