A framework for exploiting task and data parallelism on distributed memory multicomputers

Shankar Ramaswamy, Sachin Sapatnekar, Prithviraj Banerjee

Research output: Contribution to journalArticlepeer-review

84 Scopus citations

Abstract

Distributed Memory MultiComputers (DMMs), such as the IBM SP-2, the Intel Paragon, and the Thinking Machines CM-5, offer significant advantages over shared memory multiprocessors in terms of cost and scalability. Unfortunately, the utilization of all the available computational power in these machines involves a tremendous programming effort on the part of users, which creates a need for sophisticated compiler and run-time support for distributed memory machines. In this paper, we explore a new compiler optimization for regular scientific applications - the simultaneous exploitation of task and data parallelism. Our optimization is implemented as part of the PARADIGM HPF compiler framework we have developed. The intuitive idea behind the optimization is the use of task parallelism to control the degree of data parallelism of individual tasks. The reason this provides increased performance is that data parallelism provides diminishing returns as the number of processors used is increased. By controlling the number of processors used for each data parallel task in an application and by concurrently executing these tasks, we make program execution more efficient and, therefore, faster. A practical implementation of a task and data parallel scheme of execution for an application on a distributed memory multicomputer also involves data redistribution. This data redistribution causes an overhead. However, as our experimental results show, this overhead is not a problem; execution of a program using task and data parallelism together can be significantly faster than its execution using data parallelism alone. This makes our proposed optimization practical and extremely useful.

Original languageEnglish (US)
Pages (from-to)1098-1116
Number of pages19
JournalIEEE Transactions on Parallel and Distributed Systems
Volume8
Issue number11
DOIs
StatePublished - 1997

Bibliographical note

Funding Information:
This research was supported in part by an IBM Graduate Fellowship and in part by the National Aeronautics and Space Administration under contract NASA NAG 1-613.

Keywords

  • Allocation
  • Convex programming
  • Data parallel
  • Distributed memory
  • HPF
  • Scheduling
  • Task parallel

Fingerprint

Dive into the research topics of 'A framework for exploiting task and data parallelism on distributed memory multicomputers'. Together they form a unique fingerprint.

Cite this