The performance potential of fine-grain and coarse-grain parallel architectures

Research output: Contribution to journal › Conference article

4 Scopus citations


Recent work has shown that pipelining and multiple instruction issuing are architecturally equivalent in their abilities to exploit parallelism, but there has been little work directly comparing the performance of these fine-grain parallel architectures with that of coarse-grain multiprocessors. Using trace-driven simulations, the authors compare the performance of a superscalar processor and a pipelined processor using dynamic dependence checking with that of a shared-memory multiprocessor. For very parallel programs, they find that the fine-grain processors must bypass an unrealistically large number of branches to match the performance of the multiprocessor. When executing programs with a wide range of potential parallelism, the best performance is obtained using a multiprocessor where each individual processor has a fine-grain parallelism of two to four.

Original language: English (US)
Article number: 183902
Pages (from-to): 324-333
Number of pages: 10
Journal: Proceedings of the Annual Hawaii International Conference on System Sciences
State: Published - Jan 1 1991
Externally published: Yes
Event: 24th Annual Hawaii International Conference on System Sciences, HICSS 1991 - Kauai, United States
Duration: Jan 8 1991 to Jan 11 1991

