Abstract
Graph partitioning is important in distributing workloads on parallel compute systems, computing sparse matrix re-orderings, and designing VLSI circuits. Refinement algorithms are used to improve existing partitionings, and are essential for obtaining high-quality partitionings. Existing parallel refinement algorithms either extract concurrency by sacrificing quality, or preserve quality by restricting concurrency. In this work we present a new shared-memory parallel algorithm for refining an existing k-way partitioning that can break out of local minima and produce high-quality partitionings. This allows our algorithm to scale well in terms of the number of processing cores and produce clusterings of quality equal to serial algorithms. Our algorithm achieves speedups of 5.7-16.7x using 24 cores, while exhibiting only 0.52% higher edgecuts than when run serially. This is 6.3x faster and 1.9% better quality than other parallel refinement algorithms which can break out of local minima.
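To make the terms "edgecut" and "refinement" concrete, the following is a minimal serial sketch. It is not the algorithm presented in this work: it is a generic greedy pass that only accepts moves that strictly lower the cut, so it stops at the first local minimum, which is exactly the limitation the abstract refers to. The names `edge_cut`, `greedy_refine`, and `max_size` are hypothetical and chosen for illustration only.

```python
from collections import defaultdict


def edge_cut(edges, part):
    """Count the edges whose endpoints lie in different partitions."""
    return sum(1 for u, v in edges if part[u] != part[v])


def greedy_refine(edges, part, max_size):
    """One greedy pass over the vertices (illustrative assumption, not the
    paper's method): move a vertex to a neighboring partition only when that
    strictly reduces the cut and the target holds fewer than max_size vertices.
    """
    part = dict(part)  # work on a copy so the input partitioning is untouched
    adj = defaultdict(list)
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    sizes = defaultdict(int)
    for p in part.values():
        sizes[p] += 1

    for v in sorted(part):
        # Number of edges from v into each partition.
        conn = defaultdict(int)
        for n in adj[v]:
            conn[part[n]] += 1
        home = part[v]
        home_conn = conn[home]
        best, best_gain = home, 0
        for p in sorted(conn):
            if p == home or sizes[p] >= max_size:
                continue
            gain = conn[p] - home_conn  # reduction in cut if v moves to p
            if gain > best_gain:
                best, best_gain = p, gain
        if best != home:
            sizes[home] -= 1
            sizes[best] += 1
            part[v] = best
    return part


# Two triangles joined by the edge (2, 3); vertex 2 starts on the wrong side.
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
initial = {0: 0, 1: 0, 2: 1, 3: 1, 4: 1, 5: 1}
refined = greedy_refine(edges, initial, max_size=3)
print(edge_cut(edges, initial), edge_cut(edges, refined))
```

Because this pass never accepts a move with zero or negative gain, it cannot escape a local minimum; refinement schemes that can do so accept temporarily worse moves and keep the result only if the cut improves overall.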
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 45th International Conference on Parallel Processing, ICPP 2016 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 236-241 |
Number of pages | 6 |
ISBN (Electronic) | 9781509028238 |
State | Published - Sep 21 2016 |
Event | 45th International Conference on Parallel Processing, ICPP 2016 - Philadelphia, United States; Duration: Aug 16 2016 → Aug 19 2016 |
Publication series
Name | Proceedings of the International Conference on Parallel Processing |
---|---|
Volume | 2016-September |
ISSN (Print) | 0190-3918 |
Other
Other | 45th International Conference on Parallel Processing, ICPP 2016 |
---|---|
Country/Territory | United States |
City | Philadelphia |
Period | 8/16/16 → 8/19/16 |
Bibliographical note
Publisher Copyright: © 2016 IEEE.
Keywords
- Graph partitioning
- Local minima
- Multithreading