Abstract
Connected component labeling is a key step in a wide-range of applications, such as community detection in social networks and coherent structure identification in massively-parallel scientific simulations. There have been several distributed-memory connected component algorithms described in literature; however, little has been done regarding their scalability analysis. Theoretical and experimental results are presented for five algorithms: three that are direct implementations of previous approaches, one that is an implementation of a previous approach that is optimized to reduce communication, and one that is a novel approach based on graph contraction. Under weak scaling and for certain classes of graphs, the graph contraction algorithm scales consistently better than the four other algorithms. Furthermore, it uses significantly less memory than two of the alternative methods and is of the same order in terms of memory as the other two.
Original language | English (US) |
---|---|
Pages (from-to) | 53-68 |
Number of pages | 16 |
Journal | Parallel Computing |
Volume | 44 |
DOIs | |
State | Published - May 2015 |
Bibliographical note
Publisher Copyright:© 2015 Published by Elsevier B.V.
Keywords
- Connected component
- Distributed-memory
- Scalability