TY - GEN
T1 - Scalability of parallel sorting on mesh multicomputers
AU - Singh, V.
AU - Kumar, V.
AU - Agha, G.
AU - Tomlinson, C.
PY - 1991/1/1
Y1 - 1991/1/1
N2 - This paper presents two new parallel algorithms QSPl and QSP2 based on sequential quicksort for sorting data on a mesh multicomputer, and analyzes their scalability using the isoefficiency metric. We show that QSP2 matches the lower bound on the isoefficiency function for mesh multicomputer. The isoefficiency of QSPl is also fairly close to optimal. Lang et al. and Schnorr et al. have developed parallel sorting algorithms for the mesh architecture that have either optimal (Schnorr) or close to optimal (Lang) run-time complexity for the one-element-per-processor case. Both QSPl and QSP2 have worse performance than these algorithms for the one-element-perprocessor case. But QSPl and QSP2 have better scalability than the scaled-down variants of these algorithms (for the case in which there are more elements than processors). As a result, our new parallel formulations are better than these scaled-down variants in terms of speedup w.r.t the best sequential algorithms. We also present a different variant of Lang's sort which is asymptotically as scalable as QSP2 (for the multiple-element-per-processor case). We briefly discuss another metric called "resource consumption metric". According to this metric, both QSPl and QSP2 are strictly superior to Lang's sort and its variations.
AB - This paper presents two new parallel algorithms QSPl and QSP2 based on sequential quicksort for sorting data on a mesh multicomputer, and analyzes their scalability using the isoefficiency metric. We show that QSP2 matches the lower bound on the isoefficiency function for mesh multicomputer. The isoefficiency of QSPl is also fairly close to optimal. Lang et al. and Schnorr et al. have developed parallel sorting algorithms for the mesh architecture that have either optimal (Schnorr) or close to optimal (Lang) run-time complexity for the one-element-per-processor case. Both QSPl and QSP2 have worse performance than these algorithms for the one-element-perprocessor case. But QSPl and QSP2 have better scalability than the scaled-down variants of these algorithms (for the case in which there are more elements than processors). As a result, our new parallel formulations are better than these scaled-down variants in terms of speedup w.r.t the best sequential algorithms. We also present a different variant of Lang's sort which is asymptotically as scalable as QSP2 (for the multiple-element-per-processor case). We briefly discuss another metric called "resource consumption metric". According to this metric, both QSPl and QSP2 are strictly superior to Lang's sort and its variations.
UR - http://www.scopus.com/inward/record.url?scp=85054920264&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054920264&partnerID=8YFLogxK
U2 - 10.1109/IPPS.1991.153762
DO - 10.1109/IPPS.1991.153762
M3 - Conference contribution
AN - SCOPUS:85054920264
T3 - Proceedings - 5th International Parallel Processing Symposium, IPPS 1991
SP - 92
EP - 101
BT - Proceedings - 5th International Parallel Processing Symposium, IPPS 1991
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Parallel Processing Symposium, IPPS 1991
Y2 - 30 April 1991 through 2 May 1991
ER -