TY - GEN
T1 - On suitability of Euclidean embedding of internet hosts
AU - Lee, Sanghwan
AU - Zhang, Zhi Li
AU - Sahu, Sambit
AU - Saha, Debanjan
PY - 2006/6
Y1 - 2006/6
N2 - In this paper, we investigate the suitability of embedding Internet hosts into a Euclidean space given their pairwise distances (as measured by round-trip time). Using the classical scaling and matrix perturbation theories, we first, establish the (sum of the) magnitude of negative eigenvalues of the (doubly-centered, squared) distance matrix as a measure of suitability of Euclidean embedding. We then show that the distance matrix among Internet hosts contains negative eigenvalues of large magnitude, implying that embedding the Internet hosts in a Euclidean space would incur relatively large errors. Motivated by earlier studies, we demonstrate that the inaccuracy of Euclidean embedding is caused by a large degree of triangle inequality violation (TIV) in the Internet distances, which leads to negative eigenvalues of large magnitude. Moreover, we show that the TIVs are likely to occur locally, hence, the distances among these close-by hosts cannot be estimated accurately using a global Euclidean embedding, in addition, increasing the dimension of embedding does not reduce the embedding errors. Bused on these insights, we propose a new hybrid model for embedding the network nodes using only a 2-dimensional Euclidean coordinate system and small error adjustment terms. We show that the accuracy of the proposed embedding technique is as good as, if not better, than that of a 7-dimensional Euclidean embedding.
AB - In this paper, we investigate the suitability of embedding Internet hosts into a Euclidean space given their pairwise distances (as measured by round-trip time). Using the classical scaling and matrix perturbation theories, we first, establish the (sum of the) magnitude of negative eigenvalues of the (doubly-centered, squared) distance matrix as a measure of suitability of Euclidean embedding. We then show that the distance matrix among Internet hosts contains negative eigenvalues of large magnitude, implying that embedding the Internet hosts in a Euclidean space would incur relatively large errors. Motivated by earlier studies, we demonstrate that the inaccuracy of Euclidean embedding is caused by a large degree of triangle inequality violation (TIV) in the Internet distances, which leads to negative eigenvalues of large magnitude. Moreover, we show that the TIVs are likely to occur locally, hence, the distances among these close-by hosts cannot be estimated accurately using a global Euclidean embedding, in addition, increasing the dimension of embedding does not reduce the embedding errors. Bused on these insights, we propose a new hybrid model for embedding the network nodes using only a 2-dimensional Euclidean coordinate system and small error adjustment terms. We show that the accuracy of the proposed embedding technique is as good as, if not better, than that of a 7-dimensional Euclidean embedding.
KW - Euclidean embedding
KW - Suitability
KW - Triangle inequality
UR - http://www.scopus.com/inward/record.url?scp=33750348103&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33750348103&partnerID=8YFLogxK
U2 - 10.1145/1140103.1140296
DO - 10.1145/1140103.1140296
M3 - Conference contribution
AN - SCOPUS:33750348103
SN - 1595933204
SN - 9781595933201
T3 - Performance Evaluation Review
SP - 157
EP - 168
BT - SIGMETRICS 2006/Performance 2006 - Joint International Conference on Measurement and Modeling of Computer Systems, Proceedings
T2 - SIGMETRICS 2006/Performance 2006 - Joint International Conference on Measurement and Modeling of Computer Systems
Y2 - 26 June 2006 through 30 June 2006
ER -