A new method of calculating squared euclidean distance (SED) using PTreE technology and its performance analysis

Mohammad K. Hossain, Sameer Abufardeh

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

One of the advantages of Euclidean distance is that it measures the regular distance between two points in space. For this reason, it is widely used in the applications where the distance between data points are needed to be calculated to measure similarities. However, this method is costly as there involve expensive square and square root operations. One useful observation is that in many data mining applications absolute distance measures are not necessary as long as the distances are used to compare the closeness between various data points. For example, in classification and clustering, we often measure the distances of multiple data points to compare their distances from known classes or from centroids to assign those points in a class or in a cluster. In this regards, an alternative approach known as Squared Euclidean Distance (SED) can be used to avoid the computation of square root to get the squared distance between the data points. SED has been used in classification, clustering, image processing, and other areas to save the computational expenses. In this paper, we show how SED can be calculated for the vertical data represented in pTrees. We also analyze its performance and compared it with traditional horizontal data representation.

Original languageEnglish (US)
Title of host publicationProceedings of 34th International Conference on Computers and Their Applications, CATA 2019
EditorsGordon Lee, Ying Jin
PublisherThe International Society for Computers and Their Applications (ISCA)
Pages45-54
Number of pages10
ISBN (Electronic)9781510885967
StatePublished - Mar 13 2019
Externally publishedYes
Event34th International Conference on Computers and Their Applications, CATA 2019 - Honolulu, United States
Duration: Mar 18 2019Mar 20 2019

Publication series

NameProceedings of 34th International Conference on Computers and Their Applications, CATA 2019

Conference

Conference34th International Conference on Computers and Their Applications, CATA 2019
CountryUnited States
CityHonolulu
Period3/18/193/20/19

Fingerprint Dive into the research topics of 'A new method of calculating squared euclidean distance (SED) using PTreE technology and its performance analysis'. Together they form a unique fingerprint.

Cite this