TY - GEN

T1 - Generalizing the notion of support

AU - Steinbach, Michael S

AU - Tan, Pang Ning

AU - Xiong, Hui

AU - Kumar, Vipin

PY - 2004/1/1

Y1 - 2004/1/1

N2 - The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data. To that end, we describe a framework for generalizing support that is based on the simple, but useful observation that support can be viewed as the composition of two functions: a function that evaluates the strength or presence of a pattern in each object (transaction) and a function that summarizes these evaluations with a single number. A key goal of any framework is to allow people to more easily express, explore, and communicate ideas, and hence, we illustrate how our support framework can be used to describe support for a variety of commonly used association patterns, such as frequent itemsets, general Boolean patterns, and error-tolerant itemsets. We also present two examples of the practical usefulness of generalized support. One example shows the usefulness of support functions for continuous data. Another example shows how the hyperclique pattern - an association pattern originally defined for binary data - can be extended to continuous data by generalizing a support function.

AB - The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data. To that end, we describe a framework for generalizing support that is based on the simple, but useful observation that support can be viewed as the composition of two functions: a function that evaluates the strength or presence of a pattern in each object (transaction) and a function that summarizes these evaluations with a single number. A key goal of any framework is to allow people to more easily express, explore, and communicate ideas, and hence, we illustrate how our support framework can be used to describe support for a variety of commonly used association patterns, such as frequent itemsets, general Boolean patterns, and error-tolerant itemsets. We also present two examples of the practical usefulness of generalized support. One example shows the usefulness of support functions for continuous data. Another example shows how the hyperclique pattern - an association pattern originally defined for binary data - can be extended to continuous data by generalizing a support function.

KW - Association analysis

KW - Hyperclique

KW - Support

UR - http://www.scopus.com/inward/record.url?scp=12244305832&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=12244305832&partnerID=8YFLogxK

U2 - 10.1145/1014052.1014141

DO - 10.1145/1014052.1014141

M3 - Conference contribution

AN - SCOPUS:12244305832

SN - 1581138881

SN - 9781581138887

T3 - KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

SP - 689

EP - 694

BT - KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

PB - Association for Computing Machinery

T2 - KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Y2 - 22 August 2004 through 25 August 2004

ER -