TY - GEN
T1 - Generalizing the notion of support
AU - Steinbach, Michael S
AU - Tan, Pang Ning
AU - Xiong, Hui
AU - Kumar, Vipin
PY - 2004
Y1 - 2004
N2 - The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data. To that end, we describe a framework for generalizing support that is based on the simple, but useful observation that support can be viewed as the composition of two functions: a function that evaluates the strength or presence of a pattern in each object (transaction) and a function that summarizes these evaluations with a single number. A key goal of any framework is to allow people to more easily express, explore, and communicate ideas, and hence, we illustrate how our support framework can be used to describe support for a variety of commonly used association patterns, such as frequent itemsets, general Boolean patterns, and error-tolerant itemsets. We also present two examples of the practical usefulness of generalized support. One example shows the usefulness of support functions for continuous data. Another example shows how the hyperclique pattern - an association pattern originally defined for binary data - can be extended to continuous data by generalizing a support function.
AB - The goal of this paper is to show that generalizing the notion of support can be useful in extending association analysis to non-traditional types of patterns and non-binary data. To that end, we describe a framework for generalizing support that is based on the simple, but useful observation that support can be viewed as the composition of two functions: a function that evaluates the strength or presence of a pattern in each object (transaction) and a function that summarizes these evaluations with a single number. A key goal of any framework is to allow people to more easily express, explore, and communicate ideas, and hence, we illustrate how our support framework can be used to describe support for a variety of commonly used association patterns, such as frequent itemsets, general Boolean patterns, and error-tolerant itemsets. We also present two examples of the practical usefulness of generalized support. One example shows the usefulness of support functions for continuous data. Another example shows how the hyperclique pattern - an association pattern originally defined for binary data - can be extended to continuous data by generalizing a support function.
KW - Association analysis
KW - Hyperclique
KW - Support
UR - http://www.scopus.com/inward/record.url?scp=12244305832&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=12244305832&partnerID=8YFLogxK
U2 - 10.1145/1014052.1014141
DO - 10.1145/1014052.1014141
M3 - Conference contribution
AN - SCOPUS:12244305832
SN - 1581138881
SN - 9781581138887
T3 - KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 689
EP - 694
BT - KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
PB - Association for Computing Machinery
T2 - KDD-2004 - Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Y2 - 22 August 2004 through 25 August 2004
ER -