TY - JOUR
T1 - Classification, ranking, and top-K stability of recommendation algorithms
AU - Adomavicius, Gediminas
AU - Zhang, Jingjing
PY - 2016/12/1
Y1 - 2016/12/1
N2 - Recommendation stability measures the extent to which a recommendation algorithm provides predictions that are consistent with each other. Several approaches have been proposed in prior work to defining, measuring, and improving the stability of recommendation algorithms. Previous studies have focused primarily on understanding and evaluating recommendation stability in prediction-oriented settings, i.e., recommendation settings where it is crucial to provide the precise prediction of a user's preference rating for an item. However, the research literature has been largely silent on the topic of recommendation stability in other important types of settings, such as classification-oriented (i.e., where it is important to accurately classify the item as relevant versus irrelevant, without having to quantify the user's preference precisely), ranking-oriented (i.e., where it is important to provide accurate relative ranking of items to users), or top-K oriented (i.e., where it is important to suggest K items that are most appealing to the user). Therefore, this paper builds on prior work by generalizing the notion of stability to a broader set of recommendation settings and developing corresponding stability metrics. The paper also provides a comprehensive empirical analysis of classification, ranking, and top-K stability performance of popular recommender algorithms on real-world rating data sets under a variety of settings.
AB - Recommendation stability measures the extent to which a recommendation algorithm provides predictions that are consistent with each other. Several approaches have been proposed in prior work to defining, measuring, and improving the stability of recommendation algorithms. Previous studies have focused primarily on understanding and evaluating recommendation stability in prediction-oriented settings, i.e., recommendation settings where it is crucial to provide the precise prediction of a user's preference rating for an item. However, the research literature has been largely silent on the topic of recommendation stability in other important types of settings, such as classification-oriented (i.e., where it is important to accurately classify the item as relevant versus irrelevant, without having to quantify the user's preference precisely), ranking-oriented (i.e., where it is important to provide accurate relative ranking of items to users), or top-K oriented (i.e., where it is important to suggest K items that are most appealing to the user). Therefore, this paper builds on prior work by generalizing the notion of stability to a broader set of recommendation settings and developing corresponding stability metrics. The paper also provides a comprehensive empirical analysis of classification, ranking, and top-K stability performance of popular recommender algorithms on real-world rating data sets under a variety of settings.
KW - Classification stability
KW - Evaluation of recommender systems
KW - Ranking stability
KW - Recommender systems
KW - Top-K stability
UR - http://www.scopus.com/inward/record.url?scp=84959907635&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84959907635&partnerID=8YFLogxK
U2 - 10.1287/ijoc.2015.0662
DO - 10.1287/ijoc.2015.0662
M3 - Article
AN - SCOPUS:84959907635
SN - 1091-9856
VL - 28
SP - 129
EP - 147
JO - INFORMS Journal on Computing
JF - INFORMS Journal on Computing
IS - 1
ER -