TY - JOUR
T1 - Does `authority' mean quality? Predicting expert quality ratings of Web documents
AU - Amento, Brian
AU - Terveen, Loren
AU - Hill, Will
PY - 2000
Y1 - 2000
N2 - For many topics the World Wide Web contains hundreds or thousands of relevant documents of widely varying quality. Users face a daunting challenge in identifying a small subset of documents worthy of their attention. Link analysis algorithms have received much interest recently, in large part for their potential to identify high quality items. We report here on an experimental evaluation of this potential. We evaluated a number of link and content-based algorithms using a dataset of web documents rated for quality by human topic experts. Link-based metrics did a good job of picking out high-quality items. Precision at 5 is about 0.75, and precision at 10 is about 0.55; this is in a dataset where 0.32 of all documents were of high quality. Surprisingly, a simple content-based metric performed nearly as well; ranking documents by the total number of pages on their containing site.
AB - For many topics the World Wide Web contains hundreds or thousands of relevant documents of widely varying quality. Users face a daunting challenge in identifying a small subset of documents worthy of their attention. Link analysis algorithms have received much interest recently, in large part for their potential to identify high quality items. We report here on an experimental evaluation of this potential. We evaluated a number of link and content-based algorithms using a dataset of web documents rated for quality by human topic experts. Link-based metrics did a good job of picking out high-quality items. Precision at 5 is about 0.75, and precision at 10 is about 0.55; this is in a dataset where 0.32 of all documents were of high quality. Surprisingly, a simple content-based metric performed nearly as well; ranking documents by the total number of pages on their containing site.
UR - http://www.scopus.com/inward/record.url?scp=0033661294&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0033661294&partnerID=8YFLogxK
U2 - 10.1145/345508.345603
DO - 10.1145/345508.345603
M3 - Conference article
AN - SCOPUS:0033661294
SN - 0022-1120
SP - 296
EP - 303
JO - Journal of Fluid Mechanics
JF - Journal of Fluid Mechanics
T2 - Proceedings of the 23rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2000)
Y2 - 24 July 2000 through 28 July 2000
ER -