Concept-aware ranking: Teaching an old graph new moves

Colin DeLong, Sandeep Mane, Jaideep Srivastava

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

In ranking algorithms for web graphs, such as PageRank and HITS, the lack of attention to concepts/topics representing web page content causes problems such as topic drift and mutually reinforcing relationships between hosts. This paper proposes a novel approach to expand the Web graph to incorporate conceptual information encoded by links (anchor text) between web pages. Using web graph link structure and conceptual information associated with each web page (automatically extracted from anchor text of inlinks), a new graph is defined where each node represents a unique pair of a web page and concept associated with that web page, and an edge represents an explicit or implicit link between two such nodes. This graph captures inter-concept relationships, which is then utilized by ranking algorithms. Our experimental results show that such an approach improves accuracy (e.g., first X precision) by retrieving links which are more authoritative given a user's context.

Original languageEnglish (US)
Title of host publicationProceedings - ICDM Workshops 2006 - 6th IEEE International Conference on Data Mining - Workshops
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages80-85
Number of pages6
ISBN (Print)0769527027, 9780769527024
DOIs
StatePublished - 2006

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Keywords

  • Concept-aware ranking
  • Concept-page graph
  • Concepts
  • Implicit links
  • Topic drift

Fingerprint

Dive into the research topics of 'Concept-aware ranking: Teaching an old graph new moves'. Together they form a unique fingerprint.

Cite this