A continuum limit for the PageRank algorithm

A. Yuan, J. Calder, B. Osting

Research output: Contribution to journal › Article › peer-review

Abstract

Semi-supervised and unsupervised machine learning methods often rely on graphs to model data, prompting research on how theoretical properties of operators on graphs are leveraged in learning problems. While most of the existing literature focuses on undirected graphs, directed graphs are very important in practice, giving models for physical, biological or transportation networks, among many other applications. In this paper, we propose a new framework for rigorously studying continuum limits of learning algorithms on directed graphs. We use the new framework to study the PageRank algorithm and show how it can be interpreted as a numerical scheme on a directed graph involving a type of normalised graph Laplacian. We show that the corresponding continuum limit problem, which is taken as the number of webpages grows to infinity, is a second-order, possibly degenerate, elliptic equation that contains reaction, diffusion and advection terms. We prove that the numerical scheme is consistent and stable and compute explicit rates of convergence of the discrete solution to the solution of the continuum limit partial differential equation. We give applications to proving stability and asymptotic regularity of the PageRank vector. Finally, we illustrate our results with numerical experiments and explore an application to data depth.
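
For readers unfamiliar with the discrete algorithm the abstract refers to, the following is a minimal sketch of the standard textbook PageRank power iteration on a small directed graph. It is not the numerical scheme or the normalised graph Laplacian formulation analysed in the paper. The function name `pagerank`, the damping factor `alpha = 0.85`, the uniform teleportation distribution, the handling of dangling nodes and the four-page example graph are all assumptions made for illustration.

```python
def pagerank(out_links, alpha=0.85, tol=1e-12, max_iter=1000):
    """Power iteration for the standard PageRank vector.

    out_links[i] lists the nodes that node i links to (a directed graph).
    alpha is the damping factor; 0.85 is a conventional choice and is
    an assumption here, not a value taken from the paper.
    """
    n = len(out_links)
    rank = [1.0 / n] * n  # start from the uniform distribution
    for _ in range(max_iter):
        # Teleportation: every node receives (1 - alpha) / n.
        new_rank = [(1.0 - alpha) / n] * n
        for i, targets in enumerate(out_links):
            if targets:
                # Node i splits its damped rank equally among its out-links.
                share = alpha * rank[i] / len(targets)
                for j in targets:
                    new_rank[j] += share
            else:
                # Dangling node (no out-links): spread its mass uniformly.
                share = alpha * rank[i] / n
                for j in range(n):
                    new_rank[j] += share
        change = sum(abs(a - b) for a, b in zip(new_rank, rank))
        rank = new_rank
        if change < tol:
            break
    return rank


# Hypothetical 4-page web: 0 -> 1, 2; 1 -> 2; 2 -> 0; 3 -> 2
graph = [[1, 2], [2], [0], [2]]
ranks = pagerank(graph)
```

In this toy graph, page 2 collects links from every other page and ends up with the largest score, while page 3 has no incoming links and keeps only its teleportation share of (1 - alpha) / n.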

Original language: English (US)
Article number: 2100009
Journal: European Journal of Applied Mathematics
State: Accepted/In press - 2021

Bibliographical note

Publisher Copyright:
© The Author(s), 2021. Published by Cambridge University Press.

Keywords

  • Partial differential equations on graphs and networks
  • second-order elliptic equations
  • viscosity solutions
