Abstract
Link prediction in networks is typically accomplished by estimating or ranking the probabilities of edges for all pairs of nodes. In practice, especially for social networks, the data are often collected by egocentric sampling, which means selecting a subset of nodes and recording all of their edges. This sampling mechanism requires different prediction tools than the typical assumption of links missing at random. We propose a new computationally efficient link prediction algorithm for egocentrically sampled networks, estimating the underlying probability matrix by estimating its row space. We empirically evaluate the method on several synthetic and real-world networks and show that it provides accurate predictions for network links. Supplemental materials including the code for experiments are available online.
Original language | English (US) |
---|---|
Pages (from-to) | 1296-1319 |
Number of pages | 24 |
Journal | Journal of Computational and Graphical Statistics |
Volume | 32 |
Issue number | 4 |
DOIs | |
State | Published - 2023 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2023 American Statistical Association and Institute of Mathematical Statistics.
Keywords
- Binary data analysis
- Machine learning
- Network modeling