Unbiased Sampling of Users from (Online) Activity Data

Z.W. Almquist, S. Arya, L. Zeng, E. Spiro

Research output: Contribution to journal, Article, peer-review

1 Scopus citation


Online platforms offer new opportunities to study human behavior. However, while social scientists are often interested in using behavioral trace data (data created by a user over the course of their everyday life) to draw inferences about users, many online platforms only allow data to be sampled based on user activities, which leads to data sets that are biased toward highly active users. Here, we introduce a simple method for reweighting activity-based sample statistics in order to provide descriptive (and potentially model-based) estimates of the user population. We illustrate these techniques by applying them to a case study of an online fitness community (Strava) and use it to explore basic network properties. Last, we explore the effect of the weights on model-based estimates for count data. © The Author(s) 2018.
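The abstract does not spell out the weighting scheme, so the following is only a minimal sketch of one common way to reweight an activity-based sample, not the authors' procedure. It assumes that activities are sampled uniformly at random, so a user's chance of appearing is proportional to their activity count, and that this count is known for every sampled activity. Under those assumptions, weighting each sampled activity by the inverse of its user's activity count (a ratio-style estimator) recovers a user-level mean. The function names and the toy data are hypothetical.

```python
def naive_mean(sample):
    """Unweighted mean over sampled activities (biased toward active users).

    `sample` is a list of (activity_count, value) pairs, one per sampled
    activity, where `value` is an attribute of the user who produced it.
    """
    return sum(value for _, value in sample) / len(sample)


def reweighted_mean(sample):
    """Inverse-activity weighted mean (assumed scheme, see lead-in).

    Each sampled activity gets weight 1 / activity_count, so a user who
    produced many activities does not dominate the estimate.
    """
    numerator = sum(value / count for count, value in sample)
    denominator = sum(1 / count for count, _ in sample)
    return numerator / denominator


# Toy data: user A has 1 activity and value 10; user B has 4 activities
# and value 20. Listing every activity once mimics an activity-based sample.
sample = [(1, 10.0), (4, 20.0), (4, 20.0), (4, 20.0), (4, 20.0)]

print(naive_mean(sample))       # pulled toward the more active user B
print(reweighted_mean(sample))  # matches the mean over the two users
```

In this toy case the true user-level mean is (10 + 20) / 2 = 15, while the unweighted activity-level mean is 18 because user B contributes four of the five activities.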
Original language: English
Journal: Field Methods
State: Published - 2018


