Unbiased Sampling of Users from (Online) Activity Data

Z.W. Almquist, S. Arya, L. Zeng, E. Spiro

Research output: Contribution to journal › Article › peer-review

Abstract

Online platforms offer new opportunities to study human behavior. However, while social scientists are often interested in using behavioral trace data (data created by a user over the course of their everyday life) to draw inferences about users, many online platforms only allow data to be sampled based on user activities, leading to data sets that are biased toward highly active users. Here, we introduce a simple method for reweighting activity-based sample statistics in order to provide descriptive (and potentially model-based) estimates of the user population. We illustrate these techniques by applying them to a case study of an online fitness community (Strava) and use it to explore basic network properties. Last, we explore the weights' effect on model-based estimates for count data. © The Author(s) 2018.
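
The abstract does not spell out the reweighting formula, so the following is only an illustrative sketch of the general idea, not the authors' estimator. It assumes that a user's chance of entering an activity-based sample is proportional to their number of activities, and it corrects for that with inverse-activity weights (a Hajek-style weighted mean). The function name `activity_weighted_mean` and the toy numbers are hypothetical.

```python
from typing import Sequence


def activity_weighted_mean(values: Sequence[float],
                           activity_counts: Sequence[int]) -> float:
    """Weighted mean of a user-level statistic with weights 1 / activity count.

    Assumption (not taken from the paper): each sampled user was drawn with
    probability proportional to their activity count, so weighting by the
    inverse of that count offsets the over-representation of active users.
    """
    if len(values) != len(activity_counts):
        raise ValueError("values and activity_counts must have the same length")
    if not values:
        raise ValueError("at least one sampled user is required")
    if any(a <= 0 for a in activity_counts):
        raise ValueError("activity counts must be positive")

    # Inverse-activity weights: a user seen via 4 activities counts 1/4 as much.
    weights = [1.0 / a for a in activity_counts]
    weighted_sum = sum(w * v for w, v in zip(weights, values))
    return weighted_sum / sum(weights)


# Toy data: three sampled users, their activity counts, and some user-level value.
sampled_values = [10.0, 20.0, 40.0]
sampled_activity = [1, 2, 4]

naive = sum(sampled_values) / len(sampled_values)
reweighted = activity_weighted_mean(sampled_values, sampled_activity)
```

In this toy sample the naive mean is pulled upward by the most active user, while the reweighted mean is lower because that user's contribution is scaled down by their activity count.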
Original language: English
Journal: Field Methods
State: Published - 2018

Bibliographical note

Export Date: 26 December 2018

Article in Press

Correspondence Address: Almquist, Z.W.; Departments of Sociology, School of Statistics, Minnesota Population Center, University of Minnesota, United States; email: almquist@umn.edu
