TY - JOUR
T1 - Analyzing spatiotemporal trends in social media data via smoothing spline analysis of variance
AU - Helwig, Nathaniel E.
AU - Gao, Yizhao
AU - Wang, Shaowen
AU - Ma, Ping
PY - 2015/11/1
Y1 - 2015/11/1
N2 - Social media have become an integral part of life for many individuals, and social media websites generate incredible amounts of data on a variety of societal topics. Furthermore, some social media posts contain geolocation information, so social media data can be viewed as a spatiotemporal phenomenon. To understand spatiotemporal trends in ultra-large sample social media data, we propose a novel application of the Smoothing Spline Analysis of Variance (SSANOVA) framework, which is a nonparametric approach capable of discovering latent functional relationships in noisy data. Unlike currently available approaches, our proposed SSANOVA framework (a) makes few assumptions about the nature of the spatiotemporal trend, (b) provides a mean of assessing the uncertainty of the estimated spatiotemporal trend, and (c) is scalable to analyze massive samples of social media data. To demonstrate the potential of our approach, we model the daily spatiotemporal Twitter trend in the United States. Our results reveal that the proposed SSANOVA approach can provide accurate and informative estimates of spatiotemporal social media trends, as well as useful information about the precision of the estimated spatiotemporal trends.
AB - Social media have become an integral part of life for many individuals, and social media websites generate incredible amounts of data on a variety of societal topics. Furthermore, some social media posts contain geolocation information, so social media data can be viewed as a spatiotemporal phenomenon. To understand spatiotemporal trends in ultra-large sample social media data, we propose a novel application of the Smoothing Spline Analysis of Variance (SSANOVA) framework, which is a nonparametric approach capable of discovering latent functional relationships in noisy data. Unlike currently available approaches, our proposed SSANOVA framework (a) makes few assumptions about the nature of the spatiotemporal trend, (b) provides a mean of assessing the uncertainty of the estimated spatiotemporal trend, and (c) is scalable to analyze massive samples of social media data. To demonstrate the potential of our approach, we model the daily spatiotemporal Twitter trend in the United States. Our results reveal that the proposed SSANOVA approach can provide accurate and informative estimates of spatiotemporal social media trends, as well as useful information about the precision of the estimated spatiotemporal trends.
KW - Smoothing spline
KW - Social media
KW - Spatial smoothing
KW - Spatiotemporal smoothing
UR - http://www.scopus.com/inward/record.url?scp=84948960670&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84948960670&partnerID=8YFLogxK
U2 - 10.1016/j.spasta.2015.09.002
DO - 10.1016/j.spasta.2015.09.002
M3 - Article
AN - SCOPUS:84948960670
VL - 14
SP - 491
EP - 504
JO - Spatial Statistics
JF - Spatial Statistics
SN - 2211-6753
ER -