TY - JOUR
T1 - Quantifying political leaning from tweets, retweets, and retweeters
AU - Wong, Felix Ming Fai
AU - Tan, Chee Wei
AU - Sen, Soumya
AU - Chiang, Mung
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2016/8/1
Y1 - 2016/8/1
N2 - The widespread use of online social networks (OSNs) to disseminate information and exchange opinions, by the general public, news media, and political actors alike, has enabled new avenues of research in computational political science. In this paper, we study the problem of quantifying and inferring the political leaning of Twitter users. We formulate political leaning inference as a convex optimization problem that incorporates two ideas: (a) users are consistent in their actions of tweeting and retweeting about political issues, and (b) similar users tend to be retweeted by similar audience. We then apply our inference technique to 119 million election-related tweets collected in seven months during the 2012 U.S. presidential election campaign. On a set of frequently retweeted sources, our technique achieves 94 percent accuracy and high rank correlation as compared with manually created labels. By studying the political leaning of 1,000 frequently retweeted sources, 232,000 ordinary users who retweeted them, and the hashtags used by these sources, our quantitative study sheds light on the political demographics of the Twitter population, and the temporal dynamics of political polarization as events unfold.
AB - The widespread use of online social networks (OSNs) to disseminate information and exchange opinions, by the general public, news media, and political actors alike, has enabled new avenues of research in computational political science. In this paper, we study the problem of quantifying and inferring the political leaning of Twitter users. We formulate political leaning inference as a convex optimization problem that incorporates two ideas: (a) users are consistent in their actions of tweeting and retweeting about political issues, and (b) similar users tend to be retweeted by similar audience. We then apply our inference technique to 119 million election-related tweets collected in seven months during the 2012 U.S. presidential election campaign. On a set of frequently retweeted sources, our technique achieves 94 percent accuracy and high rank correlation as compared with manually created labels. By studying the political leaning of 1,000 frequently retweeted sources, 232,000 ordinary users who retweeted them, and the hashtags used by these sources, our quantitative study sheds light on the political demographics of the Twitter population, and the temporal dynamics of political polarization as events unfold.
KW - Twitter
KW - convex programming
KW - data analytics
KW - inference
KW - political science
KW - signal processing
UR - http://www.scopus.com/inward/record.url?scp=84978647036&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84978647036&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2016.2553667
DO - 10.1109/TKDE.2016.2553667
M3 - Article
AN - SCOPUS:84978647036
SN - 1041-4347
VL - 28
SP - 2158
EP - 2172
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 8
M1 - 7454756
ER -