TY - JOUR
T1 - Improving data quality of smartphone-based activity–travel survey
T2 - A framework for data post-processing
AU - Zhang, Yaxuan
AU - Song, Ying
AU - Fan, Yingling
N1 - Publisher Copyright:
© 2021 John Wiley & Sons Ltd
PY - 2022/2
Y1 - 2022/2
N2 - The smartphone-based activity–travel survey has emerged as an approach to collect detailed data on individuals’ activities and trips throughout the day. The collected data are usually structured as a series of consecutive activity and trip episodes. Recent studies have used these data to advance our understanding of individuals’ activity–travel patterns. However, few studies have explicitly described how to handle new quality issues of such data. This article develops a framework and methods to systematically detect and handle quality issues in the smartphone-based activity–travel survey data to ensure attribute completeness and logical consistency. For attribute completeness, we check if each episode contains all the required thematic, temporal, and spatial attributes. For logical consistency, we check if two consecutive episodes are logically inconsistent regarding spatial and temporal continuity. We classify invalid episodes into distinct groups using mixture models, clustering, and a transition matrix. For each group, we propose specific improvement methods. To demonstrate our methods, we use data collected in the Twin Cities, Minnesota, USA, as a study case. The results show that our framework can systematically deal with various data quality issues. We also show how data before and after quality control may lead to different observations about individuals' behavior patterns.
AB - The smartphone-based activity–travel survey has emerged as an approach to collect detailed data on individuals’ activities and trips throughout the day. The collected data are usually structured as a series of consecutive activity and trip episodes. Recent studies have used these data to advance our understanding of individuals’ activity–travel patterns. However, few studies have explicitly described how to handle new quality issues of such data. This article develops a framework and methods to systematically detect and handle quality issues in the smartphone-based activity–travel survey data to ensure attribute completeness and logical consistency. For attribute completeness, we check if each episode contains all the required thematic, temporal, and spatial attributes. For logical consistency, we check if two consecutive episodes are logically inconsistent regarding spatial and temporal continuity. We classify invalid episodes into distinct groups using mixture models, clustering, and a transition matrix. For each group, we propose specific improvement methods. To demonstrate our methods, we use data collected in the Twin Cities, Minnesota, USA, as a study case. The results show that our framework can systematically deal with various data quality issues. We also show how data before and after quality control may lead to different observations about individuals' behavior patterns.
UR - http://www.scopus.com/inward/record.url?scp=85118445777&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118445777&partnerID=8YFLogxK
U2 - 10.1111/tgis.12865
DO - 10.1111/tgis.12865
M3 - Article
AN - SCOPUS:85118445777
SN - 1361-1682
VL - 26
SP - 475
EP - 504
JO - Transactions in GIS
JF - Transactions in GIS
IS - 1
ER -