The smartphone-based activity–travel survey has emerged as an approach to collect detailed data on individuals’ activities and trips throughout the day. The collected data are usually structured as a series of consecutive activity and trip episodes. Recent studies have used these data to advance our understanding of individuals’ activity–travel patterns. However, few studies have explicitly described how to handle new quality issues of such data. This article develops a framework and methods to systematically detect and handle quality issues in the smartphone-based activity–travel survey data to ensure attribute completeness and logical consistency. For attribute completeness, we check if each episode contains all the required thematic, temporal, and spatial attributes. For logical consistency, we check if two consecutive episodes are logically inconsistent regarding spatial and temporal continuity. We classify invalid episodes into distinct groups using mixture models, clustering, and a transition matrix. For each group, we propose specific improvement methods. To demonstrate our methods, we use data collected in the Twin Cities, Minnesota, USA, as a study case. The results show that our framework can systematically deal with various data quality issues. We also show how data before and after quality control may lead to different observations about individuals' behavior patterns.
Bibliographical notePublisher Copyright:
© 2021 John Wiley & Sons Ltd