Abstract
Given the wide application of automatic fare collection systems in transit systems across the globe, smartcard data with on- and/or off-boarding information has become a new source of data to understand passenger flow patterns. This paper uses Nanjing, China as a case study and examines the possibility of using the data cube technique in data mining to understand space–time travel patterns of Nanjing rail transit users. One month of smartcard data in October, 2013 was obtained from Nanjing rail transit system, with a total of over 22 million transaction records. We define the original data cube for the smartcard data based on four dimensions—Space, Date, Time, and User, design a hierarchy for each dimension, and use the total number of transactions as the quantitative measure. We develop modules using the programming language Python and share them as open-source on GitHub to enable peer production and advancement in the field. The visualizations of two-dimensional slices of the data cube show some interesting patterns such as different travel behaviors across user groups (e.g. students vs. elders), and irregular peak hours during National Holiday (October 1st–7th) compared to regular morning and afternoon peak hours during regular working weeks. Spatially, multidimensional visualizations show concentrations of various activity opportunities near metro rail stations and the changing popularities of rail stations through time accordingly. These findings support the feasibility and efficiency of the data cube technique as a mean of visual exploratory analysis for massive smart-card data, and can contribute to the evaluation and planning of public transit systems.
Original language | English (US) |
---|---|
Pages (from-to) | 311-333 |
Number of pages | 23 |
Journal | Transportation |
Volume | 45 |
Issue number | 2 |
DOIs | |
State | Published - Mar 1 2018 |
Bibliographical note
Publisher Copyright:© 2017, Springer Science+Business Media New York.
Keywords
- Data cube
- Exploratory data mining
- Smartcard data
- Space–time plot
- Transit
- Travel behavior