skip to main content
research-article

TripImputor: Real-Time Imputing Taxi Trip Purpose Leveraging Multi-Sourced Urban Data

Published: 01 October 2018 Publication History

Abstract

Travel behavior understanding is a long-standing and critically important topic in the area of smart cities. Big volumes of various GPS-based travel data can be easily collected, among which the taxi GPS trajectory data is a typical example. However, in GPS trajectory data, there is usually little information on travelers&#x2019; activities, thereby they can only support limited applications. Quite a few studies have been focused on enriching the semantic meaning for raw data, such as travel mode/purpose inferring. Unfortunately, trip purpose imputation receives relatively less attention and requires no real-time response. To narrow the gap, we propose a probabilistic two-phase framework named <italic>TripImputor</italic>, for making the <italic>real-time</italic> taxi trip purpose imputation and recommending services to passengers at their dropoff points. Specifically, in the first phase, we propose a two-stage clustering algorithm to identify candidate activity areas (CAAs) in the urban space. Then, we extract fine-granularity spatial and temporal patterns of human behaviors inside the CAAs from foursquare check-in data to approximate the priori probability for each activity, and compute the posterior probabilities (i.e., infer the trip purposes) using Bayes&#x2019; theorem. In the second phase, we take a sophisticated procedure that clusters historical dropoff points and matches the dropoff clusters and CAAs to immerse the real-time response. Finally, we evaluate the effectiveness and efficiency of the proposed two-phase framework using real-world data sets, which consist of road network, check-in data generated by over 38 000 users in one year, and the large-scale taxi trip data generated by over 19 000 taxis in a month in Manhattan, New York City, USA. Experimental results demonstrate that the system is able to infer the trip purpose accurately, and can provide recommendation results to passengers within 1.6 s in Manhattan on average, just using a single normal PC.

References

[1]
R. K. Balan, K. X. Nguyen, and L. Jiang, “Real-time trip information service for a large taxi fleet,” in Proc. MobiSys, 2011, pp. 99–112.
[2]
P. S. Castro, D. Zhang, C. Chen, S. Li, and G. Pan, “From taxi GPS traces to social and community dynamics: A survey,” ACM Comput. Surv., vol. 46, no. 2, pp. 17:1–17:34, 2013.
[3]
C. Chen, H. Gong, C. Lawson, and E. Bialostozky, “Evaluating the feasibility of a passive travel survey collection in a complex urban environment: Lessons learned from the New York City case study,” Transp. Res. A, Policy Pract., vol. 44, no. 10, pp. 830–840, 2010.
[4]
C. Chen, Z. Wang, and B. Guo, “The road to the Chinese smart city: Progress, challenges, and future directions,” IT Prof., vol. 18, no. 1, pp. 14–17, Jan./Feb. 2016.
[5]
C. Chenet al., “iBOAT: Isolation-based online anomalous trajectory detection,” IEEE Trans. Intell. Transp. Syst., vol. 14, no. 2, pp. 806–818, Jun. 2013.
[6]
C. Chen, D. Zhang, B. Guo, X. Ma, G. Pan, and Z. Wu, “TripPlanner: Personalized trip planning leveraging heterogeneous crowdsourced digital footprints,” IEEE Trans. Intell. Transp. Syst., vol. 16, no. 3, pp. 1259–1273, Jun. 2015.
[7]
C. Chenet al., “CrowdDeliver: Planning city-wide package delivery paths leveraging the crowd of taxis,” IEEE Trans. Intell. Transp. Syst., vol. 18, no. 6, pp. 1478–1496, Jun. 2017.
[8]
K. J. Clifton and S. L. Handy, “Qualitative methods in travel behaviour research,” in Transport Survey Quality and Innovation. Emerald Group Publishing Limited, 2003, pp. 283–302.
[9]
Z. Deng and M. Ji, “Deriving rules for trip purpose identification from GPS travel survey data and land use data: A machine learning approach,” in Proc. 7th Int. Conf. Traffic Transp. Stud., 2010, pp. 768–777.
[10]
Y. Ding, C. Chen, S. Zhang, B. Guo, Z. Yu, and Y. Wang, “GreenPlanner: Planning personalized fuel-efficient driving routes using multi-sourced urban data,” in Proc. PerCom, Mar. 2017, pp. 207–216.
[11]
M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, “A density-based algorithm for discovering clusters in large spatial databases with noise,” in Proc. KDD, vol. 96. 1996, pp. 226–231.
[12]
T. Feng and H. J. P. Timmermans, “Detecting activity type from GPS traces using spatial and temporal information,” Eur. J. Transp. Infrastruct. Res., vol. 15, no. 4, pp. 662–674, 2015.
[13]
B. Furletti, P. Cintia, C. Renso, and L. Spinsanti, “Inferring human activities from GPS tracks,” in Proc. 2nd ACM SIGKDD Int. Workshop Urban Comput., 2013, p. 5.
[14]
Y. Ge, H. Xiong, A. Tuzhilin, K. Xiao, M. Gruteser, and M. Pazzani, “An energy-efficient mobile recommender system,” in Proc. ACM KDD, 2010, pp. 899–908.
[15]
L. Gong, X. Liu, L. Wu, and Y. Liu, “Inferring trip purposes and uncovering travel patterns from taxi trajectory data,” Cartogr. Geogr. Inf. Sci., vol. 43, no. 2, pp. 103–114, 2016.
[16]
L. Gong, T. Morikawa, T. Yamamoto, and H. Sato, “Deriving personal trip data from GPS data: A literature review on the existing methodologies,” Procedia-Social Behavioral Sci., vol. 138, pp. 557–565, Jul. 2014.
[17]
K. Hormann and A. Agathos, “The point in polygon problem for arbitrary polygons,” Comput. Geometry, vol. 20, no. 3, pp. 131–144, 2001.
[18]
J. Huang, Y. Li, R. Crawfis, S.-C. Lu, and S.-Y. Liou, “A complete distance field representation,” in Proc. Conf. Vis., 2001, pp. 247–254.
[19]
L. Huang, Q. Li, and Y. Yue, “Activity identification from GPS trajectories using spatial temporal POIs’ attractiveness,” in Proc. 2nd ACM SIGSPATIAL Int. Workshop LBSNs, 2010, pp. 27–30.
[20]
C. Kang, X. Ma, D. Tong, and Y. Liu, “Intra-urban human mobility patterns: An urban morphology perspective,” Phys. A, Statist. Mech. Appl., vol. 391, no. 4, pp. 1702–1717, 2012.
[21]
K.-R. Koch, Introduction to Bayesian Statistics. Berlin, Germany: Springer, 2007. [Online]. Available: http://www.springer.com/us/book/9783540727231
[22]
J. Krumm and D. Rouhana, “Placer: Semantic place labels from diary data,” in Proc. ACM Int. Joint Conf. Pervasive Ubiquitous Comput., 2013, pp. 163–172.
[23]
M.-P. Kwan, “How GIS can help address the uncertain geographic context problem in social science research,” Ann. GIS, vol. 18, no. 4, pp. 245–255, 2012.
[24]
H. T. Lam, E. Diaz-Aviles, A. Pascale, Y. Gkoufas, and B. Chen. (2015). “(Blue) taxi destination and trip time prediction from partial trajectories.” [Online]. Available: https://arxiv.org/abs/1509.05257
[25]
X. Li, M. Li, Y.-J. Gong, X.-L. Zhang, and J. Yin, “T-DesP: Destination prediction based on big trajectory data,” IEEE Trans. Intell. Transp. Syst., vol. 17, no. 8, pp. 2344–2354, Aug. 2016.
[26]
Y. Lin, H. Wan, R. Jiang, Z. Wu, and X. Jia, “Inferring the travel purposes of passenger groups for better understanding of passengers,” IEEE Trans. Intell. Transp. Syst., vol. 16, no. 1, pp. 235–243, Feb. 2015.
[27]
Y. Lu and L. Zhang, “Imputing trip purposes for long-distance travel,” Transportation, vol. 42, no. 4, pp. 581–595, 2015.
[28]
D. Newman and A. Paasi, “Fences and neighbours in the postmodern world: Boundary narratives in political geography,” Prog. Hum. Geogr., vol. 22, no. 2, pp. 186–207, 1998.
[29]
T. H. Rashidi, A. Abbasi, M. Maghrebi, S. Hasan, and T. S. Waller, “Exploring the capacity of social media data for modelling travel behaviour: Opportunities and challenges,” Transp. Res. C, Emerg. Technol., vol. 75, pp. 197–211, Feb. 2017.
[30]
S. Schönfelder, “Urban rhythms: Modelling the rhythms of individual travel behaviour,” Ph.D. dissertation, ETH Zurich, Zürich, Switzerland, 2006.
[31]
M. Shimrat, “Algorithm 112: Position of point relative to polygon,” Commun. ACM, vol. 5, no. 8, p. 434, 1962.
[32]
L. Wang, Z. Yu, B. Guo, T. Ku, and F. Yi, “Moving destination prediction using sparse dataset: A mobility gradient descent approach,” ACM Trans. Knowl. Discovery Data, vol. 11, no. 3, p. 37, 2017.
[33]
J. Wolf, “Using GPS data loggers to replace travel diaries in the collection of travel data,” Ph.D. dissertation, Georgia Inst. Technol., Atlanta, GA, USA, 2000.
[34]
A. Y. Xue, R. Zhang, Y. Zheng, X. Xie, J. Huang, and Z. Xu, “Destination prediction by sub-trajectory synthesis and privacy protection against such prediction,” in Proc. IEEE ICDE, Apr. 2013, pp. 254–265.
[35]
D. Yang, D. Zhang, V. W. Zheng, and Z. Yu, “Modeling user activity preference by leveraging user spatial temporal characteristics in LBSNs,” IEEE Trans. Syst., Man, Cybern., Syst., vol. 45, no. 1, pp. 129–142, Jan. 2015.
[36]
Z. Yu, H. Xu, Z. Yang, and B. Guo, “Personalized travel package with multi-point-of-interest recommendation based on crowdsourced user footprints,” IEEE Trans. Human–Mach. Syst., vol. 46, no. 1, pp. 151–158, Feb. 2016.
[37]
N. J. Yuan, Y. Zheng, and X. Xie, “Segmentation of urban areas using road networks,” Microsoft Res., Tech. Rep., 2012.
[38]
Y. Yue, T. Lan, A. G. O. Yeh, and Q.-Q. Li, “Zooming into individuals to understand the collective: A review of trajectory-based travel behaviour studies,” Travel Behaviour Soc., vol. 1, no. 2, pp. 69–78, 2014.
[39]
Y. Zheng, Y. Chen, Q. Li, X. Xie, and W.-Y. Ma, “Understanding transportation modes based on GPS data for Web applications,” ACM Trans. Web, vol. 4, no. 1, p. 1, 2010.
[40]
C. Zhong, S. M. Arisona, X. Huang, M. Batty, and G. Schmitt, “Detecting the dynamics of urban structure through spatial network analysis,” Int. J. Geogr. Inf. Sci., vol. 28, no. 11, pp. 2178–2199, 2014.
[41]
Z. Zhu, U. Blanke, and G. Tröster, “Inferring travel purpose from crowd-augmented human mobility data,” in Proc. 1st Int. Conf. IoT Urban Space, 2014, pp. 44–49.

Cited By

View all

Index Terms

  1. TripImputor: Real-Time Imputing Taxi Trip Purpose Leveraging Multi-Sourced Urban Data
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image IEEE Transactions on Intelligent Transportation Systems
        IEEE Transactions on Intelligent Transportation Systems  Volume 19, Issue 10
        Oct. 2018
        342 pages

        Publisher

        IEEE Press

        Publication History

        Published: 01 October 2018

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 01 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media