DocumentCode
3132276
Title
Hunting or waiting? Discovering passenger-finding strategies from a large-scale real-world taxi dataset
Author
Li, Bin ; Zhang, Daqing ; Sun, Lin ; Chen, Chao ; Li, Shijian ; Qi, Guande ; Yang, Qiang
Author_Institution
Inst. Telecome SudParis, Evry, France
fYear
2011
fDate
21-25 March 2011
Firstpage
63
Lastpage
68
Abstract
In modern cities, more and more vehicles, such as taxis, have been equipped with GPS devices for localization and navigation. Gathering and analyzing these large-scale real-world digital traces have provided us an unprecedented opportunity to understand the city dynamics and reveal the hidden social and economic “realities”. One innovative pervasive application is to provide correct driving strategies to taxi drivers according to time and location. In this paper, we aim to discover both efficient and inefficient passenger-finding strategies from a large-scale taxi GPS dataset, which was collected from 5350 taxis for one year in a large city of China. By representing the passenger-finding strategies in a Time-Location-Strategy feature triplet and constructing a train/test dataset containing both top- and ordinary-performance taxi features, we adopt a powerful feature selection tool, L1-Norm SVM, to select the most salient feature patterns determining the taxi performance. We find that the selected patterns can well interpret the empirical study results derived from raw data analysis and even reveal interesting hidden “facts”. Moreover, the taxi performance predictor built on the selected features can achieve a prediction accuracy of 85.3% on a new test dataset, and it also outperforms the one based on all the features, which implies that the selected features are indeed the right indicators of the passenger-finding strategies.
Keywords
Global Positioning System; data mining; data visualisation; road traffic; support vector machines; traffic engineering computing; Global Positioning System; L1-norm SVM; data analysis; digital traces; passenger-finding strategy; pervasive application; support vector machines; taxi GPS dataset; taxi driving strategy; time-location-strategy feature triplet; Airports; Cities and towns; Data mining; Driver circuits; Global Positioning System; Support vector machines; Urban areas; GPS; Large-scale Data; Passenger-Finding Strategy; Reality Mining; Taxi Data Mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Pervasive Computing and Communications Workshops (PERCOM Workshops), 2011 IEEE International Conference on
Conference_Location
Seattle, WA
Print_ISBN
978-1-61284-938-6
Electronic_ISBN
978-1-61284-936-2
Type
conf
DOI
10.1109/PERCOMW.2011.5766967
Filename
5766967
Link To Document