DocumentCode :
3132276
Title :
Hunting or waiting? Discovering passenger-finding strategies from a large-scale real-world taxi dataset
Author :
Li, Bin ; Zhang, Daqing ; Sun, Lin ; Chen, Chao ; Li, Shijian ; Qi, Guande ; Yang, Qiang
Author_Institution :
Inst. Telecome SudParis, Evry, France
fYear :
2011
fDate :
21-25 March 2011
Firstpage :
63
Lastpage :
68
Abstract :
In modern cities, more and more vehicles, such as taxis, have been equipped with GPS devices for localization and navigation. Gathering and analyzing these large-scale real-world digital traces have provided us an unprecedented opportunity to understand the city dynamics and reveal the hidden social and economic “realities”. One innovative pervasive application is to provide correct driving strategies to taxi drivers according to time and location. In this paper, we aim to discover both efficient and inefficient passenger-finding strategies from a large-scale taxi GPS dataset, which was collected from 5350 taxis for one year in a large city of China. By representing the passenger-finding strategies in a Time-Location-Strategy feature triplet and constructing a train/test dataset containing both top- and ordinary-performance taxi features, we adopt a powerful feature selection tool, L1-Norm SVM, to select the most salient feature patterns determining the taxi performance. We find that the selected patterns can well interpret the empirical study results derived from raw data analysis and even reveal interesting hidden “facts”. Moreover, the taxi performance predictor built on the selected features can achieve a prediction accuracy of 85.3% on a new test dataset, and it also outperforms the one based on all the features, which implies that the selected features are indeed the right indicators of the passenger-finding strategies.
Keywords :
Global Positioning System; data mining; data visualisation; road traffic; support vector machines; traffic engineering computing; Global Positioning System; L1-norm SVM; data analysis; digital traces; passenger-finding strategy; pervasive application; support vector machines; taxi GPS dataset; taxi driving strategy; time-location-strategy feature triplet; Airports; Cities and towns; Data mining; Driver circuits; Global Positioning System; Support vector machines; Urban areas; GPS; Large-scale Data; Passenger-Finding Strategy; Reality Mining; Taxi Data Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pervasive Computing and Communications Workshops (PERCOM Workshops), 2011 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
978-1-61284-938-6
Electronic_ISBN :
978-1-61284-936-2
Type :
conf
DOI :
10.1109/PERCOMW.2011.5766967
Filename :
5766967
Link To Document :
بازگشت