• DocumentCode
    3132276
  • Title

    Hunting or waiting? Discovering passenger-finding strategies from a large-scale real-world taxi dataset

  • Author

    Li, Bin ; Zhang, Daqing ; Sun, Lin ; Chen, Chao ; Li, Shijian ; Qi, Guande ; Yang, Qiang

  • Author_Institution
    Inst. Telecome SudParis, Evry, France
  • fYear
    2011
  • fDate
    21-25 March 2011
  • Firstpage
    63
  • Lastpage
    68
  • Abstract
    In modern cities, more and more vehicles, such as taxis, have been equipped with GPS devices for localization and navigation. Gathering and analyzing these large-scale real-world digital traces have provided us an unprecedented opportunity to understand the city dynamics and reveal the hidden social and economic “realities”. One innovative pervasive application is to provide correct driving strategies to taxi drivers according to time and location. In this paper, we aim to discover both efficient and inefficient passenger-finding strategies from a large-scale taxi GPS dataset, which was collected from 5350 taxis for one year in a large city of China. By representing the passenger-finding strategies in a Time-Location-Strategy feature triplet and constructing a train/test dataset containing both top- and ordinary-performance taxi features, we adopt a powerful feature selection tool, L1-Norm SVM, to select the most salient feature patterns determining the taxi performance. We find that the selected patterns can well interpret the empirical study results derived from raw data analysis and even reveal interesting hidden “facts”. Moreover, the taxi performance predictor built on the selected features can achieve a prediction accuracy of 85.3% on a new test dataset, and it also outperforms the one based on all the features, which implies that the selected features are indeed the right indicators of the passenger-finding strategies.
  • Keywords
    Global Positioning System; data mining; data visualisation; road traffic; support vector machines; traffic engineering computing; Global Positioning System; L1-norm SVM; data analysis; digital traces; passenger-finding strategy; pervasive application; support vector machines; taxi GPS dataset; taxi driving strategy; time-location-strategy feature triplet; Airports; Cities and towns; Data mining; Driver circuits; Global Positioning System; Support vector machines; Urban areas; GPS; Large-scale Data; Passenger-Finding Strategy; Reality Mining; Taxi Data Mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pervasive Computing and Communications Workshops (PERCOM Workshops), 2011 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    978-1-61284-938-6
  • Electronic_ISBN
    978-1-61284-936-2
  • Type

    conf

  • DOI
    10.1109/PERCOMW.2011.5766967
  • Filename
    5766967