DocumentCode :
36440
Title :
Why so many people? Explaining Nonhabitual Transport Overcrowding With Internet Data
Author :
Pereira, Francisco C. ; Rodrigues, Filipe ; Polisciuc, Evgheni ; Ben-Akiva, Moshe
Author_Institution :
Singapore-MIT Alliance for Res. & Technol., Singapore, Singapore
Volume :
16
Issue :
3
fYear :
2015
fDate :
Jun-15
Firstpage :
1370
Lastpage :
1379
Abstract :
Public transport smartcard data can be used for detection of large crowds. By comparing statistics on habitual behavior (e.g., average by time of day), one can specifically identify nonhabitual crowds, which are often very problematic for transport systems. While habitual overcrowding (e.g., peak hour) is well understood both by traffic managers and travelers, nonhabitual overcrowding hotspots can become even more disruptive and unpleasant because they are generally unexpected. By quickly understanding such cases, a transport manager can react and mitigate transport system disruptions. We propose a probabilistic data analysis model that breaks each nonhabitual overcrowding hotspot into a set of explanatory components. The potential explanatory components are initially retrieved from social networks and special events websites and then processed through text-analysis techniques. Finally, for each such component, the probabilistic model estimates a specific share in the total overcrowding counts. We first validate with synthetic data and then test our model with real data from the public transport system (EZLink) of Singapore, focused on three case study areas. We demonstrate that it is able to generate explanations that are intuitively plausible and consistent both locally (correlation coefficient, i.e., CC, from 85% to 99% for the three areas) and globally (CC from 41.2% to 83.9%). This model is directly applicable to any other domain sensitive to crowd formation due to large social events (e.g., communications, water, energy, waste).
Keywords :
Internet; behavioural sciences computing; data analysis; information retrieval; probability; public transport; smart cards; social networking (online); text analysis; traffic engineering computing; Internet data; Web sites; correlation coefficient; crowd formation; habitual behavior; nonhabitual transport overcrowding hotspot; probabilistic data analysis; probabilistic model; public transport smart card data; public transport system; social events; social networks; text-analysis techniques; traffic managers; Bayes methods; Data models; Facebook; Google; Intelligent transportation systems; Internet; Predictive models; Information extraction; machine learning; smartcards; special events; travel demand modeling; web mining;
fLanguage :
English
Journal_Title :
Intelligent Transportation Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1524-9050
Type :
jour
DOI :
10.1109/TITS.2014.2368119
Filename :
7021960
Link To Document :
بازگشت