DocumentCode :
175142
Title :
Large Imbalance Data Classification Based on MapReduce for Traffic Accident Prediction
Author :
Seoung-hun Park ; Young-guk Ha
Author_Institution :
Dept. of Comput. Sci. & Eng., Konkuk Univ., Seoul, South Korea
fYear :
2014
fDate :
2-4 July 2014
Firstpage :
45
Lastpage :
49
Abstract :
In modern society, our everyday life has a close connection with traffic issues. One of the most burning issues is about predicting traffic accidents. Predicting accidents on the road can be achieved by classification analysis, a data mining procedure requiring enough data to build a learning model. Regarding building such a predicting system, there are several problems. It requires lots of hardware resources to collect traffic data and analyze it for predicting traffic accidents since the data is very huge. Furthermore, data related to traffic accidents is few comparing with data which is not related to them. The numbers of two types of data are imbalanced. The purpose of this paper is to build a predicting model that can resolve all these problems. This paper suggests using Hadoop framework to process and analyze big traffic data efficiently and a sampling method to resolve the problem of data imbalance. Based on this, the predicting system, first of all, preprocess traffic big data and analyzes it to create data for the learning system. The imbalance of created data is corrected by a sampling method. To improve predicting accuracy, corrected data is classified into several groups, to which classification analysis is applied. These analysis steps are processed by Hadoop framework.
Keywords :
Big Data; data analysis; data mining; pattern classification; road accidents; road traffic; sampling methods; traffic engineering computing; Hadoop framework; MapReduce; data mining procedure; imbalance data classification; road traffic accident prediction; sampling method; traffic big data analysis; Accidents; Accuracy; Data mining; Logistics; Roads; Training; Accident prediction; Big-data inference; Classification; Imbalance data; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS), 2014 Eighth International Conference on
Conference_Location :
Birmingham
Print_ISBN :
978-1-4799-4333-3
Type :
conf
DOI :
10.1109/IMIS.2014.6
Filename :
6975440
Link To Document :
بازگشت