• DocumentCode
    175142
  • Title

    Large Imbalance Data Classification Based on MapReduce for Traffic Accident Prediction

  • Author

    Seoung-hun Park ; Young-guk Ha

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Konkuk Univ., Seoul, South Korea
  • fYear
    2014
  • fDate
    2-4 July 2014
  • Firstpage
    45
  • Lastpage
    49
  • Abstract
    In modern society, our everyday life has a close connection with traffic issues. One of the most burning issues is about predicting traffic accidents. Predicting accidents on the road can be achieved by classification analysis, a data mining procedure requiring enough data to build a learning model. Regarding building such a predicting system, there are several problems. It requires lots of hardware resources to collect traffic data and analyze it for predicting traffic accidents since the data is very huge. Furthermore, data related to traffic accidents is few comparing with data which is not related to them. The numbers of two types of data are imbalanced. The purpose of this paper is to build a predicting model that can resolve all these problems. This paper suggests using Hadoop framework to process and analyze big traffic data efficiently and a sampling method to resolve the problem of data imbalance. Based on this, the predicting system, first of all, preprocess traffic big data and analyzes it to create data for the learning system. The imbalance of created data is corrected by a sampling method. To improve predicting accuracy, corrected data is classified into several groups, to which classification analysis is applied. These analysis steps are processed by Hadoop framework.
  • Keywords
    Big Data; data analysis; data mining; pattern classification; road accidents; road traffic; sampling methods; traffic engineering computing; Hadoop framework; MapReduce; data mining procedure; imbalance data classification; road traffic accident prediction; sampling method; traffic big data analysis; Accidents; Accuracy; Data mining; Logistics; Roads; Training; Accident prediction; Big-data inference; Classification; Imbalance data; MapReduce;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS), 2014 Eighth International Conference on
  • Conference_Location
    Birmingham
  • Print_ISBN
    978-1-4799-4333-3
  • Type

    conf

  • DOI
    10.1109/IMIS.2014.6
  • Filename
    6975440