Title :
Research on statistics-based model for E-commerce user purchase prediction
Author :
Dong, Huailin ; Xie, Lingwei ; Zhang, Zhongnan
Author_Institution :
Software School, Xiamen University, Xiamen, China
Abstract :
This paper describes our work for ALIDATA DISCOVERY competition. Through analyzing massive real-world user action data provided by Tmall, one of the largest B2C online retail platforms in China, we try to predict future user purchases. The prediction results are judged by F1 Score that is consist of two parts, precision and recall rate. The provided data set contains more than 500 million action records from over 12 million distinct users. Such a massive data set drives us to finish the task in MapReduce fashion on the Open Data Processing Service (ODPS) platform. According to statistical results, we classify all users into different groups firstly. Then the rule model, timing model, statistics model are adopted for predicting future user purchases. By comparison, the statistics model obtains the best F1Score.
Keywords :
Big data; Computational modeling; Data mining; Data models; Data preprocessing; Predictive models; Timing; E-commerce user; MapReduce; behavior data; purchase prediction;
Conference_Titel :
Computer Science & Education (ICCSE), 2015 10th International Conference on
Conference_Location :
Cambridge, United Kingdom
Print_ISBN :
978-1-4799-6598-4
DOI :
10.1109/ICCSE.2015.7250308