Title :
A Web content based data mining for car consumption preference in China
Author :
Hang, Xiaoshu ; Dai, Honghua ; Zhang, Youhua
Author_Institution :
Sch. of Inf. Technol., Deakin Univ., Geelong, Vic., Australia
Abstract :
This paper introduces an incremental FP-Growth approach for Web content based data mining and its application in solving a real world problem The problem is solved in the following ways. Firstly, we obtain the semi-structured data from the Web pages of Chinese car market and structure them and save them in local database. Secondly, we use an incremental FP-Growth algorithm for mining association rules to discover Chinese consumers´ car consumption preference. To find more general regularities, an attribute-oriented induction method is also utilized to find customer´s consumption preference among a range of car categories. Experimental results have revealed some interesting consumption preferences that are useful for the decision makers to make the policy to encourage and guide car consumption. Although the current data we used may not be the best representative of the actual market in practice, it is still good enough for the decision making purpose in terms of reflecting the real situation of car consumption preference under the two assumptions in the context.
Keywords :
Internet; Web sites; automobile industry; customer satisfaction; data mining; decision making; electronic commerce; FP-Growth algorithm; Web content; Web pages; World Wide Web; attribute-oriented induction; car consumption; car market; data mining; decision making; mining association rules; Association rules; Australia; Data mining; Databases; Decision making; Explosives; Information retrieval; Internet; Web mining; Web pages;
Conference_Titel :
Information Reuse and Integration, 2003. IRI 2003. IEEE International Conference on
Print_ISBN :
0-7803-8242-0
DOI :
10.1109/IRI.2003.1251419