DocumentCode :
2178070
Title :
Web Site Mining Using Entropy Estimation
Author :
Sonawane, Vijay R.; Halkarnikar, P.P.
Author_Institution :
Dept. of Comput. Eng., Amrutvahini Coll. of Eng., Sangamner, India
fYear :
2010
fDate :
9-10 Feb. 2010
Firstpage :
225
Lastpage :
229
Abstract :
With the rapid growth of the Web, there is an ever-increasing volume of data and information published in numerous Web pages. Web mining aims to develop new techniques to effectively extract and mine useful knowledge or information from these Web pages, allowing users to easily locate a desired object within a huge volume of data. In this paper, we propose a simple Web site mining technique that mines product information from the pages of an e-commerce Web site. To do this, we take advantage of the hierarchical structure of HTML. First, the technique discovers the set of product descriptions based on a measure of entropy at each node in the HTML tag tree of the retrieved Web page. Afterward, a set of association rules based on heuristic features is applied to improve the accuracy of product extraction.
Keywords :
Web sites; data mining; electronic commerce; entropy; hypermedia markup languages; information retrieval; HTML language; Web page retrieval; Web site mining technique; association rules; e-commercial Web site; entropy estimation; information extraction; product extraction; Data engineering; Memory; association rule; filter; product description; representative value
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Storage and Data Engineering (DSDE), 2010 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4244-5678-9
Type :
conf
DOI :
10.1109/DSDE.2010.19
Filename :
5452579