Title : 
Study and application of web data mining based on XML
         
        
            Author : 
Zhang, Pengwei ; Chen, Jingxia
         
        
            Author_Institution : 
Electr. & Inf. Eng. Coll., Shaanxi Univ. of Sci. & Technol., Xi'an, China
         
        
        
        
        
        
            Abstract : 
With the development of information technologies, web data mining has emerged and is now widely researched. It is defined as the discovery, extraction, and analysis of potentially useful information from the World Wide Web. However, web pages contain large amounts of heterogeneous, irregular, and dynamically updated semi-structured data, which makes web data mining difficult. To address this problem, the paper analyzes the characteristics of XML, presents an XML-based web data mining model, and describes in detail, through a worked instance, how to implement the model with XML and Java technologies. Finally, it discusses the shortcomings of the model.
         
        
            Keywords : 
Data engineering; Data mining; Databases; Educational institutions; Educational technology; HTML; Information technology; Java; Paper technology; XML; RDF; Web; semi-structured
         
        
        
        
            Conference_Title : 
Educational and Network Technology (ICENT), 2010 International Conference on
         
        
            Conference_Location : 
Qinhuangdao, China
         
        
            Print_ISBN : 
978-1-4244-7660-2
         
        
            Electronic_ISBN : 
978-1-4244-7662-6
         
        
        
            DOI : 
10.1109/ICENT.2010.5532169