Title : 
Learning Web Page Block Functions using Roles of Images
         
        
            Author : 
Yang, Xin ; Shi, Yuanchun
         
        
            Author_Institution : 
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing
         
        
        
        
        
        
            Abstract : 
Making use of block information in Web IR and Data Mining tasks calls for a good understanding of the function of each block. Existing works on classifying block functions and judging block importance have not made full use of the image factor, and only simple image features were considered. We regard image as a strong indicator of Web page blocks with various functions and propose to learn block functions using roles of images as part of block features. Blocks are generated from Web page segmentation and roles of images are automatically decided by image classification. We experiment on 140 Web pages and demonstrate that utilizing roles of images can significantly improve the classification quality of learning Web page block functions. We also measure the usefulness of different roles of images and evaluate the effect of two page segmentation methods.
         
        
            Keywords : 
Web sites; data mining; image classification; image segmentation; learning (artificial intelligence); Web page block function learning; data mining; image classification; image segmentation; machine learning; Computer science; Data mining; HTML; Handheld computers; Image classification; Image segmentation; Machine learning; Machine learning algorithms; Pervasive computing; Web pages; Block Function; Machine Learning; Role of Image; Web Page Block;
         
        
        
        
            Conference_Titel : 
Pervasive Computing and Applications, 2008. ICPCA 2008. Third International Conference on
         
        
            Print_ISBN : 
978-1-4244-2020-9
         
        
            Electronic_ISBN : 
978-1-4244-2021-6
         
        
        
            DOI : 
10.1109/ICPCA.2008.4783565