Title : 
Research on Automatic Classification for Deep Web Query Interfaces
         
        
            Author : 
Lin, Peiguang ; Du, Yibing ; Tan, Xiaohua ; LV, Chao
         
        
            Author_Institution : 
Sch. of Comput. & Inf. Eng., Shandong Univ. of Finance, Jinan
         
        
        
        
        
        
            Abstract : 
In recent years, the Web is "deepened" rapidly and users have to browse quantities of Web sites to access Web databases in a specific domain. So, to build an unified query interface which integrates query interfaces of a domain to access various Web databases at the same time becomes a very important issue. In this paper, the schema characteristics of query interfaces and common attributes in a same domain are firstly analyzed, and it also gives a new representation of query interface, then the definition of "Form term" and "Function term" are proposed ,and a new similarity computing algorithm, literal and semantic based similarity computing (LSSC) is proposed, which is based on the two definitions. Secondly, a clustering algorithm for Deep Web query interfaces is given by combining LSSC and NQ algorithm: LSSC-NQ. Finally, experiments show that this algorithm can give accurate similarity computing, and cluster query interfaces efficiently, reliably and quickly.
         
        
            Keywords : 
Internet; classification; query processing; user interfaces; Web database access; Web site browsing; automatic classification; deep Web query interface clustering; literal similarity computing algorithm; semantic based similarity computing algorithm; unified query interface representation; Algorithm design and analysis; Chaos; Clustering algorithms; Clustering methods; Computer interfaces; Concrete; Data engineering; Databases; Finance; Information processing; Automatic Classification; Deep Web; Query Interface;
         
        
        
        
            Conference_Titel : 
Information Processing (ISIP), 2008 International Symposiums on
         
        
            Conference_Location : 
Moscow
         
        
            Print_ISBN : 
978-0-7695-3151-9
         
        
        
            DOI : 
10.1109/ISIP.2008.140