DocumentCode :
1571189
Title :
Deep into web general vs vertical search engine design based on secure and QoS
Author :
Da-quan, Wang ; Tian, Wang ; Lin, Zhang ; Ai-ping, Wu ; Qi-li, Zhou ; Xiao-kai, Wu
Author_Institution :
Comput. Coll., Hangzhou Dianzi Univ., Hangzhou, China
Volume :
1
fYear :
2011
Firstpage :
847
Lastpage :
851
Abstract :
Vertical search engines are targeted to specific areas of the network information of the coverage is relatively high, with a reliable technical and information resources and support, with clear targeting search effectively compensate for a comprehensive search engine on a specific topic areas of expertise and information coverage too low. Mainly by the vertical search engine focused crawler module, the index module, search module, user interface components such as 4, it is the first to use the module from the specified URL reptiles seed starts to crawl, to crawl down the web page content analysis, determine the required after the extraction of information for the structured data, and then the data on the structure of Chinese words segmentation and indexing, and generate an index database, and finally create web pages for users to query the module to search. Database storage is a prerequisite for building the search. Foreground is the search engine system with the user interface.
Keywords :
Internet; Web sites; indexing; information retrieval; natural language processing; quality of service; search engines; Chinese words segmentation; QoS; URL reptiles; Web general search engine design; Web page content analysis; Web pages; Web vertical search engine design; clear targeting search; crawler module; index database; indexing; information extraction; information resources; network information; search engine system; structured data; user interface; Economics; HTML; Indexing; Information filters; Message systems; Web; crawling; database; engine; index;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9792-8
Type :
conf
DOI :
10.1109/CSQRWC.2011.6037083
Filename :
6037083
Link To Document :
بازگشت