Title :
Deep into web general vs vertical search engine design based on secure and QoS
Author :
Da-quan, Wang ; Tian, Wang ; Lin, Zhang ; Ai-ping, Wu ; Qi-li, Zhou ; Xiao-kai, Wu
Author_Institution :
Comput. Coll., Hangzhou Dianzi Univ., Hangzhou, China
Abstract :
Vertical search engines target specific subject areas, where their coverage of network information is relatively high, and they are backed by reliable technical and information resources. With clearly targeted search, they effectively compensate for the low coverage of specialized, topic-specific information in general-purpose search engines. A vertical search engine consists mainly of four components: a focused crawler module, an index module, a search module, and a user interface. The crawler module first starts crawling from specified seed URLs. The content of the downloaded web pages is then analyzed, and the required information is extracted into structured data. The structured data undergoes Chinese word segmentation and indexing to generate an index database, and finally web pages are created through which users query the search module. Database storage is a prerequisite for building the search. The foreground is the search engine system's user interface.
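The pipeline outlined in the abstract (crawl from a seed URL, analyze pages, extract structured data, segment, index, search) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the in-memory FAKE_WEB dictionary, the example.test URLs, the topic_terms filter, and the segment function are all hypothetical. In particular, segment is only a stand-in that splits CJK text into single characters; the abstract does not say which Chinese word segmentation method the paper uses.

```python
import re
from collections import defaultdict, deque
from html.parser import HTMLParser


class PageParser(HTMLParser):
    """Collects the title, visible text, and outgoing links of one page."""

    def __init__(self):
        super().__init__()
        self.title, self.text, self.links = "", [], []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        else:
            self.text.append(data)


def segment(text):
    # Assumption: stand-in for Chinese word segmentation. Latin words and
    # digits stay whole; each CJK character becomes its own token.
    return re.findall(r"[a-z0-9]+|[\u4e00-\u9fff]", text.lower())


def crawl(seed, fetch, topic_terms, limit=50):
    """Focused crawl: breadth-first from the seed, keeping on-topic pages only."""
    queue, seen, records = deque([seed]), {seed}, []
    while queue and len(records) < limit:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        parser = PageParser()
        parser.feed(html)
        body = " ".join(" ".join(parser.text).split())
        if not set(segment(parser.title + " " + body)) & topic_terms:
            continue  # off-topic page: neither stored nor expanded
        # Structured record extracted from the page.
        records.append({"url": url, "title": parser.title.strip(), "body": body})
        for link in parser.links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return records


def build_index(records):
    """Inverted index: token -> set of record positions."""
    index = defaultdict(set)
    for doc_id, rec in enumerate(records):
        for token in segment(rec["title"] + " " + rec["body"]):
            index[token].add(doc_id)
    return index


def search(query, index, records):
    """AND query: URLs of records that contain every query token."""
    tokens = segment(query)
    if not tokens:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in tokens))
    return [records[i]["url"] for i in sorted(hits)]


# Hypothetical three-page "web" used in place of real HTTP fetching.
FAKE_WEB = {
    "http://example.test/a": '<html><title>Laptop deals</title><body>cheap laptop '
    '<a href="http://example.test/b">more</a> '
    '<a href="http://example.test/c">news</a></body></html>',
    "http://example.test/b": "<html><title>Laptop review</title>"
    "<body>fast laptop battery</body></html>",
    "http://example.test/c": "<html><title>Weather</title>"
    "<body>rain tomorrow</body></html>",
}

records = crawl("http://example.test/a", FAKE_WEB.get, {"laptop"})
index = build_index(records)
print(search("laptop battery", index, records))
```

In this sketch the off-topic weather page is dropped at crawl time, which is the point of a focused crawler: the index only ever holds pages from the target subject area. A real system would replace FAKE_WEB.get with an HTTP fetcher and persist both the records and the index in a database.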
Keywords :
Internet; Web sites; indexing; information retrieval; natural language processing; quality of service; search engines; Chinese words segmentation; QoS; URL reptiles; Web general search engine design; Web page content analysis; Web pages; Web vertical search engine design; clear targeting search; crawler module; index database; information extraction; information resources; network information; search engine system; structured data; user interface; Economics; HTML; Information filters; Message systems; Web; crawling; database; engine; index
Conference_Title :
Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9792-8
DOI :
10.1109/CSQRWC.2011.6037083