DocumentCode :
2075696
Title :
A method of focused crawling for software components
Author :
Xu, Liping ; Eli, Samat ; Xu, Haiyin
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
fYear :
2011
fDate :
16-18 Dec. 2011
Firstpage :
1560
Lastpage :
1563
Abstract :
The number of vertical search engines has increased rapidly in recent years, raising the importance of focused crawlers. This paper introduces the design and implementation of a focused crawler for software components. Before computing the similarity of a page to the topic, the crawler analyzes the page's URL to decide whether that computation is necessary. This leads to significant savings in hardware and network resources and helps the crawler avoid computing similarity for irrelevant pages.
Keywords :
information retrieval; search engines; software reusability; URL analysis; focused crawler; focused crawling method; irrelevant page similarity computation avoidance; software components; vertical search engines; Crawlers; Educational institutions; Libraries; Search engines; Software; Vectors; Web pages; focused crawler; similarity computation; software component
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Transportation, Mechanical, and Electrical Engineering (TMEE), 2011 International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4577-1700-0
Type :
conf
DOI :
10.1109/TMEE.2011.6199506
Filename :
6199506