Title :
A method of focused crawling for software components
Author :
Xu, Liping ; Eli, Samat ; Xu, Haiyin
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Abstract :
The number of vertical search engines has increased rapidly in recent years, which makes focused crawlers increasingly important. This paper presents the design and implementation of a focused crawler for software components. Before computing the similarity of a page to the topic, the crawler analyzes the page's URL to decide whether that computation is necessary. This yields significant savings in hardware and network resources and helps the crawler avoid computing similarity for irrelevant pages.
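The URL-first idea in the abstract can be illustrated with a minimal sketch. The abstract does not state which URL features or which similarity measure the authors use, so everything named here is an assumption made for illustration: the extension list, the hint words, the topic terms, the helper names, and the choice of cosine similarity over term counts.

```python
import math
import re
from collections import Counter
from urllib.parse import urlparse

# Hypothetical rules; the paper's actual URL criteria are not given in the abstract.
SKIP_EXTENSIONS = (".jpg", ".png", ".gif", ".css", ".js", ".pdf", ".zip")
TOPIC_HINTS = ("component", "library", "plugin", "module", "download")
# Hypothetical topic description as a bag of words.
TOPIC_TERMS = Counter(["software", "component", "library", "reuse", "api"])


def url_worth_scoring(url):
    """Cheap check on the URL alone, done before any page is fetched."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https"):
        return False
    if parsed.path.lower().endswith(SKIP_EXTENSIONS):
        return False
    return any(hint in url.lower() for hint in TOPIC_HINTS)


def cosine_similarity(a, b):
    """Cosine similarity between two term-count vectors (Counter objects)."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def score_page(url, fetch_text):
    """Return a similarity score, or None when the URL check rejects the page.

    fetch_text is a caller-supplied function (an assumption of this sketch)
    that downloads a URL and returns its text. It is only called for URLs
    that pass the check, which is where the resource savings come from.
    """
    if not url_worth_scoring(url):
        return None
    words = re.findall(r"[a-z]+", fetch_text(url).lower())
    return cosine_similarity(Counter(words), TOPIC_TERMS)
```

In this sketch a rejected URL costs one string inspection, while an accepted URL costs a download plus a vector comparison, so the benefit grows with the share of links that can be ruled out from the URL alone.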
Keywords :
information retrieval; search engines; software reusability; URL analysis; focused crawler; focused crawling method; irrelevant page similarity computation avoidance; software components; vertical search engines; Crawlers; Educational institutions; Libraries; Search engines; Software; Vectors; Web pages; similarity computation; software component
Conference_Title :
Transportation, Mechanical, and Electrical Engineering (TMEE), 2011 International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4577-1700-0
DOI :
10.1109/TMEE.2011.6199506