DocumentCode
2860039
Title
A Scalable Topic-Based Open Source Search Engine
Author
Buntine, Wray ; Löfström, Jaakko ; Perkiö, Jukka ; Perttu, Sami ; Poroshin, Vladimir ; Silander, Tomi ; Tirri, Henry ; Tuominen, Antti ; Tuulos, Ville
Author_Institution
Helsinki Institute for Information Technology, Finland
fYear
2004
fDate
20-24 Sept. 2004
Firstpage
228
Lastpage
234
Abstract
Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topic-based search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.
Keywords
Business; Councils; Crawlers; Information retrieval; Information technology; Internet; Open source software; Packaging; Search engines; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2100-2
Type
conf
DOI
10.1109/WI.2004.10094
Filename
1410808
Link To Document