Title :
Quality Information Retrieval for the World Wide Web
Author :
Kc, Milly ; Hagenbuchner, Markus ; Tsoi, Ah Chung
Author_Institution :
Univ. of Wollongong, Wollongong, NSW
Abstract :
The World Wide Web is an unregulated communication medium which exhibits very limited means of quality control. Quality assurance has become a key issue for many information retrieval services on the Internet, e.g. web search engines. This paper introduces some quality evaluation and assessment methods to assess the quality of Web pages. The proposed quality evaluation mechanisms are based on a set of quality criteria which were extracted from a targeted user survey. A weighted algorithmic interpretation of the most significant user quoted quality criteria is proposed. In addition, the paper utilizes machine learning methods to produce a prediction of quality for Web pages before they are downloaded. The set of quality criteria allows us to implement a Web search engine with quality ranking schemes, leading to Web crawlers which can crawl directly quality Web pages. The proposed approaches produce some very promising results on a sizeable Web repository.
Keywords :
Internet; information retrieval; learning (artificial intelligence); quality assurance; search engines; Internet; Web crawlers; Web pages; Web repository; Web search engine; World Wide Web; information retrieval; machine learning; quality assurance; quality evaluation mechanisms; Data mining; Information retrieval; Machine learning algorithms; Quality assurance; Quality control; Search engines; Web and internet services; Web pages; Web search; Web sites; Crawling; Quality assessment; Quality information; Web page retrieval;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.378