DocumentCode :
3263466
Title :
Hyper-Textual Language Model for web information retrieval
Author :
Xie, Ying ; Raghavan, Vijay V. ; Young, Andrew
Author_Institution :
Dept. of Comput. Sci. & Inf. Syst., Kennesaw State Univ., Kennesaw, GA
fYear :
2008
fDate :
26-28 Aug. 2008
Firstpage :
68
Lastpage :
73
Abstract :
In this paper, we propose a unified retrieval model that is called the hyper-textual language model for Web information retrieval. The proposed model seamlessly integrates information from multiple sources including Web content, URL, hyperlinks, and the topology of the Web in a unified modeling framework. On the one hand, this model extends the language modeling technique to accommodate special structural and semantic information brought by the hyperlinks of the Web; on the other hand, it provides a formal retrieval model that realizes topic-relevant pageranking. Preliminary experimental study on a university website shows that the performance of this formal retrieval model is comparable with the performance of the Googlepsilas University Search.
Keywords :
Internet; information retrieval; search engines; Google University Search; URL; Web content; Web information retrieval; formal retrieval model; hyper-textual language model; hyperlinks; language modeling; semantic information; topic-relevant pageranking; university Web site; Frequency; Indexing; Information retrieval; Interpolation; Maximum likelihood estimation; Sampling methods; Topology; Uniform resource locators; Web pages; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
Type :
conf
DOI :
10.1109/GRC.2008.4664789
Filename :
4664789
Link To Document :
بازگشت