DocumentCode
480747
Title
Integrating Structure in the Probabilistic Model for Information Retrieval
Author
Gery, Mathias ; Largeron, Christine ; Thollard, Franck
Author_Institution
Univ. de Lyon, Lyon
Volume
1
fYear
2008
fDate
9-12 Dec. 2008
Firstpage
763
Lastpage
769
Abstract
In databases or in the World Wide Web, many documents are in a structured format (e.g. XML). We propose in this article to extend the classical IR probabilistic model in order to take into account the structure through the weighting of tags. Our approach includes a learning step in which the weight of each tag is computed. This weight estimates the probability that the tag distinguishes the terms which are the most relevant. Our model has been evaluated on a large collection during INEX IR evaluation campaigns.
Keywords
information retrieval; learning (artificial intelligence); probability; World Wide Web; classical information retrieval probabilistic model; document retrieval; integration structure; learning step; weight estimation; Deductive databases; HTML; Indexing; Information retrieval; Intelligent agent; Intelligent structures; Internet; Markup languages; Web sites; XML; XML; probabilistic model; structure; tags;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-0-7695-3496-1
Type
conf
DOI
10.1109/WIIAT.2008.346
Filename
4740545
Link To Document