DocumentCode
259387
Title
An Approach to Improve Precision and Recall for Ad-hoc Information Retrieval Using SBIR Algorithm
Author
Selvi, R. Thamarai ; Raj, E. George Dharma Prakash
Author_Institution
Dept. of Comput. Applic., Bishop Heber Coll., Trichirappalli, India
fYear
2014
fDate
Feb. 27 2014-March 1 2014
Firstpage
137
Lastpage
141
Abstract
Information Retrieval is a process of finding the documents in a collection based on a specific topic. The information need is expressed by the user as a query. Documents that satisfy the given query in the judgment of the user are said to be relevant. The documents that are not of the given topic are said to be non-relevant. An IR engine may use the query to classify the documents in a collection, returning to the user a subset of documents that satisfy some classification criterion. There are several search engines to find information in the given repositories containing large amounts of unstructured form of text data. However, the task of ad hoc information retrieval is, finding documents within a corpus like Bible, that are relevant to the user remains a hard challenge. Sometimes the relevant documents may not contain the specified keyword. The lack of the given term in a document does not necessarily mean that the document is not a relevant. Because more than one terms can be semantically similar although they are lexicographically different. In this paper a new algorithm called "Semantic based Boolean Information Retrieval" (SBIR) is proposed to retrieve the documents with semantically similar terms to enhance the performance of Boolean Information Model by improving the recall and precision.
Keywords
pattern classification; query processing; search engines; semantic Web; text analysis; word processing; IR engine; SBIR algorithm; ad hoc information retrieval; boolean information model; corpus; document classification criterion; document retrieval; document subset; lexicographic document; precision and recall approach; query processing; search engine; semantic based boolean information retrieval; semantic similarity; unstructured text data; Algorithm design and analysis; Databases; Educational institutions; Search engines; Semantics; Vectors; Boolean Information Retrieval; Information Retrieval; Semantic; Stemming Algorithm; WordNet;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing and Communication Technologies (WCCCT), 2014 World Congress on
Conference_Location
Trichirappalli
Print_ISBN
978-1-4799-2876-7
Type
conf
DOI
10.1109/WCCCT.2014.68
Filename
6755122
Link To Document