DocumentCode :
259387
Title :
An Approach to Improve Precision and Recall for Ad-hoc Information Retrieval Using SBIR Algorithm
Author :
Selvi, R. Thamarai ; Raj, E. George Dharma Prakash
Author_Institution :
Dept. of Comput. Applic., Bishop Heber Coll., Trichirappalli, India
fYear :
2014
fDate :
Feb. 27 2014-March 1 2014
Firstpage :
137
Lastpage :
141
Abstract :
Information Retrieval is a process of finding the documents in a collection based on a specific topic. The information need is expressed by the user as a query. Documents that satisfy the given query in the judgment of the user are said to be relevant. The documents that are not of the given topic are said to be non-relevant. An IR engine may use the query to classify the documents in a collection, returning to the user a subset of documents that satisfy some classification criterion. There are several search engines to find information in the given repositories containing large amounts of unstructured form of text data. However, the task of ad hoc information retrieval is, finding documents within a corpus like Bible, that are relevant to the user remains a hard challenge. Sometimes the relevant documents may not contain the specified keyword. The lack of the given term in a document does not necessarily mean that the document is not a relevant. Because more than one terms can be semantically similar although they are lexicographically different. In this paper a new algorithm called "Semantic based Boolean Information Retrieval" (SBIR) is proposed to retrieve the documents with semantically similar terms to enhance the performance of Boolean Information Model by improving the recall and precision.
Keywords :
pattern classification; query processing; search engines; semantic Web; text analysis; word processing; IR engine; SBIR algorithm; ad hoc information retrieval; boolean information model; corpus; document classification criterion; document retrieval; document subset; lexicographic document; precision and recall approach; query processing; search engine; semantic based boolean information retrieval; semantic similarity; unstructured text data; Algorithm design and analysis; Databases; Educational institutions; Search engines; Semantics; Vectors; Boolean Information Retrieval; Information Retrieval; Semantic; Stemming Algorithm; WordNet;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing and Communication Technologies (WCCCT), 2014 World Congress on
Conference_Location :
Trichirappalli
Print_ISBN :
978-1-4799-2876-7
Type :
conf
DOI :
10.1109/WCCCT.2014.68
Filename :
6755122
Link To Document :
بازگشت