DocumentCode :
1832752
Title :
ReadFast: Optimizing structural search relevance for big biomedical text
Author :
Gubanov, Michael ; Pyayt, Anna
Author_Institution :
Massachusetts Inst. of Technol., Cambridge, MA, USA
fYear :
2013
fDate :
14-16 Aug. 2013
Firstpage :
717
Lastpage :
719
Abstract :
While the problem to find needed information on the Web is critical, it is arguably much less pressing nowadays than it was over a decade ago when the Web was emerging. Back then it was much more difficult to find a Web resource of interest, because the search engines were in their infancy covering much lesser portion of the Web by their indices, armed with embryonic page ranking algorithms. Now, Web-search is by far not perfect yet, but definitely went a long way to become an everyday “go-to” resource for millions of people. By contrast, access to textual information is not even close to what Web-search algorithms offer today. In fact, it does not differ much from what everyone had a decade ago. That is keyword-search (exact substring match) is often the only way to find needle in a haystack in most modern word processors and text corpora search engines. Here we demonstrate ReadFast - a system, capable to extract certain structure from any natural language text corpus and use it to provide more relevant search results than keyword-search for specific classes of queries. Our evaluation justified significant relevance gain (20-30%) for two large Biomedical text corpora.
Keywords :
bioinformatics; query processing; relevance feedback; search engines; string matching; text analysis; READFAST system; Web resource; Web-search algorithms; big-biomedical text; biomedical text corpora; keyword-search; natural language text corpus; page ranking algorithms; query classes; relevance gain; structural search relevance optimization; substring matching; text corpora search engines; textual information access; word processors; Indexes; Natural languages; Navigation; Pain; Radio frequency; Search engines; Standards;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
Conference_Location :
San Francisco, CA
Type :
conf
DOI :
10.1109/IRI.2013.6642540
Filename :
6642540
Link To Document :
بازگشت