DocumentCode :
3334242
Title :
Acronym-Expansion Recognition and Ranking on the Web
Author :
Jain, Alpa ; Cucerzan, Silviu ; Azzam, Saliha
fYear :
2007
fDate :
13-15 Aug. 2007
Firstpage :
209
Lastpage :
214
Abstract :
The paper presents a study on large-scale automatic extraction of acronyms and associated expansions from Web data and from user interactions with this data through Web search engines. We investigate three information sources for extracting and ranking acronym-expansion pairs, as provided by a large-scale search engine: the crawled Web documents, the search engine logs, and the search results. We evaluate and compare the acronym-expansion pairs generated from these sources on three dimensions: (1) the precision and recall of each source; (2) the overlap and inclusion among the acronym-expansion sets; and (3) the rank-order correlation of the ordered expansion sets. Our results show that all three data sources play an important role in building a comprehensive, up-to-date collection of acronym-expansion pairs.
Keywords :
information retrieval systems; search engines; Web search engines; acronym-expansion recognition; rank-order correlation; Computational linguistics; Data mining; Human immunodeficiency virus; Information retrieval; Large-scale systems; Query processing; Search engines; Vaccines; Web pages; Web search
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
1-4244-1500-4
Electronic_ISBN :
1-4244-1500-4
Type :
conf
DOI :
10.1109/IRI.2007.4296622
Filename :
4296622