Title :
Linking Documents to Encyclopedic Knowledge
Author :
Csomai, Andras ; Mihalcea, Rada
Author_Institution :
Univ. of North Texas, Denton, TX
Abstract :
Wikipedia has become one of the largest online repositories of encyclopedic knowledge. Wikipedia editions are available for more than 200 languages, ranging in size from a few pages to more than 1 million articles per language. Embedded in each Wikipedia article is an abundance of links connecting the most important words or phrases in the text to other pages, thereby letting users quickly access additional information. The automatic text-annotation system described here combines keyword extraction and word-sense disambiguation to identify relevant links from a document to Wikipedia pages.
Keywords :
Web sites; encyclopaedias; information retrieval; natural language processing; text analysis; Wikipedia; automatic text-annotation system; encyclopedic knowledge; keyword extraction; online repository; word-sense disambiguation; computers in education; text annotation
Journal_Title :
Intelligent Systems, IEEE
DOI :
10.1109/MIS.2008.86