DocumentCode :
3141059
Title :
On the use of density distribution of keywords for automated generation of hypertext links from arbitrary parts of documents
Author :
Kise, Koichi ; Mizuno, Hiroyuki ; Yamaguchi, Masashi ; Matsumoto, Keinosuke
Author_Institution :
Dept. of Comput. & Syst. Sci., Osaka Prefecture Univ., Japan
fYear :
1999
fDate :
20-22 Sep 1999
Firstpage :
301
Lastpage :
304
Abstract :
This paper presents a method for automatically generating hypertext links for electronic documents. The goal is to generate links from an arbitrary part of a document (the source of a link) to the relevant parts of target documents (the destinations). To achieve this goal, we assume that parts of documents that are relevant to each other often share words. To extract parts that densely contain the words of a source (keywords), we employ density distributions of keywords. This enables us to determine destinations simply by extracting parts whose density exceeds a threshold. Experiments on generating links from figures/tables to parts of documents, as well as from text to parts of different documents, show that our method with the optimal parameters yields a recall of 60% and a precision of 50%.
Keywords :
hypermedia; image recognition; visual databases; arbitrary parts of documents; density distribution; hypertext links; keywords; precision; recall; Educational institutions; Hoses; Information analysis; Read only memory; Telecommunications; Text analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
Type :
conf
DOI :
10.1109/ICDAR.1999.791784
Filename :
791784