DocumentCode :
226831
Title :
Filtering operation in cross-document linking
Author :
Mitocariu, Elena
Author_Institution :
Fac. of Comput. Sci., Al.I. Cuza Univ. of Iasi, Iasi, Romania
fYear :
2014
fDate :
24-26 Sept. 2014
Firstpage :
171
Lastpage :
175
Abstract :
In this paper a filtering operation for cross-document connections is presented. Different texts could be connected to each other if they refer to the same entity. Linking different documents only in terms of overlap leads to too many False Positives. The method proposed in this paper has as start point Centering Theory (CT). A list of Principal Centers (Cp) is created for each sentence in the document. This list filters the results in cross-document linking. A bigraph representation is proposed to highlight the connections between texts. A score for classifying the topics is also presented. The score is calculated based on entities occurrence frequency in the whole document. Such an approach eliminated some of the False Positive results (Fps).
Keywords :
graph theory; information filtering; text analysis; bigraph representation; centering theory; cross-document connections; cross-document linking; false positives; filtering operation; principal centers; Filtering theory; Information filters; Joining processes; Text analysis; XML; centering theory; cross-document analysis; topics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Information Technologies (ISCIT), 2014 14th International Symposium on
Conference_Location :
Incheon
Type :
conf
DOI :
10.1109/ISCIT.2014.7011894
Filename :
7011894
Link To Document :
بازگشت