DocumentCode
226831
Title
Filtering operation in cross-document linking
Author
Mitocariu, Elena
Author_Institution
Fac. of Comput. Sci., Al.I. Cuza Univ. of Iasi, Iasi, Romania
fYear
2014
fDate
24-26 Sept. 2014
Firstpage
171
Lastpage
175
Abstract
In this paper a filtering operation for cross-document connections is presented. Different texts could be connected to each other if they refer to the same entity. Linking different documents only in terms of overlap leads to too many False Positives. The method proposed in this paper has as start point Centering Theory (CT). A list of Principal Centers (Cp) is created for each sentence in the document. This list filters the results in cross-document linking. A bigraph representation is proposed to highlight the connections between texts. A score for classifying the topics is also presented. The score is calculated based on entities occurrence frequency in the whole document. Such an approach eliminated some of the False Positive results (Fps).
Keywords
graph theory; information filtering; text analysis; bigraph representation; centering theory; cross-document connections; cross-document linking; false positives; filtering operation; principal centers; Filtering theory; Information filters; Joining processes; Text analysis; XML; centering theory; cross-document analysis; topics;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications and Information Technologies (ISCIT), 2014 14th International Symposium on
Conference_Location
Incheon
Type
conf
DOI
10.1109/ISCIT.2014.7011894
Filename
7011894
Link To Document