Title :
Knowledge Discovery across Documents through Concept Chain Queries
Author :
Jin, Wei ; Srihari, Rohini K.
Author_Institution :
New York State Univ., Buffalo, NY
Abstract :
This paper focuses on detecting links between two concepts (e.g., two persons) across text documents. We interpret such a query as finding the most meaningful evidence trail across documents that connects these two concepts. Here we propose a fast and efficient algorithm to perform this task. It is based on the idea of hypothesis generation originated by Swanson, called "complementary structures in disjoint literatures" (CSD). We adapted the technique by (i) developing an alternate method of generating semantic profiles and (ii) extending the technique to generate concept chains. A counterterrorism corpus is used to evaluate the performance of this approach, and the results demonstrate the effectiveness of our algorithm.
Keywords :
data mining; document handling; complementary structures; concept chain queries; concept chains; counterterrorism corpus; disjoint literatures; document knowledge discovery; hypothesis generation; meaningful evidence trail; semantic profiles; text documents; Algorithm design and analysis; Document handling; Fuels; Joining processes; Law; Legal factors; Social network services; Text mining; Text recognition; Weapons;
Conference_Title :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
DOI :
10.1109/ICDMW.2006.105