Title :
Fuzzy Orders-of-Magnitude-Based Link Analysis for Qualitative Alias Detection
Author :
Shen, Qiang ; Boongoen, Tossapon
Author_Institution :
Dept. of Comput. Sci., Aberystwyth Univ., Aberystwyth, UK
fDate :
4/1/2012 12:00:00 AM
Abstract :
Alias detection has been the significant subject being extensively studied for several domain applications, especially intelligence data analysis. Many preliminary methods rely on text-based measures, which are ineffective with false descriptions of terrorists´ name, date-of-birth, and address. This barrier may be overcome through link information presented in relationships among objects of interests. Several numerical link-based similarity techniques have proven effective for identifying similar objects in the Internet and publication domains. However, as a result of exceptional cases with unduly high measure, these methods usually generate inaccurate similarity descriptions. Yet, they are either computationally inefficient or ineffective for alias detection with a single-property based model. This paper presents a novel orders-of-magnitude based similarity measure that integrates multiple link properties to refine the estimation process and derive semantic-rich similarity descriptions. The approach is based on order-of-magnitude reasoning with which the theory of fuzzy set is blended to provide quantitative semantics of descriptors and their unambiguous mathematical manipulation. With such explanatory formalism, analysts can validate the generated results and partly resolve the problem of false positives. It also allows coherent interpretation and communication within a decision-making group, using this computing-with-word capability. Its performance is evaluated over a terrorism-related data set, with further generalization over publication and email data collections.
Keywords :
data analysis; decision making; fuzzy set theory; Internet; computing-with-word capability; decision-making group; estimation process; fuzzy orders-of-magnitude; fuzzy set; intelligence data analysis; link analysis; link property; numerical link-based similarity technique; order-of-magnitude reasoning; orders-of-magnitude based similarity measure; qualitative alias detection; semantic-rich similarity description; similar object identification; single-property based model; terrorism-related data set; text-based measures; unambiguous mathematical manipulation; Algebra; Atmospheric measurements; Cognition; Electronic mail; Information retrieval; Particle measurements; Semantics; Orders-of-magnitude reasoning; alias detection; fuzzy set; intelligence data.; link analysis; similarity measure;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2010.255