DocumentCode
1965232
Title
Database editing metrics for pattern matching
Author
Ruspini, Enrique H. ; Thomere, Jerome ; Wolverton, Michael
Author_Institution
Artificial Intelligence Center, SRI Int., Menlo Park, CA, USA
fYear
2004
fDate
21-22 July 2004
Firstpage
31
Lastpage
38
Abstract
Pattern-matching techniques are important tools to treat problems in several fields, including bioinformatics, case-based reasoning, information retrieval, and pattern recognition. These procedures are important in homeland security and crime prevention because the underlying problems require discovery, in large databases, of instances of patterns known to be associated with illegal activities. While pattern matching may be defined in strict terms as the satisfaction of a logical expression, defining the pattern, by a set of assertions contained in the database, the value of relevant procedures is considerably enhanced by permitting the discovery of approximate matches between database and patterns. The notion of approximate matching is based on soft predicates, which may be satisfied to a degree, rather than the conventional crisp predicates of classical logic. This paper introduces a family of metrics to measure the degree of qualitative match between a database and a pattern, that is, an elastic constraint on database objects and their relations. These metrics provide a formal foundation for the application of graph-editing metrics - measures of the cost associated with graph transformations - to pattern-matching problems. The degree of matching between database and patterns is determined by means of similarity measures that gauge the resemblance between pairs of objects. In our treatment, these measures have a semantic basis stemming from consideration of knowledge structures, such as ontologies, describing common properties of two objects. Approximate pattern matching is treated as the process of modifying databases into a transformed database that strictly satisfies the constraints expressed by the pattern. Associated with each transformation is a measure of admissibility derived from the similarity between the original and transformed databases. The degree of matching of database to pattern is defined as the admissibility of the transformation with highest admissibility value.
Keywords
case-based reasoning; data mining; graph theory; information retrieval; object-oriented databases; ontologies (artificial intelligence); pattern matching; police data processing; relational databases; security; very large databases; admissibility value; approximate matching; bioinformatics; case-based reasoning; classical logic; crime prevention; database editing metrics; database objects; elastic constraint; graph transformations; graph-editing metrics; homeland security; illegal activity pattern; information retrieval; knowledge discovery; knowledge structures; large databases; link analysis; logical expression; object resemblance; ontologies; pattern matching; pattern recognition; semantic similarity measures; Artificial intelligence; Data structures; Databases; Distributed computing; Information retrieval; Logic; Ontologies; Pattern matching; Pattern recognition; Terrorism;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence for Homeland Security and Personal Safety, 2004. CIHSPS 2004. Proceedings of the 2004 IEEE International Conference on
Print_ISBN
0-7803-8381-8
Type
conf
DOI
10.1109/CIHSPS.2004.1360204
Filename
1360204
Link To Document