• DocumentCode
    3423791
  • Title

    Anomaly Detection in XML databases by means of Association Rules

  • Author

    Bruno, Giulia ; Garza, Paolo ; Quintarelli, Elisa ; Rossato, Rosalba

  • Author_Institution
    Politecnico di Torino, Torino
  • fYear
    2007
  • fDate
    3-7 Sept. 2007
  • Firstpage
    387
  • Lastpage
    391
  • Abstract
    Anomaly detection has the double purpose of discovering interesting exceptions and identifying incorrect data in huge amounts of data. Since anomalies are rare events which violate the frequent relationships among data, we propose a method to detect frequent relationships and then extract anomalies. The RADAR (Research of Anomalous Data through Association Rules) method is based on data mining techniques to extract frequent "rules" from datasets, in the form of quasi-functional dependencies. Such dependencies are extracted by using association rules. Given a quasi-functional dependency, we can discover the associated anomalies by querying either the original database or the association rules previously mined. The analysis on this kind of anomaly can either derive the presence of erroneous data or highlight novel information which represents significant outliers of frequent rules. Our method does not require any previous knowledge and directly infers rules from the data. Experiments performed on real XML databases are reported to show the applicability and effectiveness of the proposed approach.
  • Keywords
    XML; data mining; database management systems; security of data; RADAR; XML database anomaly detection; association rules; data mining technique; frequent rule extraction; quasi functional dependency; Association rules; Data analysis; Data mining; Error correction; Event detection; Expert systems; Information analysis; Radar detection; Relational databases; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2007. DEXA '07. 18th International Workshop on
  • Conference_Location
    Regensburg
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-2932-5
  • Type

    conf

  • DOI
    10.1109/DEXA.2007.68
  • Filename
    4312922