• DocumentCode
    3143365
  • Title

    Bidirectional mining of non-redundant recurrent rules from a sequence database

  • Author

    Lo, David ; Ding, Bolin ; Lucia ; Han, Jiawei

  • Author_Institution
    Sch. of Inf. Syst., Singapore Manage. Univ., Singapore, Singapore
  • fYear
    2011
  • fDate
    11-16 April 2011
  • Firstpage
    1043
  • Lastpage
    1054
  • Abstract
    We are interested in scalable mining of a non-redundant set of significant recurrent rules from a sequence database. Recurrent rules have the form “whenever a series of precedent events occurs, eventually a series of consequent events occurs”. They are intuitive and characterize behaviors in many domains. An example is the domain of software specification, in which the rules capture a family of properties beneficial to program verification and bug detection. We enhance a past work on mining recurrent rules by Lo, Khoo, and Liu to perform mining more scalably. We propose a new set of pruning properties embedded in a new mining algorithm. Performance and case studies on benchmark synthetic and real datasets show that our approach is much more efficient and outperforms the state-of-the-art approach in mining recurrent rules by up to two orders of magnitude.
  • Keywords
    data mining; formal specification; bidirectional mining; bug detection; nonredundant recurrent rule mining; program verification; pruning property; sequence database; software specification domain; Association rules; Complexity theory; Databases; Redundancy; Semantics; Software;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2011 IEEE 27th International Conference on
  • Conference_Location
    Hannover
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4244-8959-6
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2011.5767848
  • Filename
    5767848