• DocumentCode
    457371
  • Title

    Statistical Borders for Incremental Mining

  • Author

    Nock, Richard ; Laur, Pierre-Alain ; Symphor, Jean-Emile

  • Volume
    3
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    212
  • Lastpage
    215
  • Abstract
    Data streams - dataflows in which the information arrives in a timely manner - have recently become a major subfield of knowledge extraction. One of their most important singularity is that only a part of the information remains available at a time, which makes it necessary to cope with uncertainty. In this paper, we introduce a novel statistical approach which biases the initial support for patterns mining. This approach holds the advantage to maximize one of two parameters (precision or recall) chosen by the user, while guaranteeing a statistical near optimal degradation of the other. This leads us to introduce the statistical borders, the relevant sets of frequent patterns in incremental mining of data streams. Experiments performed on sequential patterns demonstrate the potential of this approach
  • Keywords
    data mining; pattern recognition; statistical analysis; data stream; dataflow; frequent pattern mining; incremental mining; knowledge extraction; sequential patterns; statistical border; statistical near optimal degradation; Data mining; Databases; Degradation; Frequency; History; Knowledge management; Pattern recognition; Probability; Sampling methods; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2521-0
  • Type

    conf

  • DOI
    10.1109/ICPR.2006.1076
  • Filename
    1699504