• DocumentCode
    3316986
  • Title

    Approximate Sequential Patterns for Incomplete Sequence Database Mining

  • Author

    Fiot, Céline ; Laurent, Anne ; Teisseire, Maguelonne

  • Author_Institution
    Univ. of Montpellier II - CNRS, Montpellier
  • fYear
    2007
  • fDate
    23-26 July 2007
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Databases available from many industrial or research fields are often imperfect. In particular, they are most of the time incomplete in the sense that some of the values are missing. When facing this kind of imperfect data, two techniques can be investigated: either using only the available information or estimating the missing values. In this paper we propose an estimation-based approach for sequence mining. This approach considers partial inclusion of an item within a record using fuzzy sets. Experiments run on various synthetic datasets show the feasibility and validity of our proposal as well in terms of quality as in terms of the robustness to the rate of missing values.
  • Keywords
    approximation theory; data mining; database management systems; fuzzy set theory; pattern classification; sequences; sequential estimation; approximate sequential patterns; fuzzy sets; incomplete sequence database mining; missing data estimation; synthetic datasets; Association rules; Data analysis; Data mining; Databases; Fuzzy sets; Information analysis; Mining industry; Pattern analysis; Proposals; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
  • Conference_Location
    London
  • ISSN
    1098-7584
  • Print_ISBN
    1-4244-1209-9
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2007.4295445
  • Filename
    4295445