DocumentCode
3316986
Title
Approximate Sequential Patterns for Incomplete Sequence Database Mining
Author
Fiot, Céline ; Laurent, Anne ; Teisseire, Maguelonne
Author_Institution
Univ. of Montpellier II - CNRS, Montpellier
fYear
2007
fDate
23-26 July 2007
Firstpage
1
Lastpage
6
Abstract
Databases available from many industrial or research fields are often imperfect. In particular, they are most of the time incomplete in the sense that some of the values are missing. When facing this kind of imperfect data, two techniques can be investigated: either using only the available information or estimating the missing values. In this paper we propose an estimation-based approach for sequence mining. This approach considers partial inclusion of an item within a record using fuzzy sets. Experiments run on various synthetic datasets show the feasibility and validity of our proposal as well in terms of quality as in terms of the robustness to the rate of missing values.
Keywords
approximation theory; data mining; database management systems; fuzzy set theory; pattern classification; sequences; sequential estimation; approximate sequential patterns; fuzzy sets; incomplete sequence database mining; missing data estimation; synthetic datasets; Association rules; Data analysis; Data mining; Databases; Fuzzy sets; Information analysis; Mining industry; Pattern analysis; Proposals; Robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems Conference, 2007. FUZZ-IEEE 2007. IEEE International
Conference_Location
London
ISSN
1098-7584
Print_ISBN
1-4244-1209-9
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZY.2007.4295445
Filename
4295445
Link To Document