Title :
MWASP: Multiple-Width Approximate Sequential Patterns
Author :
Yip, Kelly Kingchi ; Nembhard, David A.
Author_Institution :
Dept. of Ind. & Manuf. Eng., Pennsylvania State Univ., University Park, PA
fDate :
March 30 2009-April 2 2009
Abstract :
Time series data are often found in diverse fields, such as science, business, medicine and engineering. In this paper, we focus on sequential pattern mining for categorical time series datasets that contain multiple independent timeseries. Frequent patterns are considered important in many applications. However, collected data are generally afflicted with noise. Conventional sequential pattern mining methods that use exact matching may meet difficulties in mining databases with long sequences and noise. We propose a framework that uncovers frequent approximate sequential patterns with multiple widths. A mined pattern in this framework is a representative of a group of sequences (with various widths) that follow the pattern´s event flow order. The presentation of the patterns also gives insight into the occurrence of the pattern longitudinally and across the population. The pattern can be recognized as a common pattern across the multiple time series, time, or both. We name this novel framework MWASP: Multiple-Width Approximate Sequential Patterns.
Keywords :
data mining; pattern matching; statistical databases; time series; MWASP; database mining; multiple-width approximate sequential pattern mining; pattern matching; pattern recognition; time series database; Association rules; DNA; Data mining; Databases; Frequency; Genetic mutations; Pattern analysis; Pattern matching; Pattern recognition; Sequences;
Conference_Titel :
Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2765-9
DOI :
10.1109/CIDM.2009.4938665