Title :
Discovery of various sequential patterns within top-k from sequential data
Author :
Sakurai, Shigeaki ; Nishizawa, Minoru
Author_Institution :
Big Data Cloud Technol. Center, Toshiba Corp. Cloud & Solutions Co., Kanagawa, Japan
Abstract :
This paper proposes a method that discovers various sequential patterns from sequential data. The sequential data is a set of sequences. Each sequence is a row of item sets. Many previous methods discover frequent sequential patterns from the data. However, the patterns tend to be similar to each other because they are composed of limited items. The patterns do not always correspond to the interests of analysts. Therefore, this paper tackles on the issue discovering various sequential patterns. The proposed method discovers them by evaluating the variety of items and deleting redundant patterns based on three kinds of delete processes. It can discover various patterns within the upper bound for the number of sequential patterns given by the analysts. This paper applies the method to the synthetic sequential data which is characterized by number of items, their kind, and length of sequence. The effect of the method is verified through numerical experiments.
Keywords :
data handling; pattern recognition; frequent sequential patterns discovery; item sets; synthetic sequential data; top-k; Bars; Big data; Companies; Dairy products; Noise; Time measurement; Upper bound; Delete of redundant patterns; Sequential Pattern; Upper pattern constraint; Variety of items;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
DOI :
10.1109/SCIS-ISIS.2014.7044644