Title :
A new Granular Computing approach for sequences representation and classification
Author :
Rizzi, Antonello ; Del Vescovo, Guido ; Livi, Lorenzo ; Mascioli, Fabio Massimo Frattale
Author_Institution :
Dept. of Inf. Eng., Electron. & Telecommun., SAPIENZA Univ. of Rome, Rome, Italy
Abstract :
In this paper we present an innovative procedure for sequence mining and representation. It can be used as its own in Data Mining problems or as the core of a classification system based on a Granular Computing approach to represent sequences in a suited embedding space. By adopting an inexact sequence matching procedure, the algorithm is able to extract a symbols alphabet of frequent subsequences to be used as prototypes for the embedding stage. Experimental evaluation over both synthetically generated and biological datasets confirms that the modeling system is able to synthesize effective models when facing even complex and noisy problems defined by frequency-based classification rules.
Keywords :
data mining; granular computing; pattern classification; pattern matching; biological dataset; data mining problem; frequency-based classification rule; granular computing approach; inexact sequence matching procedure; innovative procedure; modeling system; sequence classification; sequence mining; sequence representation; symbols alphabet extraction; Clustering algorithms; Complexity theory; Computational modeling; Data mining; Data models; Training;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252680