Title :
An approach to graph mining using gSpan algorithm
Author :
Priyadarshini, Sadhana ; Mishra, Debahuti
Author_Institution :
Dept. of Comput. Applic., SOA Univ., Bhubaneswar, India
Abstract :
Complex information is prevailing in every sphere of activities This call for the necessity to represent, store and manipulate complex information (e.g. detects correlations and patterns, discover explanations, construct predictive models etc.). Furthermore, being autonomously maintained, data can change in time or even change its base structure, making it difficult for representation systems to accommodate these changes. Current representation and storage systems are not very flexible in dealing with big changes and also they are not concerned with the ability of performing complex data manipulations of the sort mentioned above. On the other hand, data manipulation systems cannot easily work with structural or relational data, but just with flat data representations. We want to bridge the gap between the two, by introducing a new type of database structure, called Graph Databases (GDB), based on a natural graph representation. Our Graph Databases are able to represent as graphs any kind of information, naturally accommodate changes in data, and they also make easier for Machine Learning methods to use the stored information. Graph mining is the process of extracting sub graphs from graph database or database of graphs. The problem of discovering frequent sub graphs of graph data can be solved by constructing a candidate set of sub graphs first, and then, identifying within this candidate set those sub graphs that meet the frequent sub graph requirement.
Keywords :
data mining; database management systems; learning (artificial intelligence); Gspan algorithm; data manipulation systems; graph database; graph mining; machine learning methods; Algorithm design and analysis; Classification algorithms; Communications technology; Computers; Data mining; Databases; Transforms; Frequent sub graph; Label graph; Sub graph Isomorphism;
Conference_Titel :
Computer and Communication Technology (ICCCT), 2010 International Conference on
Conference_Location :
Allahabad, Uttar Pradesh
Print_ISBN :
978-1-4244-9033-2
DOI :
10.1109/ICCCT.2010.5640496