• DocumentCode
    3500696
  • Title

    A hierarchical approach to represent relational data applied to clustering tasks

  • Author

    Xavier, Joao C. ; Canuto, Anne M P ; Freitas, Alex A. ; Gonçalves, Luiz M G ; Silla, Carlos N., Jr.

  • Author_Institution
    Comput. & Autom. Eng. Dept., Fed. Univ. of Rio Grande do Norte (UFRN), Natal, Brazil
  • fYear
    2011
  • fDate
    July 31 2011-Aug. 5 2011
  • Firstpage
    3055
  • Lastpage
    3062
  • Abstract
    Nowadays, the representation of many real word problems needs to use some type of relational model. As a consequence, information used by a wide range of systems has been stored in multi relational tables. However, from a data mining point of view, it has been a problem, since most of the traditional data mining algorithms have not been originally proposed to handle this type of data without discarding relationship information. Aiming to ameliorate this problem, we propose a hierarchical approach for handling relational data. In this approach the relational data is converted into a hierarchical structure (the main table as the root and the relations as the nodes). This hierarchical way to represent relational data can be used either for classification or clustering purposes. In this paper, we will use it in clustering algorithms. In order to do so, we propose a hierarchical distance metric to compute the similarity between the tables. In the empirical analysis, we will apply the proposed approach in two well-known clustering algorithms (k-means and agglomerative hierarchical). Finally, this paper also compares the effectiveness of our approach with one existing relational approach.
  • Keywords
    data handling; data mining; data structures; pattern clustering; clustering algorithm; clustering task; data mining algorithm; hierarchical structure; multirelational tables; relational data handling; relational data representation; relational model; Clustering algorithms; Data mining; Indexes; Measurement; Relational databases; Sediments;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2011 International Joint Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4244-9635-8
  • Type

    conf

  • DOI
    10.1109/IJCNN.2011.6033624
  • Filename
    6033624