• DocumentCode
    783577
  • Title

    Using evolutionary algorithms for defining the sampling policy of complex n-partite networks

  • Author

    Goldstein, Michel L. ; Yen, Gary G.

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Oklahoma State Univ., Stillwater, OK, USA
  • Volume
    17
  • Issue
    6
  • fYear
    2005
  • fDate
    6/1/2005 12:00:00 AM
  • Firstpage
    762
  • Lastpage
    773
  • Abstract
    N-partite networks are natural representations of complex multientity databases. However, processing these networks can be a highly memory and computation-intensive task, especially when positive correlation exists between the degrees of vertices from different partitions. In order to improve the scalability of this process, this paper proposes two algorithms that make use of sampling for obtaining less expensive approximate results. The first algorithm is optimal for obtaining homogeneous discovery rates with a low memory requirement, but can be very slow in cases where the combined branching factor of these networks is too large. A second algorithm that incorporates concepts from evolutionary computation aims toward dealing with this slow convergence in the case when it is more interesting to increase approximation convergence speed of elements with high feature values. This algorithm makes use of the positive correlation between "local" branching factors and the feature values. Two applications examples are demonstrated in searching for most influential authors in collections of journal articles and in analyzing most active earthquake regions from a collection of earthquake events.
  • Keywords
    approximation theory; convergence; data mining; distributed databases; earthquakes; evolutionary computation; graph theory; sampling methods; N-partite networks; active earthquake regions; approximation convergence speed; evolutionary algorithms; graphic-structured database; journal articles; local branching factors; memory requirement; multientity databases; Approximation algorithms; Computer networks; Convergence; Distributed databases; Earthquakes; Evolutionary computation; Partitioning algorithms; Sampling methods; Scalability; Spatial databases; Index Terms- N-partite network; evolutionary algorithm; graphic-structured database.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2005.100
  • Filename
    1423977