• DocumentCode
    774948
  • Title

    Allocating data and operations to nodes in distributed database design

  • Author

    March, Salvatore T. ; Rho, Sangkyu

  • Author_Institution
    Carlson Sch. of Manage., Minnesota Univ., Minneapolis, MN, USA
  • Volume
    7
  • Issue
    2
  • fYear
    1995
  • fDate
    4/1/1995 12:00:00 AM
  • Firstpage
    305
  • Lastpage
    317
  • Abstract
    The allocation of data and operations to nodes in a computer communications network is a critical issue in distributed database design. An efficient distributed database design must trade off performance and cost among retrieval and update activities at the various nodes. It must consider the concurrency control mechanism used as well as capacity constraints at nodes and on links in the network. It must determine where data will be allocated, the degree of data replication, which copy of the data will be used for each retrieval activity, and where operations such as select, project, join, and union will be performed. We develop a comprehensive mathematical modeling approach for this problem. The approach first generates units of data (file fragments) to be allocated from a logical data model representation and a characterization of retrieval and update activities. Retrieval and update activities are then decomposed into relational operations on these fragments. Both fragments and operations on them are then allocated to nodes using a mathematical modeling approach. The mathematical model considers network communication, local processing, and data storage costs. A genetic algorithm is developed to solve this mathematical formulation
  • Keywords
    concurrency control; data handling; data structures; database theory; distributed databases; genetic algorithms; local area networks; mathematical analysis; modelling; query processing; capacity constraints; computer communications network; concurrency control mechanism; cost; data allocation; data replication; data units; distributed database design; file fragments; join operation; links; logical data model representation; mathematical modeling; nodes; operation allocation; performance; project operation; relational operations; retrieval activities; select operation; union operation; update activities; Character generation; Communication networks; Computer networks; Concurrency control; Costs; Data models; Distributed computing; Distributed databases; Information retrieval; Mathematical model;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.382299
  • Filename
    382299