• DocumentCode
    2405193
  • Title

    Decoupled query optimization for federated database systems

  • Author

    Deshpande, Amol ; Hellerstein, Joseph M.

  • Author_Institution
    Div. of Comput. Sci., California Univ., Berkeley, CA, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    716
  • Lastpage
    727
  • Abstract
    We study the problem of query optimization in federated relational database systems. The nature of federated databases explicitly decouples many aspects of the optimization process, often making it imperative for the optimizer to consult underlying data sources while doing cost-based optimization. This not only increases the cost of optimization, but also changes the trade-offs involved in the optimization process significantly. The dominant cost in the decoupled optimization process is the "cost of costing" that traditionally has been considered insignificant. The optimizer can only afford a few rounds of messages to the underlying data sources and hence the optimization techniques in this environment must be geared toward gathering all the required cost information with minimal communication. In this paper, we explore the design space for a query optimizer in this environment and demonstrate the need for decoupling various aspects of the optimization process. We present minimum-communication decoupled variants of various query optimization techniques, and discuss tradeoffs in their performance in this scenario. We have implemented these techniques in the Cohera federated database system and our experimental results, somewhat surprisingly, indicate that a simple two-phase optimization scheme performs fairly well as long as the physical database design is known to the optimizer, though more aggressive algorithms are required otherwise
  • Keywords
    distributed databases; query processing; relational databases; Cohera federated database; decoupled optimization; federated databases; federated relational database systems; query optimization; query optimizer; Algorithm design and analysis; Computer science; Corporate acquisitions; Cost function; Database systems; Design optimization; Internet; Query processing; Relational databases; Space exploration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2002. Proceedings. 18th International Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-1531-2
  • Type

    conf

  • DOI
    10.1109/ICDE.2002.994788
  • Filename
    994788