• DocumentCode
    3323675
  • Title

    Optimizing Star Join Queries for Data Warehousing in Microsoft SQL Server

  • Author

    Galindo-Legaria, Cesar A. ; Grabs, Torsten ; Gukal, Sreenivas ; Herbert, Steve ; Surna, Aleksandras ; Wang, Shirley ; Yu, Wei ; Zabback, Peter ; Zhang, Shin

  • Author_Institution
    Microsoft Corp., Redmond, WA
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    1190
  • Lastpage
    1199
  • Abstract
    As mainstream data warehouses are growing into the multi-terabyte range, adequate performance for decision support queries remains challenging for database query processors. Proper choice of query plan is essential in data warehouses where fact tables often store billions of rows. This paper discusses query optimization and execution strategies that Microsoft SQL Server employs for decision support queries in dimensionally modeled relational data warehouses. Our approach is based on pattern matching to detect typical star query patterns. When matching the pattern, the optimizer generates additional query plan alternatives specifically optimized for data warehouse performance. For high selectivity queries, the plans use nested loops joins and seeks. Medium selectivity queries in turn rely on right-deep hash joins with bitmap filters. Bitmap filters perform semi-join reductions to efficiently prune out non-qualifying rows early. Final plan choice is left for cost-based optimization which also compares the data warehouse specific plans against conventional query plans. We conducted an extensive experimental investigation using both synthetic workloads and several customer workloads. As our results show, the new plan shapes and execution strategies yield significant performance improvements across the targeted workloads as compared to earlier versions of Microsoft SQL Server.
  • Keywords
    SQL; data warehouses; decision support systems; pattern matching; query processing; relational databases; bitmap filters; data warehousing; database query processors; decision support query; mainstream data warehouses; microsoft SQL server; multiterabyte range; pattern matching; query optimization; relational data warehouses; selectivity query; star join query; star query patterns; Business; Companies; Data warehouses; Filters; Pattern matching; Query processing; Relational databases; Shape; Transaction databases; Warehousing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497528
  • Filename
    4497528