• DocumentCode
    265244
  • Title

    JOMR: Multi-join optimizer technique to enhance map-reduce job

  • Author

    Shanoda, Mina Samir ; Senbel, Samah Ahmed ; Khafagy, Mohamed Helmy

  • Author_Institution
    Coll. of Comput. & Inf. Technol., Arab Acad. for Sci. & Technol., Cairo, Egypt
  • fYear
    2014
  • fDate
    15-17 Dec. 2014
  • Abstract
    Map-Reduce is a programming model and execution an environment developed by Google to process very large amounts of data. Query optimizer is needed to find more efficient plans for declarative SQL query. In classic database: join algorithms are optimized to execute the entire query result, but they ignore the importance of tables order especially in multi-join query. But we can see that the orders for tables are an important factor to get the best performance of a query plan and will be very effective in performance when join tables content huge number of rows in addition to more than one join operation. In this paper we proposed a new technique called JOMR (Join Order In Map-Reduce) that optimizes and enhances Map-Reduce job. This technique uses enhanced parallel Travel Salesman Problem (TSP) using Map-Reduce for improving the performance of query plans according to change the order for join tables. Also we build a cost model that supports our algorithm to find best join order. We will focus on Hive especially multi-join query and our experiments result for JOMR algorithm proving the effectiveness of our query optimizer and this performance is improved more when increasing the number of join and size of data.
  • Keywords
    SQL; parallel programming; query processing; Google; Hive query; JOMR technique; MapReduce job enhancement; MapReduce programming model; Structured Query Language; TSP; cost model; declarative SQL query; multijoin optimizer technique; multijoin query; parallel travel salesman problem; query optimization; query plan; Algorithm design and analysis; Distributed computing; Educational institutions; Informatics; Mathematical model; Partitioning algorithms; Query processing; Column Statistics; Hadoop; Hive; Join; Map-Reduce; Query Optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics and Systems (INFOS), 2014 9th International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-977-403-689-7
  • Type

    conf

  • DOI
    10.1109/INFOS.2014.7036682
  • Filename
    7036682