• DocumentCode
    3584660
  • Title

    A Framework for Identifying Skylines over Incomplete Data

  • Author

    Alwan, Ali A. ; Ibrahim, Hamidah ; Udzir, Nur Izura

  • Author_Institution
    Dept. of Comput. Sci., Int. Islamic Univ. Malaysia, Kuala Lumpur, Malaysia
  • fYear
    2014
  • Firstpage
    79
  • Lastpage
    84
  • Abstract
    Skyline queries provide a flexible query operator that returns data items (skylines) which are not being dominated by other data items in all dimensions (attributes) of the database. Most of the existing skyline techniques determine the skylines by assuming that the values of dimensions for every data item are available (complete). However, this assumption is not always true particularly for multidimensional database as some values may be missing. The incompleteness of data leads to the loss of the transitivity property of skyline technique and results into failure in test dominance as some data items are incomparable to each other. Furthermore, incompleteness of data influences negatively on the process of finding skylines, leading to high overhead, due to exhaustive pair wise comparisons between the data items. This paper proposed a framework to process skyline queries for incomplete data with the aim of avoiding the issue of cyclic dominance in deriving skylines. The proposed framework for identifying skylines for incomplete data consists of four components, namely: Data Clustering Builder, Group Constructor and Local Skylines Identifier, k-dom Skyline Generator, and Incomplete Skylines Identifier. Including these processes in the proposed framework has optimized the process of identifying skylines in incomplete database by reducing the necessary number of pair wise comparison through eliminating the dominated data items as early as possible before applying the skyline technique.
  • Keywords
    pattern clustering; query processing; cyclic dominance; data clustering builder; data incompleteness; data item; group constructor; incomplete skylines identifier; k-dom skyline generator; local skylines identifier; multidimensional database; skyline queries identification; skyline techniques; Algorithm design and analysis; Computer science; Distributed databases; Generators; Indexes; incomplete data; preference queries; query processing; skyline queries;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Science Applications and Technologies (ACSAT), 2014 3rd International Conference on
  • Type

    conf

  • DOI
    10.1109/ACSAT.2014.21
  • Filename
    7076873