DocumentCode
3716484
Title
Enhancing Aggregation over Uncertain Databases
Author
Nermin Abdelhakim Othman;Ahmed Sharaf Eldin;Doaa Saad Elzanfaly
Author_Institution
Fac. of Comput. &
fYear
2015
Firstpage
132
Lastpage
139
Abstract
Queries with aggregation represent an important aspect in database systems. They are widely used in online analytical processing, decision support systems, and data analytics. Aggregate functions usually perform calculations on a set of values of a particular column and return a single summarized value. However, handling aggregate functions becomes a challenge when dealing with uncertain data as there can be an exponential number of possible instances, with potentially different aggregation results for each one. The aim of this paper is to enhance aggregate queries over uncertain databases through a twofold aspect: First, proposing a Probability-Based Aggregation (PBA) technique that considers the probability of each instance in the database. Second, proposing a Probability-Based Entropy (PBE) technique that introduces a new class of aggregate functions to measure the level of uncertainty over databases. Entropy and information gain are two well-known measures stemmed from the information theory but can be used in uncertain databases. The two measures, if used as two aggregate functions in uncertain databases, will allow for more data analytics and mining. Experimental results show that the proposed aggregation technique (PBA) outperforms other similar techniques in terms of precision, recall, uncertainty density, and answer decisiveness. Moreover, using the proposed probabilistic entropy function (PBE) which considers the probability of each instance while calculating the entropy helps in identifying the threshold that gives the maximum information gain.
Keywords
"Aggregates","Entropy","Probabilistic logic","Databases","Uncertainty","Data models","Decision trees"
Publisher
ieee
Conference_Titel
Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/CIT/IUCC/DASC/PICOM.2015.21
Filename
7363062
Link To Document