DocumentCode
108879
Title
Consensus-Based Ranking of Multivalued Objects: A Generalized Borda Count Approach
Author
Ying Zhang ; Wenjie Zhang ; Jian Pei ; Xuemin Lin ; Qianlu Lin ; Aiping Li
Author_Institution
Sch. of Comput. Sci. & Eng., Univ. of New South Wales, Sydney, NSW, Australia
Volume
26
Issue
1
fYear
2014
fDate
Jan. 2014
Firstpage
83
Lastpage
96
Abstract
In this paper, we tackle a novel problem of ranking multivalued objects, where an object has multiple instances in a multidimensional space, and the number of instances per object is not fixed. Given an ad hoc scoring function that assigns a score to a multidimensional instance, we want to rank a set of multivalued objects. Unlike existing models for ranking uncertain and probabilistic data, which model an object as a random variable and assume that its instances are mutually exclusive, we have to capture the coexistence of instances. To tackle the problem, we advocate the semantics of favoring widely preferred objects over majority votes, a semantics widely used in many elections and competitions. Technically, we borrow the idea from Borda Count (BC), a well-recognized method in consensus-based voting systems. However, Borda Count cannot handle multivalued objects of inconsistent cardinality, and evaluating top-k queries with it on large multidimensional data sets is costly. To address these challenges, we generalize Borda Count to a quantile-based Borda Count and develop efficient computational methods with a comprehensive cost analysis. We present case studies on real data sets to demonstrate the effectiveness of the generalized Borda Count ranking, and use synthetic and real data sets to verify the efficiency of our computational methods.
Keywords
probability; query processing; random processes; BC; consensus-based ranking; consensus-based voting systems; cost analysis; generalized Borda count approach; ad hoc scoring function; large multidimensional data sets; multidimensional instance; multidimensional space; multivalued object problem; probabilistic data; quantile-based Borda count; random variable; ranking uncertain models; Biological system modeling; Cities and towns; Data models; Economics; Educational institutions; Indexes; Probabilistic logic; Multivalued objects
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
IEEE
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2012.250
Filename
6399469