• DocumentCode
    108879
  • Title

    Consensus-Based Ranking of Multivalued Objects: A Generalized Borda Count Approach

  • Author

    Ying Zhang ; Wenjie Zhang ; Jian Pei ; Xuemin Lin ; Qianlu Lin ; Aiping Li

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of New South Wales, Sydney, NSW, Australia
  • Volume
    26
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    83
  • Lastpage
    96
  • Abstract
    In this paper, we tackle a novel problem of ranking multivalued objects, where an object has multiple instances in a multidimensional space, and the number of instances per object is not fixed. Given an ad hoc scoring function that assigns a score to a multidimensional instance, we want to rank a set of multivalued objects. Different from the existing models of ranking uncertain and probabilistic data, which model an object as a random variable and the instances of an object are assumed exclusive, we have to capture the coexistence of instances here. To tackle the problem, we advocate the semantics of favoring widely preferred objects instead of majority votes, which is widely used in many elections and competitions. Technically, we borrow the idea from Borda Count (BC), a well-recognized method in consensus-based voting systems. However, Borda Count cannot handle multivalued objects of inconsistent cardinality, and is costly to evaluate top (k) queries on large multidimensional data sets. To address the challenges, we extend and generalize Borda Count to quantile-based Borda Count, and develop efficient computational methods with comprehensive cost analysis. We present case studies on real data sets to demonstrate the effectiveness of the generalized Borda Count ranking, and use synthetic and real data sets to verify the efficiency of our computational method.
  • Keywords
    probability; query processing; random processes; BC; consensus-based ranking; consensus-based voting systems; cost analysis; generalized Borda count approach; hoc scoring function; large multidimensional data sets; multidimensional instance; multidimensional space; multivalued object problem; probabilistic data; quantile-based Borda count; random variable; ranking uncertain models; Biological system modeling; Cities and towns; Data models; Economics; Educational institutions; Indexes; Probabilistic logic; Multivalued objects; consensus-based ranking;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.250
  • Filename
    6399469