• DocumentCode
    2710999
  • Title

    Active Learning of Equivalence Relations by Minimizing the Expected Loss Using Constraint Inference

  • Author

    Rendle, Steffen ; Schmidt-Thieme, Lars

  • Author_Institution
    Machine Learning Lab., Inst. for Comput. Sci. Univ. of Hildesheim, Hildesheim
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    1001
  • Lastpage
    1006
  • Abstract
    Selecting promising queries is the key to effective active learning. In this paper, we investigate selection techniques for the task of learning an equivalence relation where the queries are about pairs of objects. As the target relation satisfies the axioms of transitivity, from one queried pair additional constraints can be inferred. We derive both the upper and lower bound on the number of queries needed to converge to the optimal solution. Besides restricting the set of possible solutions, constraints can be used as training data for learning a similarity measure. For selecting queries that result in a large number of meaningful constraints, we present an approximative optimal selection technique that greedily minimizes the expected loss in each round of active learning. This technique makes use of inference of expected constraints. Besides the theoretical results, an extensive evaluation for the application of record linkage shows empirically that the proposed selection method leads to both interesting and a high number of constraints.
  • Keywords
    learning (artificial intelligence); active learning; constraint inference; equivalence relations; queries; Computer science; Constraint theory; Couplings; Data mining; Machine learning; Merging; Sampling methods; Social network services; Training data; Uncertainty; Active Learning; Equivalence Relation; Record Linkage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3502-9
  • Type

    conf

  • DOI
    10.1109/ICDM.2008.41
  • Filename
    4781215