DocumentCode :
3724145
Title :
CrowdTC: Crowdsourced Taxonomy Construction
Author :
Rui Meng;Yongxin Tong;Lei Chen;Caleb Chen Cao
Author_Institution :
Dept. of Comput. Sci. &
fYear :
2015
Firstpage :
913
Lastpage :
918
Abstract :
Recently, taxonomy has attracted much attention. Both automatic construction solutions and human-based computation approaches have been proposed. The automatic methods suffer from the problem of either low precision or low recall and human computation, on the other hand, is not suitable for large scale tasks. Motivated by the shortcomings of both approaches, we present a hybrid framework, which combines the power of machine-based approaches and human computation (the crowd) to construct a more complete and accurate taxonomy. Specifically, our framework consists of two steps: we first construct a complete but noisy taxonomy automatically, then crowd is introduced to adjust the entity positions in the constructed taxonomy. However, the adjustment is challenging as the budget (money) for asking the crowd is often limited. In our work, we formulate the problem of finding the optimal adjustment as an entity selection optimization (ESO) problem, which is proved to be NP-hard. We then propose an exact algorithm and a more efficient approximation algorithm with an approximation ratio of 1/2(1-1/e). We conduct extensive experiments on real datasets, the results show that our hybrid approach largely improves the recall of the taxonomy with little impairment for precision.
Keywords :
"Taxonomy","Optimization","Uncertainty","Approximation algorithms","Syntactics","Noise measurement","Approximation methods"
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2015 IEEE International Conference on
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2015.77
Filename :
7373411
Link To Document :
بازگشت