DocumentCode
2783755
Title
Automatic Construction of Domain Concept Hierarchy
Author
Qiao, Sun ; Chunhui, Zhang ; Zhibo, Chen
Author_Institution
Sch. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
fYear
2010
fDate
10-12 Oct. 2010
Firstpage
433
Lastpage
436
Abstract
A general automatic domain concept hierarchy construction procedure is presented in this paper. This is a domain independent construct a domain concept hierarchy from a domain corpus. The construction procedure mainly includes domain terminology extraction, word sense disambiguation, similarity computation, hierarchy construction and subsumption relation detection. All extracted candidate terms are ranked first, then one can select the top terms as domain terminologies. Frequency ratio and entropy of a word are considered to rank candidate terms. Relations between terms are taken into account for words in WordNet, while distributional similarity is used to compute similarity between words outside WordNet. Experiments on two domain corpus show that the proposed procedure is feasible and can get reasonable concept hierarchy.
Keywords
Internet; entropy; information retrieval; text analysis; word processing; WordNet; automatic domain concept hierarchy construction; domain corpus; domain terminology extraction; frequency ratio; similarity computation; subsumption relation detection; word entropy; word sense disambiguation; Entropy; Feature extraction; Frequency domain analysis; Learning; Terminology; construction; domain concept; extraction; hierarchy;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2010 International Conference on
Conference_Location
Huangshan
Print_ISBN
978-1-4244-8434-8
Electronic_ISBN
978-0-7695-4235-5
Type
conf
DOI
10.1109/CyberC.2010.85
Filename
5617014
Link To Document