DocumentCode :
3756097
Title :
Unsupervised Data Driven Taxonomy Learning
Author :
Mahmoud M. Hosny;Samhaa R. El-Beltagy;Mahmoud E. Allam
Author_Institution :
Nile Univ., Giza, Egypt
fYear :
2015
fDate :
4/1/2015 12:00:00 AM
Firstpage :
9
Lastpage :
14
Abstract :
The ability to effectively organize textual information is a big challenge in intelligent text processing. With the increase in the amount of textual data being generated, this task is becoming more and more essential. In this paper we present an unsupervised computer-aided tool for automatically building classification schemes and taxonomies for enhancing the process of automated text classification. The tool utilizes the Wikipedia knowledge base and its categorization system to achieve its goal. Validation of the tool was done using a subset of a large language dataset obtained from the Google moderator series (Egypt 2.0) idea bank. The output of the tool was evaluated by comparing the similarity between the results obtained automatically from the tool, and those manually annotated by three different human evaluators, verifying the effectiveness of the tool. The tool showed effectiveness with a precision of 88.6% and recall of 81.2%.
Keywords :
"Taxonomy","Encyclopedias","Electronic publishing","Internet","Buildings","Clustering algorithms"
Publisher :
ieee
Conference_Titel :
Arabic Computational Linguistics (ACLing), 2015 First International Conference on
Print_ISBN :
978-1-4673-9154-2
Type :
conf
DOI :
10.1109/ACLing.2015.8
Filename :
7422273
Link To Document :
بازگشت