DocumentCode :
3277583
Title :
Bilingual topic taxonomy generation based on bilingual documents clustering
Author :
Zhang, Cheng-zhi
Author_Institution :
Dept. of Inf. Manage., Nanjing Univ. of Sci. & Technol., Nanjing, China
Volume :
4
fYear :
2011
fDate :
10-13 July 2011
Firstpage :
1889
Lastpage :
1895
Abstract :
A bilingual taxonomy is one of the key components of a multilingual ontology. In this paper, the affinity propagation clustering algorithm is used to cluster a bilingual document collection and generate a bilingual topic taxonomy. Two methods for generating the bilingual topic taxonomy are described: clustering the bilingual documents before text feature reconstruction, and clustering them after it. Datasets from two domains are tested, and the results show that, as measured by net similarity, document clustering after feature reconstruction performs better than clustering before feature reconstruction.
Keywords :
document handling; linguistics; ontologies (artificial intelligence); pattern clustering; affinity propagation clustering algorithm; after text feature reconstruction; before text feature reconstruction; bilingual documents clustering; bilingual documents collection; bilingual topic taxonomy generation; multilingual ontology; Clustering algorithms; Dictionaries; Entropy; Feature extraction; Law; Ontologies; Taxonomy; Bilingual documents clustering; Multilingual Ontology; Ontology learning; Parallel corpus; Topic taxonomy generation;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location :
Guilin
ISSN :
2160-133X
Print_ISBN :
978-1-4577-0305-8
Type :
conf
DOI :
10.1109/ICMLC.2011.6016948
Filename :
6016948