Title :
Bilingual topic taxonomy generation based on bilingual documents clustering
Author :
Zhang, Cheng-zhi
Author_Institution :
Dept. of Inf. Manage., Nanjing Univ. of Sci. & Technol., Nanjing, China
Abstract :
Bilingual taxonomy is one of key components of multilingual Ontology. In this paper, affinity propagation clustering algorithm is used to cluster bilingual documents collection and generate bilingual topic taxonomy. Two bilingual topic taxonomy generation methods, i.e. bilingual documents clustering before or after text feature reconstruction, are described. Dataset in two domains are tested and result shows that: according to net similarity, the result of documents clustering after feature reconstruction is better than that before feature reconstruction.
Keywords :
document handling; linguistics; ontologies (artificial intelligence); pattern clustering; affinity propagation clustering algorithm; after text feature reconstruction; before text feature reconstruction; bilingual documents clustering; bilingual documents collection; bilingual topic taxonomy generation; multilingual ontology; Clustering algorithms; Dictionaries; Entropy; Feature extraction; Law; Ontologies; Taxonomy; Bilingual documents clustering; Multilingual Ontology; Ontology learning; Parallel corpus; Topic taxonomy generation;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location :
Guilin
Print_ISBN :
978-1-4577-0305-8
DOI :
10.1109/ICMLC.2011.6016948