Title :
Keyphrase Extraction Using Semantic Networks Structure Analysis
Author :
Huang, Chong ; Tian, Yonghong ; Zhou, Zhi ; Ling, Charles X. ; Huang, Tiejun
Author_Institution :
Chinese Acad. of Sci., Grad. Univ., Beijing
Abstract :
Keyphrases play a key role in text indexing, summarization and categorization. However, most of the existing keyphrase extraction approaches require human-labeled training sets. In this paper, we propose an automatic keyphrase extraction algorithm, which can be used in both supervised and unsupervised tasks. This algorithm treats each document as a semantic network. Structural dynamics of the network are used to extract keyphrases (key nodes) unsupervised. Experiments demonstrate the proposed algorithm averagely improves 50% in effectiveness and 30% in efficiency in unsupervised tasks and performs comparatively with supervised extractors. Moreover, by applying this algorithm to supervised tasks, we develop a classifier with an overall accuracy up to 80%.
Keywords :
classification; feature extraction; text analysis; automatic keyphrase extraction; classification; semantic networks structure analysis; structural dynamics; unsupervised task; Books; Computer science; Data mining; Frequency; Indexing; Software libraries; Supervised learning; Training data; Unsupervised learning; Web pages;
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2701-7
DOI :
10.1109/ICDM.2006.92