Title :
Keyword Extraction Using Language Network
Author :
Liu, Jianyi ; Wang, Jinghua
Author_Institution :
Beijing Univ. of Posts & Telecommun, Beijing
fDate :
Aug. 30 2007-Sept. 1 2007
Abstract :
In this paper, we introduced language network and described three kinds of networks. Keyword extraction is an important technology in many areas of document processing. In particularly, a keyword extraction algorithm based on language network and PageRank is proposed. Firstly a semantic network for a single document is build, then Pagerank is applied in the network to decide on the importance of a word, finally top-ranked words are selected as keywords of the document. The algorithm is tested on the corpus of CISTR, and the experiment result proves practical and effective.
Keywords :
document handling; feature extraction; natural languages; CISTR corpus; Pagerank; document processing; keyword extraction algorithm; language network; Clustering algorithms; Computer networks; Content based retrieval; Data mining; Information retrieval; Information technology; National electric code; Telecommunication computing; Testing; Text categorization;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1611-0
Electronic_ISBN :
978-1-4244-1611-0
DOI :
10.1109/NLPKE.2007.4368023