Title :
Keyword Extraction Using PageRank on Synonym Networks
Author :
Liu, Zhengyang ; Liu, Jianyi ; Yao, Wenbin ; Wang, Cong
Author_Institution :
Nat. Eng. Lab. for Disaster Backup & Recovery, Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Keyword extraction is an important application in information technology. Automatic keyword extraction helps readers learn what an article is primarily about without reading the full text carefully. This paper introduces a keyword extraction algorithm that applies PageRank to synonym networks. First, the content of a single document is represented as a weighted synonym co-occurrence network. Then the PageRank algorithm is applied to this network to assign a rank to each synonym group. Finally, the top-ranked synonym groups are selected as the keywords of the document. The algorithm is tested on a corpus of blog pages, and the experimental results indicate that it is practical and effective.
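The three steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the synonym dictionary, the co-occurrence window, the edge weighting, or the PageRank parameters. The sketch assumes a hypothetical hand-written synonym table (SYNONYMS), sentence-level co-occurrence, edge weights equal to co-occurrence counts, and a damping factor of 0.85.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical synonym table (an assumption): word -> synonym-group label.
SYNONYMS = {
    "car": "car", "automobile": "car", "vehicle": "car",
    "fast": "fast", "quick": "fast", "rapid": "fast",
    "engine": "engine", "motor": "engine",
}

# Toy document, one sentence per entry (illustrative data only).
SENTENCES = [
    "the car has a fast engine",
    "a quick automobile needs a strong motor",
    "the vehicle is rapid",
    "the motor of the car failed",
]


def build_network(sentences, synonyms):
    """Step 1: weighted, undirected co-occurrence graph over synonym groups.

    Assumption: two groups co-occur when they appear in the same sentence,
    and each such sentence adds 1 to the edge weight.
    """
    weights = defaultdict(lambda: defaultdict(float))
    for sentence in sentences:
        groups = sorted({synonyms[w] for w in sentence.lower().split()
                         if w in synonyms})
        for a, b in combinations(groups, 2):
            weights[a][b] += 1.0
            weights[b][a] += 1.0
    return weights


def pagerank(weights, damping=0.85, iterations=50):
    """Step 2: weighted PageRank by power iteration.

    Every node in the graph has at least one edge, so there are no
    dangling nodes to handle.
    """
    nodes = list(weights)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    strength = {v: sum(weights[v].values()) for v in nodes}
    for _ in range(iterations):
        rank = {
            v: (1 - damping) / n + damping * sum(
                rank[u] * weights[u][v] / strength[u] for u in weights[v])
            for v in nodes
        }
    return rank


def extract_keywords(sentences, synonyms, top_k=2):
    """Step 3: return the top_k synonym groups by PageRank score."""
    rank = pagerank(build_network(sentences, synonyms))
    return sorted(rank, key=rank.get, reverse=True)[:top_k]


if __name__ == "__main__":
    print(extract_keywords(SENTENCES, SYNONYMS, top_k=1))
```

Merging words into synonym groups before building the graph means that "car", "automobile", and "vehicle" contribute to a single node, so a concept expressed with varied vocabulary is not split across several weaker candidates.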
Keywords :
document handling; information retrieval; information technology; network theory (graphs); vocabulary; PageRank; automatic keyword extraction; document content; synonym co-occurrence network; Algorithm design and analysis; Data mining; Information services; Internet; Joining processes; Prediction algorithms; Web sites
Conference_Title :
E-Product E-Service and E-Entertainment (ICEEE), 2010 International Conference on
Conference_Location :
Henan
Print_ISBN :
978-1-4244-7159-1
DOI :
10.1109/ICEEE.2010.5660630