DocumentCode :
2243868
Title :
Applying frequency and location information to keyword extraction in single document
Author :
Ying Qin
Author_Institution :
Dept. of Comput. Sci., Beijing Foreign Studies Univ., Beijing, China
fYear :
2012
fDate :
Oct. 30 2012-Nov. 1 2012
Firstpage :
1398
Lastpage :
1402
Abstract :
Keyword extraction from single document is not same to the task of text classification, in which a collection of texts can be compared and referred to. The paper focuses on the keyword extraction based on statistical information of words, that is, self features of keywords in the single document. Besides of general features such as word frequency and POS of a word, location features of a keyword are deep investigated and applied to select the candidate words. Experimental results of the extraction approach based on this method outperform TFIDF, TextRank and other unsupervised methods by comparing with them on the same corpus.
Keywords :
document handling; information retrieval; POS; TFIDF; TextRank; frequency information; location information; single document keyword extraction; words statistical information; Abstracts; Feature extraction; Frequency measurement; Pragmatics; Text categorization; keyword extraction; single document; unsupervised approach;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
Type :
conf
DOI :
10.1109/CCIS.2012.6664615
Filename :
6664615
Link To Document :
بازگشت