Title :
Applying frequency and location information to keyword extraction in single document
Author_Institution :
Dept. of Comput. Sci., Beijing Foreign Studies Univ., Beijing, China
fDate :
Oct. 30 2012-Nov. 1 2012
Abstract :
Keyword extraction from single document is not same to the task of text classification, in which a collection of texts can be compared and referred to. The paper focuses on the keyword extraction based on statistical information of words, that is, self features of keywords in the single document. Besides of general features such as word frequency and POS of a word, location features of a keyword are deep investigated and applied to select the candidate words. Experimental results of the extraction approach based on this method outperform TFIDF, TextRank and other unsupervised methods by comparing with them on the same corpus.
Keywords :
document handling; information retrieval; POS; TFIDF; TextRank; frequency information; location information; single document keyword extraction; words statistical information; Abstracts; Feature extraction; Frequency measurement; Pragmatics; Text categorization; keyword extraction; single document; unsupervised approach;
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
DOI :
10.1109/CCIS.2012.6664615