Regional Information Center for Science and Technology - Applying frequency and location information to keyword extraction in single document

DocumentCode :

2243868

Title :

Applying frequency and location information to keyword extraction in single document

Author :

Ying Qin

Author_Institution :

Dept. of Comput. Sci., Beijing Foreign Studies Univ., Beijing, China

fYear :

2012

fDate :

Oct. 30 2012-Nov. 1 2012

Firstpage :

1398

Lastpage :

1402

Abstract :

Keyword extraction from a single document differs from text classification, in which a collection of texts can be compared and referred to. The paper focuses on keyword extraction based on statistical information about words, that is, the intrinsic features of keywords within a single document. Besides general features such as word frequency and the part of speech (POS) of a word, location features of keywords are investigated in depth and applied to select candidate words. In experiments on the same corpus, the extraction approach based on this method outperforms TFIDF, TextRank, and other unsupervised methods.
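
The abstract does not give the paper's scoring formula, so the following is only a minimal illustrative sketch of how frequency and location information might be combined for a single document. The function name `score_keywords`, the stopword list, the `location_weight` parameter, and the "earliness of first occurrence" feature are all assumptions made for illustration; they are not taken from the paper, and POS filtering is omitted.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "of", "in", "to", "and", "is", "are",
             "for", "on", "from", "by", "with", "that", "this", "it"}


def score_keywords(text, top_k=5, location_weight=1.0):
    """Rank candidate words using term frequency plus a simple location bonus.

    Assumed scoring (not the paper's formula):
        score = tf * (1 + location_weight * earliness)
    where earliness is 1.0 for a word first seen at the start of the
    document and approaches 0.0 for a word first seen at the end.
    """
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return []
    n = len(tokens)

    freq = Counter()
    first_pos = {}
    for i, tok in enumerate(tokens):
        # Skip stopwords and very short tokens as non-candidates.
        if tok in STOPWORDS or len(tok) < 3:
            continue
        freq[tok] += 1
        first_pos.setdefault(tok, i)  # record only the first occurrence

    scores = {}
    for word, count in freq.items():
        tf = count / n
        earliness = 1.0 - first_pos[word] / n
        scores[word] = tf * (1.0 + location_weight * earliness)

    # Sort by descending score; break ties alphabetically for determinism.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
```

Calling `score_keywords(document_text, top_k=10)` returns the ten highest-scoring candidate words. Setting `location_weight=0` reduces the sketch to plain term-frequency ranking, which makes the contribution of the location feature easy to isolate.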

Keywords :

document handling; information retrieval; POS; TFIDF; TextRank; frequency information; location information; single document keyword extraction; words statistical information; Abstracts; Feature extraction; Frequency measurement; Pragmatics; Text categorization; keyword extraction; single document; unsupervised approach;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on

Conference_Location :

Hangzhou

Print_ISBN :

978-1-4673-1855-6

Type :

conf

DOI :

10.1109/CCIS.2012.6664615

Filename :

6664615

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2243868