Title :
To Determine the Weight in a Weighted Sum Method for Domain-Specific Keyword Extraction
Author :
Liu, Wenshuo ; Li, Wenxin
Author_Institution :
Key Lab. of Machine Perception, Beijing Supertool Internet Technol. Co.Ltd., Beijing
Abstract :
Keyword extraction is a long-standing topic in natural language processing. However, most methods are too complicated and slow to be applied in real applications, such as web-based systems. This paper proposes an approach that moves part of the work into a preparation stage, which explores the linguistic characteristics of a specific domain. This stage needs to be completed only once, which reduces the burden on the actual extraction process. The approach is a weighted sum method, and the preparation work focuses on determining the weights. Once the weights are known, extraction requires only addition, multiplication, and sorting, which are simple operations for a modern computer. Experimental results show the effectiveness of the proposed approach.
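The extraction step described in the abstract (multiply, add, sort) can be illustrated with a minimal sketch. This is not the authors' implementation: the feature names, the weight values, and the helper `extract_keywords` are hypothetical, and the way the paper actually determines the weights is not shown here. The sketch only assumes that each candidate term has a numeric feature vector and that a weight vector has already been found in the preparation stage.

```python
from typing import Dict, List, Sequence


def extract_keywords(
    features: Dict[str, Sequence[float]],
    weights: Sequence[float],
    top_k: int,
) -> List[str]:
    """Rank candidate terms by the weighted sum of their features.

    Hypothetical helper for illustration only; the paper's feature set
    and weight values are not given in the abstract.
    """
    scores: Dict[str, float] = {}
    for term, feats in features.items():
        if len(feats) != len(weights):
            raise ValueError(f"feature/weight length mismatch for {term!r}")
        # Weighted sum: only multiplication and addition are needed.
        scores[term] = sum(w * x for w, x in zip(weights, feats))
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:top_k]


# Assumed features per candidate: (term frequency, in-title flag, position score).
candidates = {
    "keyword extraction": [0.8, 1.0, 0.9],
    "weight vector": [0.6, 0.0, 0.5],
    "modern computer": [0.2, 0.0, 0.1],
}
# Assumed weights, standing in for the output of the preparation stage.
weights = [0.5, 0.3, 0.2]

top_two = extract_keywords(candidates, weights, top_k=2)
```

With these assumed values, `top_two` holds the two highest-scoring candidates. Because the weights are fixed in advance, the per-document cost is one pass over the candidates plus a sort, which matches the abstract's claim that the online step is cheap.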
Keywords :
feature extraction; linguistics; natural language processing; text analysis; domain-specific keyword extraction; linguistic characteristics; modern computer; weighted sum method; Data mining; Genetic algorithms; Internet; Laboratories; Machine learning; Machine learning algorithms; Predictive models; Speech; Statistics; keyword extraction; weight vector
Conference_Title :
Computer Engineering and Technology, 2009. ICCET '09. International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-3334-6
DOI :
10.1109/ICCET.2009.136