Title :
Keyterm extraction from microblogs´ messages using Wikipedia-based keyphraseness measure
Author :
Korshunov, Anton
Author_Institution :
Inf. Syst. Dept., Inst. for Syst. Program., Moscow, Russia
Abstract :
The paper describes a method for keyterm extraction from messages of microblogs. The described approach utilizes the information obtained by the analysis of structure and content of Wikipedia. The algorithm is based on computation of “keyphraseness” measure for each term, i.e. an estimation of probability that it can be selected as a key in the text. The experimental study of the proposed technique demonstrated satisfactory results which significantly outpaces analogues. As a demonstration of possible application of the algorithm, the prototype of context-sensitive advertising system has been implemented. This system is able to obtain the descriptions of the goods relevant to the found keyterms from Amazon online store. Several suggestions are also made on how to utilize the information obtained by the analysis of Twitter messages in different auxiliary services.
Keywords :
Web sites; estimation theory; Twitter messages; Wikipedia based keyphraseness measure; auxiliary services; context sensitive advertising system; keyterm extraction; microblogs message; probability estimation; Blogs; Databases; Electronic publishing; Encyclopedias; Internet; Twitter; Twitter; Wikipedia; context-sensitive advertising; keyterm extraction; microblogging;
Conference_Titel :
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location :
Sousse
Print_ISBN :
978-1-4673-1657-6
DOI :
10.1109/SETIT.2012.6482038