DocumentCode :
3425819
Title :
Learning from Twitter Hashtags: Leveraging Proximate Tags to Enhance Graph-Based Keyphrase Extraction
Author :
Bellaachia, Abdelghani ; Al-Dhelaan, M.
Author_Institution :
Comput. Sci. Dept., George Washington Univ., Washington, DC, USA
fYear :
2012
fDate :
20-23 Nov. 2012
Firstpage :
348
Lastpage :
357
Abstract :
In the micro-blogging service Twitter, the sparseness of text messages is an enormous obstacle in extracting key phrases from tweets. However, regardless of the sparseness in text, tweets include an abundant number of links in the form of hash tags. This paper investigates the possibility of leveraging hash tags in tweets to enhance the graph-based key phrase extraction. By using an auxiliary set of tweets found in hash tags, we show that we can improve extracting key phrases from tweets by augmenting the graph with a wider knowledge context. Specifically, we propose two different approaches for choosing the best hash tags links to use for enhancing graph-based key phrase extraction by either using a frequency approach or a hybrid approach that uses multiple methods for cleverly choosing the best hash tags. Experiments on the proposed approaches showed an improvement in the range of 9% to 37% over the case of ignoring the hash tag links.
Keywords :
cryptography; graph theory; social networking (online); text analysis; Twitter hashtags; graph augmentation; graph-based keyphrase extraction; hashtag links; knowledge context; microblogging service; text messages; Blogs; Conferences; Context; Feature extraction; Frequency measurement; Twitter; Graph-based Ranking; Hashtag; Keyphrase Extraction; NE-Rank; Social Network Analysis; Twitter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Green Computing and Communications (GreenCom), 2012 IEEE International Conference on
Conference_Location :
Besancon
Print_ISBN :
978-1-4673-5146-1
Type :
conf
DOI :
10.1109/GreenCom.2012.58
Filename :
6468336
Link To Document :
بازگشت