DocumentCode :
1908211
Title :
Mining Recipes in Microblog
Author :
Shengyu Liu ; Qingcai Chen ; Shanshan Guan ; Xiaolong Wang ; Huimiao Shi
Author_Institution :
Intell. Comput. Res. Center, Harbin Inst. of Technol., Shenzhen, China
fYear :
2013
fDate :
17-19 Aug. 2013
Firstpage :
29
Lastpage :
32
Abstract :
Microblog, as an online communication platform, is becoming increasingly popular. Users generate large volumes of data every day, and this user-generated content contains useful knowledge such as practical skills and technical expertise. This paper proposes a cross-data method for mining recipes in Microblog. First, text snippets relevant to recipes are extracted from Baidu Encyclopedia. Second, the extracted snippets are used to train a domain-specific unigram language model. Third, candidate recipes in Microblog are mined using the unigram language model. Finally, heuristic rules are used to identify real recipes among the candidates. Experimental results show the effectiveness of the proposed method.
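Illustrative sketch (not from the paper): the abstract names a domain-specific unigram language model but gives no formulas, so the following is only a minimal sketch of how such a model could score microblog posts. Whitespace tokenization, add-one smoothing, the average log-probability score, the threshold value, and the function names train_unigram, avg_log_prob, and mine_candidates are all assumptions made for illustration. The paper's heuristic rules for filtering candidates are not described in the abstract and are not modeled here.

```python
import math
from collections import Counter


def train_unigram(snippets):
    """Count tokens across domain snippets (whitespace tokenization is an assumption)."""
    counts = Counter()
    for snippet in snippets:
        counts.update(snippet.lower().split())
    return counts


def avg_log_prob(text, counts):
    """Average per-token log-probability under an add-one smoothed unigram model."""
    tokens = text.lower().split()
    if not tokens:
        return float("-inf")
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot reserved for unseen tokens
    return sum(
        math.log((counts[t] + 1) / (total + vocab)) for t in tokens
    ) / len(tokens)


def mine_candidates(posts, counts, threshold):
    """Keep posts whose score reaches the (hypothetical) threshold."""
    return [p for p in posts if avg_log_prob(p, counts) >= threshold]


# Toy English data standing in for encyclopedia snippets and microblog posts.
snippets = ["boil the noodles add salt", "stir fry the beef add soy sauce"]
counts = train_unigram(snippets)
posts = ["add salt and stir the noodles", "traffic jam downtown again today"]
candidates = mine_candidates(posts, counts, threshold=-3.0)
```

In this toy setup the post that shares vocabulary with the snippets scores higher than the unrelated one. A real system for Chinese text would need a word segmenter in place of split(), and a threshold tuned on labeled data.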
Keywords :
Web sites; data mining; encyclopaedias; information retrieval; natural language processing; text analysis; Baidu encyclopedia; cross-data method; domain-specific unigram language model; heuristic rules; knowledge; microblog; online communication platform; practical skills; recipes mining; technical expertise; text snippets extraction; user generated content; Communities; Data mining; Domain specific languages; Encyclopedias; Media; Twitter; Baidu encyclopedia; Microblog; language model; recipes;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location :
Urumqi
Type :
conf
DOI :
10.1109/IALP.2013.13
Filename :
6645996