DocumentCode
1908211
Title
Mining Recipes in Microblog
Author
Shengyu Liu ; Qingcai Chen ; Shanshan Guan ; Xiaolong Wang ; Huimiao Shi
Author_Institution
Intell. Comput. Res. Center, Harbin Inst. of Technol., Shenzhen, China
fYear
2013
fDate
17-19 Aug. 2013
Firstpage
29
Lastpage
32
Abstract
Microblog, as an online communication platform, is becoming increasingly popular. Users generate large volumes of data every day, and this user-generated content contains a lot of useful knowledge, such as practical skills and technical expertise. This paper proposes a cross-data method to mine recipes in Microblog. In the proposed method, snippets of text relevant to recipes are first extracted from Baidu Encyclopedia. Second, the extracted snippets are used to train a domain-specific unigram language model. Third, candidate recipes in Microblog are mined using the unigram language model. Finally, heuristic rules are used to identify real recipes among the candidates. Experimental results show the effectiveness of the proposed method.
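The second and third steps of the abstract (training a domain-specific unigram language model on encyclopedia snippets, then using it to pick out candidate posts) can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenization, the add-one smoothing, the average log-probability score, the threshold value, the toy English data, and the function names `train_unigram`, `avg_log_prob`, and `mine_candidates` are all assumptions made for illustration. The heuristic filtering step is left out because the abstract does not specify the rules.

```python
import math
from collections import Counter


def train_unigram(snippets):
    """Count tokens in the domain snippets (whitespace tokenization is an assumption)."""
    counts = Counter()
    for snippet in snippets:
        counts.update(snippet.lower().split())
    return counts


def avg_log_prob(text, counts):
    """Average per-token log-probability under an add-one smoothed unigram model."""
    tokens = text.lower().split()
    if not tokens:
        return float("-inf")
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot shared by all unseen tokens
    return sum(math.log((counts[t] + 1) / (total + vocab)) for t in tokens) / len(tokens)


def mine_candidates(posts, counts, threshold):
    """Keep posts whose score exceeds a hand-picked threshold (hypothetical value)."""
    return [p for p in posts if avg_log_prob(p, counts) > threshold]


# Toy stand-ins for encyclopedia snippets and microblog posts (invented data).
snippets = [
    "chop the onion and fry in oil",
    "add salt and simmer the soup",
    "stir the sauce and add sugar",
]
model = train_unigram(snippets)
posts = [
    "fry the onion and add salt",
    "traffic jam again this morning",
]
candidates = mine_candidates(posts, model, threshold=-3.0)
```

In this toy setup, the post that shares vocabulary with the snippets scores above the threshold and the unrelated post does not. In practice the threshold would have to be tuned on held-out data, and Chinese text would need a proper word segmenter rather than whitespace splitting.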
Keywords
Web sites; data mining; encyclopaedias; information retrieval; natural language processing; text analysis; Baidu encyclopedia; cross-data method; domain-specific unigram language model; heuristic rules; knowledge; microblog; online communication platform; practical skills; recipes mining; technical expertise; text snippets extraction; user generated content; Communities; Data mining; Domain specific languages; Encyclopedias; Media; Twitter; Baidu encyclopedia; Microblog; language model; recipes;
fLanguage
English
Publisher
IEEE
Conference_Titel
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location
Urumqi
Type
conf
DOI
10.1109/IALP.2013.13
Filename
6645996