• DocumentCode
    1908211
  • Title
    Mining Recipes in Microblog
  • Author
    Shengyu Liu ; Qingcai Chen ; Shanshan Guan ; Xiaolong Wang ; Huimiao Shi
  • Author_Institution
    Intell. Comput. Res. Center, Harbin Inst. of Technol., Shenzhen, China
  • fYear
    2013
  • fDate
    17-19 Aug. 2013
  • Firstpage
    29
  • Lastpage
    32
  • Abstract
    Microblog, as an online communication platform, is becoming increasingly popular. Users generate large volumes of data every day, and this user-generated content contains much useful knowledge, such as practical skills and technical expertise. This paper proposes a cross-data method to mine recipes in Microblog. First, snippets of text relevant to recipes are extracted from Baidu Encyclopedia. Second, the extracted snippets are used to train a domain-specific unigram language model. Third, candidate recipes in Microblog are mined using the unigram language model. Finally, heuristic rules are used to identify real recipes among the candidates. Experimental results show the effectiveness of the proposed method.
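    The four steps in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no smoothing scheme, scoring function, threshold, or heuristic rules, so add-one smoothing, average per-token log-probability, the threshold value, and the minimum-length rule below are all assumptions, as are the function names. The sketch also assumes text that is already split into whitespace-separated tokens; Chinese microblog text would first need a word segmenter.

```python
import math
from collections import Counter


def train_unigram(snippets):
    """Count tokens in domain snippets; returns (counts, total, vocab_size)."""
    counts = Counter(tok for s in snippets for tok in s.split())
    return counts, sum(counts.values()), len(counts)


def avg_log_prob(text, model):
    """Average per-token log-probability under add-one smoothing (assumed)."""
    counts, total, vocab = model
    tokens = text.split()
    if not tokens:
        return float("-inf")
    # vocab + 1 reserves one slot of probability mass for unseen tokens.
    denom = total + vocab + 1
    return sum(math.log((counts[t] + 1) / denom) for t in tokens) / len(tokens)


def mine_candidates(posts, model, threshold):
    """Keep posts whose score under the domain model reaches the threshold."""
    return [p for p in posts if avg_log_prob(p, model) >= threshold]


def looks_like_recipe(post, min_tokens=4):
    """Hypothetical heuristic rule: a recipe needs at least min_tokens tokens."""
    return len(post.split()) >= min_tokens


# Toy stand-ins for encyclopedia snippets and microblog posts.
snippets = ["boil the noodles add salt", "chop the onion add oil and fry"]
posts = ["add salt and fry the onion", "traffic jam downtown again today"]

model = train_unigram(snippets)
candidates = mine_candidates(posts, model, threshold=-3.0)  # assumed threshold
recipes = [p for p in candidates if looks_like_recipe(p)]
```

    Averaging the log-probability over tokens keeps long and short posts comparable; in practice the threshold would have to be tuned on labelled data.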
  • Keywords
    Web sites; data mining; encyclopaedias; information retrieval; natural language processing; text analysis; Baidu encyclopedia; cross-data method; domain-specific unigram language model; heuristic rules; knowledge; microblog; online communication platform; practical skills; recipes mining; technical expertise; text snippets extraction; user generated content; Communities; Data mining; Domain specific languages; Encyclopedias; Media; Twitter; Baidu encyclopedia; Microblog; language model; recipes;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asian Language Processing (IALP), 2013 International Conference on
  • Conference_Location
    Urumqi
  • Type
    conf
  • DOI
    10.1109/IALP.2013.13
  • Filename
    6645996