DocumentCode :
2718240
Title :
Mining the blogosphere to generate local cuisine hotspots for mobile map service
Author :
Shih, Chia-Chun ; Peng, Ting-Chun ; Lai, Wei-Shen
Author_Institution :
Inst. for Inf. Ind., Taipei, Taiwan
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
1
Lastpage :
8
Abstract :
On-the-go consumers require dynamic information, particularly "word of mouth, " to make better purchase decisions. A popular genre of mobile map services is travel/cuisine, which is a popular topic for bloggers as well. This study attempts to generate local cuisine hotspot maps through blog content mining. The main obstacle in doing this involves recognizing and extracting restaurants and essential restaurant information (i.e., restaurant dishes) in unstructured content. In contrast to traditional Named Entity Recognition (NER) targets, dish name is a promising target that received little attention in previous studies. This study develops methods for recognizing and extracting restaurant names and dish names from Chinese blog posts and achieves satisfactory performance. The extraction results are arranged into hotspots and presented in map views. The extracted information can be fed back to POI (Point of Interest) databases, and thus a brand-new POI database comprising information extracted from User Generated Content (UGC) can be realized.
Keywords :
Web sites; catering industry; data mining; pattern classification; blog content mining; blogosphere mining; information extraction; local cuisine hotspot generation; mobile map service; named entity recognition; point of interest database; user generated content; Cities and towns; Data mining; Databases; Information services; Internet; Mining industry; Mouth; Target recognition; User-generated content; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Information Management, 2009. ICDIM 2009. Fourth International Conference on
Conference_Location :
Ann Arbor, MI
Print_ISBN :
978-1-4244-4253-9
Electronic_ISBN :
978-1-4244-4254-6
Type :
conf
DOI :
10.1109/ICDIM.2009.5356785
Filename :
5356785
Link To Document :
بازگشت