DocumentCode :
1911859
Title :
Gathering Public Concerns from Web Towards Building Corpus of Japanese Regional Concerns
Author :
Shiramatsu, Shun ; Hirata, Norifumi ; Swezey, Robin M E ; Sano, Hiroyuki ; Ozono, Tadachika ; Shintani, Toramatsu
Author_Institution :
Grad. Sch. of Eng., Nagoya Inst. of Technol., Nagoya, Japan
fYear :
2012
fDate :
20-22 Sept. 2012
Firstpage :
248
Lastpage :
253
Abstract :
Importance of concern assessment has been increased in Japanese regional communities. We have developed an e-Participation web platform based on a Linked Open Data set called SOCIA (Social Opinions and Concerns for Ideal Argumentation). To sophisticate text mining technologies for supporting concern assessment, building a corpus of public concerns is an urgent task. There are two issues to utilize the dataset SOCIA as a corpus: (1) it is required to manage reliability of annotation and (2) to filter out noisy text not relevant to public concerns. To address these research issues, (1) we incorporate schema for describing meta-context information of annotation, that is, who is annotator, whether the annotator is a human or a software agent, and how reliable the annotation is. Furthermore, (2) we investigate the difference between features of concerns and that of non-concerns in Japanese microblog posts (i.e., tweets). Through the investigation, we address sample selection bias by formulating a novel metric for ranking features, i.e., bias-penalized information gain (BPIG).
Keywords :
Internet; Web sites; data mining; social sciences computing; software agents; Japanese microblog posts; Japanese regional concerns; SOCIA; World Wide Web; annotation; building corpus; e-Participation Web platform; linked open data set; meta-context information; public concerns; social opinions and concerns for ideal argumentation; software agent; text mining technologies; Buildings; Communities; Feature extraction; Government; Software reliability; Text mining; Twitter; concern assessment; corpus; e-Participation; sample selection bias;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Applied Informatics (IIAIAAI), 2012 IIAI International Conference on
Conference_Location :
Fukuoka
Print_ISBN :
978-1-4673-2719-0
Type :
conf
DOI :
10.1109/IIAI-AAI.2012.57
Filename :
6337197
Link To Document :
بازگشت