DocumentCode :
1338483
Title :
Harvesting and Summarizing User-Generated Content for Advanced Speech-Based HCI
Author :
Jingjing Liu ; Seneff, S. ; Zue, V.
Author_Institution :
Comput. Sci. & Artificial Intell. Lab., Massachusetts Inst. of Technol., Cambridge, MA, USA
Volume :
6
Issue :
8
fYear :
2012
Firstpage :
982
Lastpage :
992
Abstract :
There are many Web-based platforms where people could share user-generated content such as reviews, posts, blogs, and tweets. However, online communities and social networks are expanding so rapidly that it is impossible for people to digest all the information. To help users obtain information more efficiently, both the interface for data access and the information representation need to be improved. An intuitive and personalized interface, such as a dialogue system, could be an ideal assistant, which engages a user in a continuous dialogue to garner the user's interest, assists the user via speech-navigated interactions, harvests and summarizes the Web data as well as presenting it in a natural way. This work, therefore, aims to conduct research on a universal framework for developing a speech-based interface that can aggregate user-generated content and present the summarized information via speech-based human-computer interactions. The challenge is two-fold. Firstly, how to interpret the semantics and sentiment of user-generated data and aggregate them into structured yet concise summaries? Secondly, how to develop a dialogue modeling mechanism to present the highlighted information via natural language? This work explores plausible approaches to tackling these challenges. We will investigate a parse-and-paraphrase paradigm and a sentiment scoring mechanism for information extraction from unstructured user-generated content. We will also explore sentiment-involved opinion summarization and dialogue modeling approaches for aggregated information representation. A restaurant-domain prototype system has been implemented for demonstration.
Keywords :
Internet; human computer interaction; programming language semantics; social networking (online); speech processing; Web-based platform; advanced speech-based HCI; aggregated information representation extraction; continuous dialogue system; data access interface; data aggregation; dialogue modeling mechanism; human-computer interaction; online community; parse-and-paraphrase paradigm; restaurant-domain prototype system; sentiment scoring mechanism; sentiment-involved opinion summarization; social network; speech-navigated interaction; user-generated content; Human computer interaction; Information representation; Information retrieval; Prototypes; Social network services; User interfaces; User-generated content; Spoken dialogue systems; user-generated content processing;
fLanguage :
English
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
Publisher :
IEEE
ISSN :
1932-4553
Type :
jour
DOI :
10.1109/JSTSP.2012.2229690
Filename :
6359745