Title :
Personalized trip planning by integrating multimodal user-generated content
Author :
Patri, Om P. ; Singh, Ketan ; Szekely, Pedro ; Panangadan, Anand V. ; Prasanna, Viktor K.
Author_Institution :
Univ. of Southern California, Los Angeles, CA, USA
Abstract :
We address the problem of record linkage and semantic integration in the context of large collections of user-generated content. These datasets are often large since they contain the contributions of millions of Internet users. We present an approach based on approximate string matching between the metadata associated with such data. The discovered linkages are stored in an ontology for answering queries on the integrated data sources. We demonstrate this approach in Photo Odyssey, an interactive web application which integrates multimodal content from image hosting and travel websites to create a user interface with a graphical trip plan and personalization options. We discuss several practical challenges faced in building such an application: integrating and mining large-scale multimodal user-generated data, resolving semantic heterogeneity, and applying machine learning to match and rank items. Photo Odyssey operates in an online manner without using any previously stored knowledge base. We also describe methods to compute the relevance of images, remove bad data instances and duplicates, perform contextual filtering, and assign a category to uncatalogued images; together, these methods enable an interactive application even on Big Data with real-world characteristics.
Keywords :
Internet; Web sites; content management; humanities; information filtering; interactive systems; meta data; ontologies (artificial intelligence); query processing; string matching; Big Data; Internet; Photo Odyssey; approximate string matching; bad data instances; contextual filtering; image hosting; integrated data sources; interactive Web application; large-scale multimodal user-generated data; machine learning; metadata; multimodal content integration; multimodal user-generated content; ontology; personalized trip planning; query answering; record linkage; semantic heterogeneity; semantic integration; travel Websites; user interface; ISO; ISO standards; Matched filters; Ontologies; TV; Information Integration; Metadata; Ontology; Photography; Semantic Heterogeneity;
Conference_Title :
Semantic Computing (ICSC), 2015 IEEE International Conference on
Conference_Location :
Anaheim, CA
DOI :
10.1109/ICOSC.2015.7050837