• DocumentCode
    1816751
  • Title

    MONGOOSE: MONitoring Global Online Opinions via Semantic Extraction

  • Author

    Bhagwan, Varun ; Grandison, Tyrone ; Alba, Alfredo ; Gruhl, Daniel ; Pieper, Jan

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • fYear
    2009
  • fDate
    21-25 Sept. 2009
  • Firstpage
    214
  • Lastpage
    220
  • Abstract
    The ever increasing amount of content on the Internet has fostered many efforts seeking to leverage this potentially yottascale information source. Service systems using advanced data and text analytics techniques have been developed to perform knowledge gathering and information discovery over Web data. Information gathered from free and public sources on the Web is frequently integrated with enterprise and proprietary data to create sophisticated service systems able to provide insight in an increasing number of business critical areas. Unfortunately, for fixed and or limited resource projects, consistent and reliable ingestion and integration of content often dominates the effort, reducing the time available for developing core analytics and presentations that differentiate and define an information service. If this initial data extraction, translation and loading of information (known as ETL in the database world) can be abstracted for these web sources, it would provide an important core technology on which Web-based information services could be more rapidly and inexpensively developed and deployed. This paper presents such a system - MONGOOSE - an approach that seeks to reduce the time spent creating a reliable data ingest and integration system and thus reducing the time-to-impact of advanced analytics service solutions.
  • Keywords
    Internet; data analysis; data mining; globalisation; information resources; text analysis; Internet; MONGOOSE; Web data; World Wide Web; data analytics; global online opinions; information discovery; knowledge gathering; semantic extraction; service systems; text analytics; yottascale information source; Cloud computing; Control systems; Data analysis; Data mining; Databases; Information analysis; Internet; Monitoring; Performance analysis; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing, 2009. CLOUD '09. IEEE International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    978-1-4244-5199-9
  • Electronic_ISBN
    978-0-7695-3840-2
  • Type

    conf

  • DOI
    10.1109/CLOUD.2009.85
  • Filename
    5283898