Author_Institution :
Coll. of Inf. & Syst., Muroran Inst. of Technol., Muroran, Japan
Abstract :
Many researches on mining the Web, especially Social Networking Media such as web logs and microblogging sites which seem to store vast amounts of information about human societies, for knowledge about various phenomena and events in the physical world have been done actively, and Web applications with Web-mined knowledge have begun to be developed for the public. However, there is no detailed investigation on how accurately Web-mined data reflect real-world data. It must be problematic to idolatrously utilize the Web-mined data in public Web applications without ensuring their accuracy sufficiently. Therefore, this paper defines spatio-temporal Web Sensors by analyzing Twitter, Facebook, web logs, news sites, or the whole Web for a target natural phenomenon, and tries to validate the potential and reliability of the Web Sensors´ spatio-temporal data by measuring the coefficient correlation with Japanese weather, earthquake, and influenza statistics per week by region as real-world data.
Keywords :
data mining; sensors; social networking (online); Facebook; Japanese weather; Twitter; Web applications; Web logs; Web mining; Web-mined knowledge; coefficient correlation measurement; earthquake; influenza statistics; microblogging sites; news sites; real-world data; social network analysis; social networking media; spatio-temporal Web sensors; Aerospace electronics; Earthquakes; Rain; Sensor phenomena and characterization; Snow; Tablet computers; Web Sensor; Web credibility; Web mining; microblogging; social network analysis; spatio-temporal data mining;