Title : 
Addressing data veracity in big data applications
         
        
            Author : 
Aman, Saima ; Chelmis, Charalampos ; Prasanna, Viktor
         
        
            Author_Institution : 
Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
         
        
            Abstract : 
Big data applications, such as those in smart electric grids, transportation, and remote environment monitoring, involve geographically dispersed sensors that periodically send information back to central nodes. In many cases, sensor data is not available at the central nodes at the frequency required for real-time modeling and decision-making. This may be due to physical limitations of the transmission networks, or to consumers limiting frequent transmission of data from sensors located on their premises because of security and privacy concerns. Such scenarios lead to the partial data problem and raise the issue of data veracity in big data applications. We describe a novel solution to the problem of making short-term predictions (up to a few hours ahead) in the absence of real-time data from sensors in the Smart Grid. A key implication of our work is that, by using real-time data from only a small subset of influential sensors, we are able to make predictions for all sensors. We thus reduce the communication complexity involved in transmitting sensor data in Smart Grids. We use real-world electricity consumption data from smart meters to empirically demonstrate the usefulness of our method. Our dataset consists of data collected at 15-min intervals from 170 smart meters in the USC Microgrid over 7 years, totaling 41,697,600 data points.
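
The dataset size quoted in the abstract can be checked with a short calculation. The sketch below is illustrative only: it assumes 365-day years (leap days ignored) and no missing readings, and the function name expected_data_points is hypothetical, not taken from the paper.

```python
def expected_data_points(meters, years, interval_minutes=15, days_per_year=365):
    """Count readings for a fleet of meters sampled at a fixed interval.

    Assumption (not stated in the abstract): every year has 365 days
    and no readings are missing.
    """
    readings_per_day = (24 * 60) // interval_minutes  # 96 readings at 15 min
    return meters * years * days_per_year * readings_per_day


# 170 meters sampled every 15 minutes for 7 years
total = expected_data_points(meters=170, years=7)
print(f"{total:,}")  # 41,697,600
```

Under these assumptions the result matches the 41,697,600 data points reported in the abstract.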
         
        
            Keywords : 
Big Data; power engineering computing; smart power grids; Big Data applications; USC Microgrid; communication complexity; data veracity; electricity consumption data; remote environment monitoring; sensory data transmission; smart electric grids; transportation; Data models; Intelligent sensors; Predictive models; Real-time systems; Smart meters; prediction model; smart grid
         
        
            Conference_Title : 
2014 IEEE International Conference on Big Data (Big Data)
         
        
            Conference_Location : 
Washington, DC
         
        
            DOI : 
10.1109/BigData.2014.7004473