Title : 
Scheduling Updates in a Real-Time Stream Warehouse
         
        
            Author : 
Golab, Lukasz ; Johnson, Theodore ; Shkapenyuk, Vladislav
         
        
            Author_Institution : 
AT&TLabs - Res., Florham Park, NJ
         
        
        
            fDate : 
March 29 2009-April 2 2009
         
        
        
        
            Abstract : 
This paper discusses updating a data warehouse that collects near-real-time data streams from a variety of external sources. The objective is to keep all the tables and materialized views up-to-date as new data arrive over time. We define the notion of data staleness, formalize the problem of scheduling updates in a way that minimizes average data staleness, and present scheduling algorithms designed to handle the complex environment of a real-time stream warehouse. A novel feature of our scheduling framework is that it considers the effect of an update on the staleness of the underlying tables rather than any property of the update job itself (such as deadline).
         
        
            Keywords : 
data handling; data warehouses; real-time systems; scheduling; data staleness; data warehouse; near-real-time data stream; scheduling algorithm; Algorithm design and analysis; Credit cards; Current measurement; Data engineering; Data warehouses; IP networks; Monitoring; Scheduling algorithm; Time measurement; USA Councils;
         
        
        
        
            Conference_Titel : 
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
         
        
            Conference_Location : 
Shanghai
         
        
        
            Print_ISBN : 
978-1-4244-3422-0
         
        
            Electronic_ISBN : 
1084-4627
         
        
        
            DOI : 
10.1109/ICDE.2009.202