• DocumentCode
    3172949
  • Title

    Data compression and query for large scale sensor data on COTS DBMS

  • Author

    Suei, Pei-Lun ; Kuo, Che-Wei ; Luoh, Ren-Shan ; Kuo, Tai-Wei ; Shih, Chi-Sheng ; Liang, Min-Siong

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2010
  • fDate
    13-16 Sept. 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Multi-dimensional temporal data set is the common format in sensor network applications to store sampled temporal data. As time goes on, the size of the core tables in the data set may increase to enormous size and the tables become not managable. In order to reduce storage space and allow on-line query, how to trade off data compression effectiveness for on-line query performance is a challenge issue. In this paper, we are concerned with an effective framework for temporal data set that does not scarify on-line query performance and is specifically designed for very large sensor network database. The sampled data are compressed using several candidate approaches including dictionary-base compress and lossless vector quantization. In the mean time, on-line queries are conducted without decompressing the compressed data set so as to enhance the query performance. Experiments are conducted on a power meter database and sonoma database to evaluate the proposed methodologies in terms of data compression rate and data query speed. The results show that the compression rate ranges from 70% for numerical data to 20% for character data. In the mean time, the increased overhead for online query is limited up to 2%.
  • Keywords
    dictionaries; query processing; software packages; telecommunication computing; temporal databases; vector quantisation; very large databases; wireless sensor networks; COTS DBMS; data compression; data query speed; dictionary-base compress; large scale sensor data; lossless vector quantization; multidimensional temporal data set; online query performance; power meter database; sampled temporal data; sensor network applications; sonoma database; storage space reduction; very large sensor network database; Data Compression; sensor network; sensor network applications;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Technologies and Factory Automation (ETFA), 2010 IEEE Conference on
  • Conference_Location
    Bilbao
  • ISSN
    1946-0740
  • Print_ISBN
    978-1-4244-6848-5
  • Type

    conf

  • DOI
    10.1109/ETFA.2010.5641312
  • Filename
    5641312