• DocumentCode
    18337
  • Title

    A Bayesian Inference-Based Framework for RFID Data Cleansing

  • Author

    Wei-Shinn Ku ; Haiquan Chen ; Haixun Wang ; Min-Te Sun

  • Author_Institution
    Dept. of Comput. Sci. & Software Eng., Auburn Univ., Auburn, AL, USA
  • Volume
    25
  • Issue
    10
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    2177
  • Lastpage
    2191
  • Abstract
    The past few years have witnessed the emergence of an increasing number of applications for tracking and tracing based on radio frequency identification (RFID) technologies. However, raw RFID readings are usually of low quality and may contain numerous anomalies. An ideal solution for RFID data cleansing should address the following issues. First, in many applications, duplicate readings of the same object are very common. The solution should take advantage of the resulting data redundancy for data cleaning. Second, prior knowledge about the environment may help improve data quality, and a desired solution must be able to take into account such knowledge. Third, the solution should take advantage of physical constraints in target applications to elevate the accuracy of data cleansing. There are several existing RFID data cleansing techniques. However, none of them support all the aforementioned features. In this paper, we propose a Bayesian inference-based framework for cleaning RFID raw data. We first design an n-state detection model and formally prove that the three-state model can maximize the system performance. Then, we extend the n-state model to support two-dimensional RFID reader arrays and compute the likelihood efficiently. In addition, we devise a Metropolis-Hastings sampler with constraints, which incorporates constraint management to clean RFID data with high efficiency and accuracy. Moreover, to support real-time object monitoring, we present the streaming Bayesian inference method to cope with realtime RFID data streams. Finally, we evaluate the performance of our solutions through extensive experiments.
  • Keywords
    belief networks; data analysis; inference mechanisms; radiofrequency identification; redundancy; Bayesian inference-based framework; Metropolis-Hastings sampler; RFID raw data cleansing accuracy elevation; constraint management; data quality improvement; data redundancy; n-state detection model design; performance evaluation; physical constraints; radio frequency identification technologies; raw RFID readings; reading duplication; real-time RFID data streams; real-time object monitoring; streaming Bayesian inference method; system performance maximization; target applications; three-state model; two-dimensional RFID reader arrays; Accuracy; Bayesian methods; Computational modeling; Equations; Mathematical model; Radiofrequency identification; Redundancy; Accuracy; Bayesian methods; Computational modeling; Data cleaning; Equations; Mathematical model; Radiofrequency identification; Redundancy; probabilistic algorithms; spatiotemporal databases; uncertainty;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.116
  • Filename
    6216377