• DocumentCode
    39530
  • Title

    How to Conduct Distributed IncompletePattern Matching

  • Author

    Siyuan Liu ; Lei Kang ; Lei Chen ; Ni, Lionel

  • Author_Institution
    Heinz Coll., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    25
  • Issue
    4
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    982
  • Lastpage
    992
  • Abstract
    In this paper, we first propose a very interesting and practical problem, pattern matching in a distributed mobile environment. Pattern matching is a well-known problem and extensive research has been conducted for performing effective and efficient search. However, previous proposed approaches assume that data are centrally stored, which is not the case in a mobile environment (e.g., mobile phone networks), where one person´s pattern could be separately stored in a number of different stations, and such a local pattern is incomplete compared with the global pattern. A simple solution to pattern matching over a mobile environment is to collect all the data distributed in base stations to a data center and conduct pattern matching at the data center afterwards. Clearly, such a simple solution will raise huge amount of communication traffic, which could cause the communication bottleneck brought by the limited wireless bandwidth to be even worse. Therefore, a communication efficient and search effective solution is necessary. In our work, we present a novel solution which is based on our well-designed weighted bloom filter (WBF), called, Distributed Incomplete pattern matching ( DI-matching), to find target patterns over a distributed mobile environment. Specifically, to save communication cost and ensure pattern matching in distributed incomplete patterns, we use WBF to encode a query pattern and disseminate the encoded data to each base station. Each base station conducts a local pattern search according to the received WBF. Only qualified IDs and corresponding weights in each base station are sent to the data center for aggregation and verification. Through non-trivial theoretical analysis and extensive empirical experiments on a real city-scale mobile networks data set, we demonstrate the effectiveness and efficiency of our proposed solutions.
  • Keywords
    computer centres; data structures; mobile computing; pattern matching; data center; distributed incomplete pattern matching; distributed mobile environment; global pattern; local pattern search; real city-scale mobile networks data set; weighted bloom filter; wireless bandwidth; Base stations; Distributed databases; Mobile communication; Mobile handsets; Pattern matching; Search problems; Time series analysis; Incomplete pattern matching; distributed mobile environment; time series; weighted bloom filter;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2013.128
  • Filename
    6509884