• DocumentCode
    2877397
  • Title

    On the Subset Matching

  • Author

    Chen, Yangjun

  • Author_Institution
    University of Winnipeg, Canada
  • fYear
    2006
  • fDate
    38869
  • Firstpage
    9
  • Lastpage
    9
  • Abstract
    The subset matching problem is to find all occurrences of a pattern string p of length m in a text string t of length n, where each pattern and text location is a set of characters drawn from some alphabet. The pattern is said to occur at text position i if the set p[j] is a subset of the set t[i + j - 1], for all j (1 leqslant j leqslant m). This is an extension of the classic string matching and can be used for finding matching subtree patterns. In this paper, we propose a new algorithm and conduct a probabilistic analysis of its performance, which shows that we need only O(n + m cdot n^{0.5} ) time to solve the problem on average.
  • Keywords
    Algorithm design and analysis; Character generation; Computer science; Data engineering; Image processing; Information retrieval; Pattern analysis; Pattern matching; Performance analysis; Tree data structures; probabilistic analysis.; signature trees; signatures; string matching; subset matching; tree pattern matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web-Age Information Management Workshops, 2006. WAIM '06. Seventh International Conference on
  • Conference_Location
    Hong Kong, China
  • Print_ISBN
    0-7695-2705-1
  • Type

    conf

  • DOI
    10.1109/WAIMW.2006.21
  • Filename
    4027169