DocumentCode
2582045
Title
Extracting Partition Statistics from Semistructured Data
Author
Wilson, John N. ; Gourlay, Richard ; Japp, Robert ; Neumüller, Mathias
Author_Institution
Dept. of Comput. & Inf. Sci., Strathclyde Univ., Glasgow
fYear
0
fDate
0-0 0
Firstpage
497
Lastpage
501
Abstract
The effective grouping, or partitioning, of semistructured data is of fundamental importance when providing support for queries. Partitions allow items within the data set that share common structural properties to be identified efficiently. This allows queries that make use of these properties, such as branching path expressions, to be accelerated. Here, we evaluate the effectiveness of several partitioning techniques by establishing the number of partitions that each scheme can identify over a given data set. In particular, we explore the use of parameterised indexes, based upon the notion of forward and backward bisimilarity, as a means of partitioning semistructured data; demonstrating that even restricted instances of such indexes can be used to identify the majority of relevant partitions in the data
Keywords
data structures; query processing; backward bisimilarity; branching path expressions; forward bisimilarity; parameterised indexes; partition statistics extraction; semistructured data partitioning; Acceleration; Data mining; Databases; Expert systems; Indexing; Query processing; Statistics; Terminology; Tree graphs; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications, 2006. DEXA '06. 17th International Workshop on
Conference_Location
Krakow
ISSN
1529-4188
Print_ISBN
0-7695-2641-1
Type
conf
DOI
10.1109/DEXA.2006.59
Filename
1698393
Link To Document