On supporting containment queries in relational database management systems

Abstract

Virtually all proposals for querying XML include a class of query we term "containment queries". It is also clear that in the foreseeable future, a substantial amount of XML data will be stored in relational database systems. This raises the question of how to support these containment queries. The inverted list technology that underlies much of Information Retrieval is well-suited to these queries, but should we implement this technology (a) in a separate loosely-coupled IR engine, or (b) using the native tables and query execution machinery of the RDBMS? With option (b), more than twenty years of work on RDBMS query optimization, query execution, scalability, and concurrency control and recovery immediately extend to the queries and structures that implement these new operations. But all this will be irrelevant if the performance of option (b) lags that of (a) by too much. In this paper, we explore some performance implications of both options using native implementations in two commercial relational database systems and in a special purpose inverted list engine. Our performance study shows that while RDBMSs are generally poorly suited for such queries, under certain conditions they can outperform an inverted list engine. Our analysis further identifies two significant causes that differentiate the performance of the IR and RDBMS implementations: the join algorithms employed and the hardware cache utilization. Our results suggest that contrary to most expectations, with some modifications, a native implementation in an RDBMS can support this class of query much more efficiently.
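To make option (b) concrete, the following is a minimal sketch of how an inverted list could be stored in ordinary relational tables and how a containment query then becomes a join with range predicates. The abstract does not specify a schema, so the table names (`elements`, `texts`), the column names, the position-numbering scheme, and the two toy documents are all assumptions made for illustration. SQLite stands in for the commercial systems used in the study, so nothing here reflects the measured performance.

```python
import sqlite3


def build_index(conn):
    """Create two hypothetical inverted-list tables and load two toy documents.

    Assumed encoding: every tag and word in a document gets a running position.
    An element occurrence is stored as (term, docno, begin_pos, end_pos);
    a word occurrence is stored as (term, docno, wordno).
    """
    conn.executescript(
        """
        CREATE TABLE elements (term TEXT, docno INTEGER,
                               begin_pos INTEGER, end_pos INTEGER);
        CREATE TABLE texts    (term TEXT, docno INTEGER, wordno INTEGER);
        """
    )
    # doc 1: <section>1 <title>2 XML 3 </title>4 storage 5 </section>6
    # doc 2: <section>1 <title>2 joins 3 </title>4 XML 5 </section>6
    conn.executemany(
        "INSERT INTO elements VALUES (?, ?, ?, ?)",
        [
            ("section", 1, 1, 6), ("title", 1, 2, 4),
            ("section", 2, 1, 6), ("title", 2, 2, 4),
        ],
    )
    conn.executemany(
        "INSERT INTO texts VALUES (?, ?, ?)",
        [("XML", 1, 3), ("storage", 1, 5), ("joins", 2, 3), ("XML", 2, 5)],
    )


def docs_where_element_contains_word(conn, element, word):
    """Return sorted docnos in which some `element` region contains `word`.

    Containment is expressed as a join on docno plus an interval test:
    the word position must fall strictly inside the element's region.
    """
    rows = conn.execute(
        """
        SELECT DISTINCT e.docno
        FROM elements AS e
        JOIN texts AS t
          ON e.docno = t.docno
         AND e.begin_pos < t.wordno
         AND t.wordno < e.end_pos
        WHERE e.term = ? AND t.term = ?
        ORDER BY e.docno
        """,
        (element, word),
    ).fetchall()
    return [r[0] for r in rows]


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_index(conn)
    print(docs_where_element_contains_word(conn, "title", "XML"))
    print(docs_where_element_contains_word(conn, "section", "XML"))
```

In this sketch the join condition mixes an equality on `docno` with inequalities on positions. That is the kind of predicate for which the choice of join algorithm matters, which is one of the two performance factors the abstract highlights.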

Zhang, C., Naughton, J., DeWitt, D., Luo, Q., & Lohman, G. (2001). On supporting containment queries in relational database management systems. SIGMOD Record (ACM Special Interest Group on Management of Data), 30(2), 425–436. https://doi.org/10.1145/376284.375722