Deciding equivalences among conjunctive aggregate queries

Sara Cohen*, Werner Nutt, Yehoshua Sagiv

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

64 Scopus citations

Abstract

Equivalence of aggregate queries is investigated for the class of conjunctive queries with comparisons and the aggregate operators count, count-distinct, min, max, and sum. Essentially, this class contains unnested SQL queries with the above aggregate operators, with a where clause consisting of a conjunction of comparisons, and without a having clause. The comparisons are either interpreted over a domain with a dense order (like the rationals) or with a discrete order (like the integers). Characterizations of equivalence differ for the two cases. For queries with either max or min, equivalence is characterized in terms of dominance mappings, which can be viewed as a generalization of containment mappings. For queries with the count-distinct operator, a sufficient condition for equivalence is given in terms of equivalence of conjunctive queries under set semantics. For some special cases, it is shown that this condition is also necessary. For conjunctive queries with comparisons but without aggregation, equivalence under bag-set semantics is characterized in terms of isomorphism. This characterization essentially remains the same also for queries with the count operator. Moreover, this characterization also applies to queries with the sum operator if the queries have either constants or comparisons, but not both. In the general case (i.e., both comparisons and constants), the characterization of the equivalence of queries with the sum operator is more elaborate. All the characterizations given in the paper are decidable in polynomial space.

Original languageEnglish
Article number1219093
JournalJournal of the ACM
Volume54
Issue number2
DOIs
StatePublished - 1 Apr 2007
Externally publishedYes

Keywords

  • Aggregation
  • Bag-set semantics
  • Datalog
  • Query equivalence

Fingerprint

Dive into the research topics of 'Deciding equivalences among conjunctive aggregate queries'. Together they form a unique fingerprint.

Cite this