0% found this document useful (0 votes)

31 views9 pages

(Slide) Containment Conjunctive Queries

Slide de Containment Conjunctive Queries

Uploaded by

thbinhqn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views9 pages

(Slide) Containment Conjunctive Queries

Slide de Containment Conjunctive Queries

Uploaded by

thbinhqn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Conjunctive Queries

= safe, Datalog rules:

H :- G1 & & Gn

Most common form of query; equivalent to

select-project-join queries.
Useful for optimization of active elements,
e.g., checking distributed constraints,
maintaining materialized views.)
Useful for information integration.

Applying a CQ to a Database

If Q is a CQ, and D is a database of EDB facts,

then Q(D) is the set of heads of Q that we get
when we:
Substitute constants for variables in the body
of Q in all possible ways.
Require all subgoals to become true.

Example
p(X; Y ) : , q(X; Z ) & q(Z; Y )

EDB = fq(1; 2); q(2; 3); q(3; 4)g.

Only substitutions that make subgoals both

true:
1. X ! 1; Y ! 3; Z ! 2.
2. X ! 2; Y ! 4; Z ! 3.
Yield heads p(1; 3) and p(2; 4).

Containment
Q1 Q2 i for every database D, Q1(D) Q2(D).
Containment problem is NP-complete, but

not a \hard" problem in practical situations

(short queries, few pairs of subgoals with same
predicate).
Function symbols do not make problems more
dicult.
Adding negated subgoals and/or arithmetic
subgoals, e.g., X < Y , makes things more
complex, but important special cases.

Example
1

A: p(X,Y) :- r(X,W) & b(W,Z) & r(Z,Y)

B : p(X,Y) :- r(X,W) & b(W,W) & r(W,Y)

Claim: B A.
In proof, suppose p(x; y) is in B (D). Then
there is some w such that r(x; w), b(w; w), and
r(w; y) are in D.
In A, make the substitution X ! x, Y ! y,
W ! w, Z ! w.
Thus, the head of A becomes p(x; y), and all
subgoals of A are in D.
Thus, p(x; y) is also in A(D), proving B A.

Testing Containment of CQ's

1. Containment mappings.
2. Canonical databases.
Similar for basic CQ case, but (2) is useful for
more general cases like negated subgoals.

Containment Mappings

Mapping from variables of CQ Q2 to variables of

CQ Q1 such that
1. Head of Q2 becomes head of Q1 .
2. Each subgoal of Q2 becomes some subgoal of
Q1.
It is not necessary that every subgoal of
Q1 is the target of some subgoal of Q2.

Example

A, B as above:
A: p(X,Y) :- r(X,W) & b(W,Z) & r(Z,Y)
B : p(X,Y) :- r(X,W) & b(W,W) & r(W,Y)

Containment mapping from A to B : X ! X ,

Y ! Y , W ! W, Z ! W.
No containment mapping from B to A.
Subgoal b(W; W ) in B can only go to b(W; Z )
in A. That would require both W ! W and
W ! Z.

Example
C1: p(X) :- a(X,Y) & a(Y,Z) & a(Z,W)
C2: p(X) :- a(X,Y) & a(Y,X)
2

Containment mapping from C1 to C2. X !

X, Y ! Y , Z ! X, W ! Y .
No containment mapping from C2 to C1.

Proof:
a) X ! X required for head.
b) Thus, rst subgoal of C2 must map to
rst subgoal of C1; Y must map to Y .
c) Similarly, 2nd subgoal of C2 must map to
2nd subgoal of C1, so X must map to Z .
d) But we already found X maps to X .

Containment Mapping Theorem

Q1 Q2 i there exists a containment mapping
from Q2 to Q1 .

Proof (If)
Let : Q2 ! Q1 be a containment mapping. Let D
be any DB.
Every tuple t in Q1(D) is produced by some
substitution on the variables of Q1 that
makes Q1's subgoals all become facts in D.
Claim: is a substitution for variables of
Q2 that produces t.
1. (Fi) = (some Gj ). Therefore, it is
in D.
2. (H2 ) = (H1 ) = t.
Thus, every t in Q1(D) is also in Q2 (D); i.e.,
Q1 Q2.

Proof (Only If)

Key idea: frozen CQ.

1. Create a unique constant for each variable of
the CQ Q.
2. Frozen Q is a database consisting of all the
subgoals of Q, with the chosen constants
substituted for variables.

Example
p(X) :- a(X,Y) & a(Y,Z) & a(Z,W)

Let x be the constant for X , etc. The relation

for predicate a consists of the three tuples (x; y),
(y; z ), and (z; w).
3

Proof (Only If) Continued

Let Q1 Q2 . Let database D be the frozen Q1 .
Q1(D) contains t, the \frozen" head of Q1

Sounds gruesome, but the reason is that

we can use the substitution in which
each variable of Q1 is replaced by its
corresponding constant.
Since Q1 Q2 , Q2(D) must also contain t.
Let be the substitution of constants from
D for the variables of Q2 that makes each
subgoal of Q2 a tuple of D and yields t as the
head.
Let be the substitution that maps constants
of D to their unique, corresponding variable of
Q1.

Q2:

Q1:

E :- F1 & Fm (X; Y )
t

H :- G1 & & Gi(A; B ) &

is a containment mapping from Q2 to Q1

because:
a) The head of Q2 is mapped by to t, and
t is the frozen head of Q1, so maps
the head of Q2 to the \unfrozen" t, that
is, the head of Q1 .
b) Each subgoal Fi of Q2 is mapped by to
some tuple of D, which is a frozen version
of some subgoal Gj of Q1. Then
maps Fi to the unfrozen tuple, that is, to
Gj itself.

Dual View of Containment Mappings

A containment mapping, dened as a mapping on

variables, induces a mapping on subgoals.
Therefore, we can alternatively dene a
containment mapping as a function on
subgoals, thus inducing a mapping on
variables.
The containment mapping condition becomes:
the subgoal mapping does not cause a variable
to be mapped to two dierent variables or
4

constants, nor cause a constant to be mapped

to a variable or a constant other than itself.

Example
Again consider
A: p(X,Y) :- r(X,W) & b(W,Z) & r(Z,Y)
B : p(X,Y) :- r(X,W) & b(W,W) & r(W,Y)

Previously, we found the containment

mapping X ! X , Y ! Y , W ! W , Z ! W
from A to B .
We could as well describe this mapping as
r(X; W ) ! r(X; W ), b(W; Z ) ! b(W; W ),
and r(Z; Y ) ! r(W; Y ).

Method of Canonical Databases

Instead of looking for a containment mapping from
Q2 to Q1 in order to test Q1 Q2, we can apply
the following test:
1. Create a canonical database D that is the
frozen body of Q1.
2. Compute Q2(D).
3. If Q2(D) contains the frozen head of Q1, then
Q1 Q2; else not.
The proof that this method works is
essentially the same as the argument for
containment mappings:
The only way the frozen head of Q1
can be in Q2 (D) is for there to be a
containment mapping Q2 ! Q1 .

Example
C1: p(X) :- a(X,Y) & a(Y,Z) & a(Z,W)
C2: p(X) :- a(X,Y) & a(Y,X)
Here is the test for C2 C1 :
Choose constants X ! 0, Y ! 1.
Canonical DB from C1 is
D = fa(0; 1); a(1; 0)g

C1(D) = fp(0); p(1)g.

Since the frozen head of C2 is p(0), which is in

C1(D), we conclude C2 C1.
Note that the instantiation of C1 that
shows p(0) is in C1(D) is X ! 0, Y ! 1,
Z ! 0, and W ! 1.

If we replace 0 and 1 by the variables

X and Y they stand for, we have the
containment mapping from C1 to C2 .

Saraiya's Containment Test

Containment of CQ's is NP-complete in

general.
Sariaya's algorithm is a polynomial-time test
of Q1 Q2 for the common case that no
predicate appears more than twice among the
subgoals of Q1.
They can appear any number of times in
Q2.
The algorithm is a reduction to 2SAT and
yields a linear-time algorithm.
Our algorithm is more direct, but quadratic.

The Algorithm

Pick a subgoal of Q2, and consider the

consequences of mapping it to the two possible
subgoals of Q1.
Follow all consequences of this choice:
subgoals that must map to subgoals, and
variables that must map to variables.
If we know p(X1 ; : : :; Xn) must map to
p(Y1 ; : : :; Yn), then infer that each Xi
must map to Yi .
If p(X1 ; : : :; Xn ) is a subgoal of Q2, and
we know Xi maps to some variable Z ,
and exactly one of the p-subgoals of
Q1 has Z in the ith component, then
conclude p(X1 ; : : :; Xn) maps to this
subgoal.
One of two things must happen:
1. We derive a contradiction: a subgoal or
variable that must map to two dierent
things.
If so, try the other choice if there is one;
fail if there is no other choice.
6

2. We close the set of inferences we must make.

Then we can forever forget about the
question of how to map the determined
subgoals and variables.
We have found one mapping that works
and that can't interfere with the mapping
of any other subgoals or variables, so we
make another arbitrary choice if there are
any unmapped subgoals.

Example

Let us test C1 C2 , where:

C1: p(B) :- a(A,B) & a(B,A) & b(A,C) & b(C,B)
C2: p(X) :- a(X,Y) & b(Y,Z) & b(Z,W) & a(W,X)

Note this simple example omits some options:

C1 could have a predicate appearing only once
in the body, and C2 could have 3 or more
occurrences of some predicates.
Here is a description of inferences that might
be made:
(1) Suppose a(X; Y ) ! a(A; B )
(2)
Then X ! A, Y ! B
(3)
Now, b(Y; Z ) ! b(B; ?)
(4)
Since there is no b(B; ?), fail
(5) Thus, we must map a(X; Y ) ! a(B; A)
(6)
Then X ! B and Y ! A,
(7)
b(Y; Z ) ! b(A; C ), Z ! C ,
(8)
b(Z; W ) ! b(C; B ), W ! B
(9)
Now, a(W; X ) must map to a(B; B )
(10)
Since a(B; B ) does not exist, fail

Note, however, that if the last subgoal of C1

were b(C; A), we would have W ! A at
line (8) and a(W; X ) ! a(A; B ) at line (9).

That completes the containment mapping

successfully, with X ! B , Y ! A, Z !
C , and W ! A.

Generalization to Unions of CQ's

P1 [ P2 [ [ Pk Q1 [ Q2 [ [ Qn i for all
Pi there exists some Qj such that Pi Qj .
Proof (If)
Obvious.
7

Proof (Only If)

Assume the containment holds.

Let D be the canonical (frozen) database from
CQ Pi .
Since the containment holds, and Pi (D) surely
includes the frozen head of Pi, there must be
some Qj such that Qj (D) includes the frozen
head of Pi .
Thus, Pi Qj .

Union Theorem Just Misses Being False

Consider generalized CQ's allowing arithmeticcomparison subgoals.

P1: p(X) :- e(X) & 10 <= X & X <= 20
Q1: p(X) :- e(X) & 10 <= X & X <= 15
Q2: p(X) :- e(X) & 15 <= X & X <= 20

P1 Q1 [ Q2, but P1 Q1 and P1 Q2 are

both false.

CQ Contained in Recursive Datalog

Test relies on method of canonical DB's;

containment mapping approach doesn't work (it's
meaningless).
Make DB D from frozen body of CQ.
Apply program to D. If frozen head of CQ
appears in result, then yes (contained), else
no.

Example
CQ Q1 is:

Q1: path(X,Y) :- arc(X,Z) &

arc(Z,W) & arc(W,Y)

Q2 is the value of path in the following

recursive Datalog program:

r1: path(X,Y) :- arc(X,Y)
r2: path(X,Y) :- path(X,Z) & path(Z,Y)

Intuitively, Q1 = paths of length 3; Q2 =

paths of length 1 or more.

Freeze Q1, say with 0, 1, 2, 3 as constants for
X , Z , W , Y , respectively.
D = farc(0; 1); arc(1; 2); arc(2; 3)g
8

Frozen head is path(0; 3).

Easy to infer that path(0; 3) is in Q2(D) |
use r1 three times to infer path(0; 1),
path(1; 2), path(2; 3), then use r2 to infer
path(0; 2), path(0; 3).

Harder Cases
Datalog program CQ: doubly exponential

complexity. Reference: Chaudhuri, S. and

M. Y. Vardi [1992]. \On the equivalence of
datalog programs," Proc. Eleventh ACM
Symposium on Principles of Database
Systems, pp. 55{66.
Datalog program Datalog program:
undecidable.

Eapp Module 1
100% (1)
Eapp Module 1
29 pages
Wenzhong Shi, Peter Fisher, Michael F. Goodchild - Spatial Data Quality-CRC Press (2002)
No ratings yet
Wenzhong Shi, Peter Fisher, Michael F. Goodchild - Spatial Data Quality-CRC Press (2002)
354 pages
NMTC-2022 - Previous Year Question Papers For Class 5 and 6
0% (1)
NMTC-2022 - Previous Year Question Papers For Class 5 and 6
10 pages
Chapter 6: Query Decomposition and Data Localization
0% (2)
Chapter 6: Query Decomposition and Data Localization
26 pages
Heidenhain FK-Programming TNC 530i
100% (1)
Heidenhain FK-Programming TNC 530i
83 pages
Class-X MCQS
No ratings yet
Class-X MCQS
8 pages
General Mathematics: Inverse Function & Its Graph
No ratings yet
General Mathematics: Inverse Function & Its Graph
21 pages
Fractions Improper1 PDF
100% (1)
Fractions Improper1 PDF
2 pages
4MA1 1HR Que 20220111
100% (1)
4MA1 1HR Que 20220111
32 pages
Jan 2022 M1 MS
No ratings yet
Jan 2022 M1 MS
15 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
63 pages
Query Processing
No ratings yet
Query Processing
28 pages
Numerical Methods For The Simulation of Chemical Engineering Processes
No ratings yet
Numerical Methods For The Simulation of Chemical Engineering Processes
14 pages
Query Optimization
No ratings yet
Query Optimization
103 pages
Math LF
No ratings yet
Math LF
169 pages
Lesson04 PDF
No ratings yet
Lesson04 PDF
51 pages
HK - F3 Math 08-09 Final Exam Paper I - QN & Ans
100% (1)
HK - F3 Math 08-09 Final Exam Paper I - QN & Ans
12 pages
Query Execution
No ratings yet
Query Execution
87 pages
April2015 PDF
No ratings yet
April2015 PDF
59 pages
L15-16 (Query Decomposition) PDF
No ratings yet
L15-16 (Query Decomposition) PDF
57 pages
Query Processing
0% (1)
Query Processing
15 pages
Relational Algebra: R & G, Chapter 4
No ratings yet
Relational Algebra: R & G, Chapter 4
27 pages
CS240A: Databases and Knowledge Bases
No ratings yet
CS240A: Databases and Knowledge Bases
26 pages
Graph Inequalities
No ratings yet
Graph Inequalities
37 pages
DBMS Relational Algebra
No ratings yet
DBMS Relational Algebra
48 pages
Lecture09 Optimization Structural
No ratings yet
Lecture09 Optimization Structural
27 pages
RBBMS
No ratings yet
RBBMS
17 pages
A.2 Linear Summation Queries With - Bit Answers
No ratings yet
A.2 Linear Summation Queries With - Bit Answers
21 pages
DBMS Last Minute Notes
No ratings yet
DBMS Last Minute Notes
7 pages
Information Integration Using Logical Views
No ratings yet
Information Integration Using Logical Views
22 pages
Indian Statistical Institute: PGDBA, First Year, Final of First Semester Examination, 2019-20
No ratings yet
Indian Statistical Institute: PGDBA, First Year, Final of First Semester Examination, 2019-20
3 pages
Lesson Plan Template MAED 3224: Ccss - Math.Content.2.Nbt.B.7
No ratings yet
Lesson Plan Template MAED 3224: Ccss - Math.Content.2.Nbt.B.7
6 pages
Datalog Query Language: Xi XJ
No ratings yet
Datalog Query Language: Xi XJ
4 pages
Relational Algebra: Module 3, Lecture 1
No ratings yet
Relational Algebra: Module 3, Lecture 1
20 pages
Introduction To Standard Query Language: Erik Zeitler Udbl Erik - Zeitler@it - Uu.se
No ratings yet
Introduction To Standard Query Language: Erik Zeitler Udbl Erik - Zeitler@it - Uu.se
42 pages
Relational Query Languages
No ratings yet
Relational Query Languages
42 pages
The Analysis of Runge Phenomenon
No ratings yet
The Analysis of Runge Phenomenon
29 pages
Finite Difference Method
100% (1)
Finite Difference Method
16 pages
Queries and Query Languages: Passive. That Is, They Do Not Modify The
No ratings yet
Queries and Query Languages: Passive. That Is, They Do Not Modify The
27 pages
Lecture 11 SQ Li
No ratings yet
Lecture 11 SQ Li
58 pages
Rozenshtein, David - Bondur, Tom (Editor) - Essence of SQL - A Guide To Learning The Most SQL in The Least Amount of Time-Coriolis Group Books - Peer To Peer Communications (1998)
No ratings yet
Rozenshtein, David - Bondur, Tom (Editor) - Essence of SQL - A Guide To Learning The Most SQL in The Least Amount of Time-Coriolis Group Books - Peer To Peer Communications (1998)
136 pages
Project Num Boost
No ratings yet
Project Num Boost
5 pages
WGU C170 Data Management
No ratings yet
WGU C170 Data Management
5 pages
2019 Spring Final Sol
No ratings yet
2019 Spring Final Sol
19 pages
Line It Up Understood
No ratings yet
Line It Up Understood
4 pages
Introduction To Computer Science CHAPTER 3: Pseudocode
No ratings yet
Introduction To Computer Science CHAPTER 3: Pseudocode
29 pages
CS143 Notes: Relational Algebra: Book Chapters
No ratings yet
CS143 Notes: Relational Algebra: Book Chapters
9 pages
EXERCISE Coordinate
No ratings yet
EXERCISE Coordinate
2 pages
Unit-2 - Relational Database Concepts
No ratings yet
Unit-2 - Relational Database Concepts
55 pages
Part I - Eigenvalue Problem
No ratings yet
Part I - Eigenvalue Problem
15 pages
Ch-2 (B) Overview of Query Processing
No ratings yet
Ch-2 (B) Overview of Query Processing
73 pages
DBMS 2019
No ratings yet
DBMS 2019
4 pages
12 Datalog
No ratings yet
12 Datalog
4 pages
Chapter 6-Relearional Algebra and Calcules (Autosaved)
No ratings yet
Chapter 6-Relearional Algebra and Calcules (Autosaved)
31 pages
Lecture 17
No ratings yet
Lecture 17
52 pages
ADB Chapter 2
No ratings yet
ADB Chapter 2
40 pages
Chapter 1 Query Processing and Optimization
No ratings yet
Chapter 1 Query Processing and Optimization
108 pages
ADBS - Chapter Two
No ratings yet
ADBS - Chapter Two
41 pages
L - Other Relational Query Languages
No ratings yet
L - Other Relational Query Languages
49 pages
DBMS End Term
No ratings yet
DBMS End Term
27 pages
Chapter 2-Query Processing and Optimi
No ratings yet
Chapter 2-Query Processing and Optimi
43 pages
1.10 Taylor and Maclaurin Series
No ratings yet
1.10 Taylor and Maclaurin Series
12 pages
1st Q Math Exam
No ratings yet
1st Q Math Exam
3 pages
Grade 8 Itmc2019
No ratings yet
Grade 8 Itmc2019
11 pages
Non-Restoring Division Algorithm
100% (1)
Non-Restoring Division Algorithm
4 pages
Agriculture 13 01757 v2
No ratings yet
Agriculture 13 01757 v2
32 pages
Phases of QP
No ratings yet
Phases of QP
6 pages
On An Irreducibility Theorem of A Cohn
No ratings yet
On An Irreducibility Theorem of A Cohn
5 pages
DBMS - Unit 3 1
No ratings yet
DBMS - Unit 3 1
17 pages
Adbms Unit2
No ratings yet
Adbms Unit2
20 pages
Grade 5 - WW3 - Math
No ratings yet
Grade 5 - WW3 - Math
3 pages
Lec3-14 Review
No ratings yet
Lec3-14 Review
28 pages
Class 5
No ratings yet
Class 5
10 pages
Year 2 Maths Addition and Subtraction
No ratings yet
Year 2 Maths Addition and Subtraction
4 pages
Assignment Dataintegration 300341285 Gurdarshan Singh
No ratings yet
Assignment Dataintegration 300341285 Gurdarshan Singh
7 pages
Syllabus For Entrance Exam - 2024-25
No ratings yet
Syllabus For Entrance Exam - 2024-25
6 pages
cs317 s2022 Midsem
No ratings yet
cs317 s2022 Midsem
7 pages
2023W2 Midterm
No ratings yet
2023W2 Midterm
12 pages
MATHEMATICS
No ratings yet
MATHEMATICS
30 pages
DE Module5 QueryOptimization
No ratings yet
DE Module5 QueryOptimization
11 pages
Black Worksheets
No ratings yet
Black Worksheets
160 pages
Mcs 23
No ratings yet
Mcs 23
7 pages
DDBS Unit 2
No ratings yet
DDBS Unit 2
7 pages
Chapter 2-Query Processing - 110554
No ratings yet
Chapter 2-Query Processing - 110554
38 pages
Queryoptimization Examples
No ratings yet
Queryoptimization Examples
26 pages
DBMS Unit-4
No ratings yet
DBMS Unit-4
51 pages
DBMS Unit 2
No ratings yet
DBMS Unit 2
11 pages
Chapter - 2 Query Processing
No ratings yet
Chapter - 2 Query Processing
64 pages
Zyqwadawfafslecture09 Query Optimization
No ratings yet
Zyqwadawfafslecture09 Query Optimization
90 pages
Mẫu Slide Về Toán Học Siêu CUTEEE
No ratings yet
Mẫu Slide Về Toán Học Siêu CUTEEE
51 pages
Project Timeline Neumorph 1 Dark
No ratings yet
Project Timeline Neumorph 1 Dark
43 pages
CNHP - Mẫu Slide Kể Tội Người Khác
No ratings yet
CNHP - Mẫu Slide Kể Tội Người Khác
40 pages
MẪU SLIDE - Thể Thao Điện Tử
No ratings yet
MẪU SLIDE - Thể Thao Điện Tử
30 pages
CNHP - Slide Chủ Đề Công Nghệ Thông Tin
No ratings yet
CNHP - Slide Chủ Đề Công Nghệ Thông Tin
22 pages
Square Summable Power Series
From Everand
Square Summable Power Series
Louis de Branges
5/5 (1)
Set Theory Essentials
From Everand
Set Theory Essentials
Emil Milewski
No ratings yet
10+2 Level Mathematics For All Exams GMAT, GRE, CAT, SAT, ACT, IIT JEE, WBJEE, ISI, CMI, RMO, INMO, KVPY Etc.
From Everand
10+2 Level Mathematics For All Exams GMAT, GRE, CAT, SAT, ACT, IIT JEE, WBJEE, ISI, CMI, RMO, INMO, KVPY Etc.
Shubhankar Paul
No ratings yet