9 Design Theory

The document discusses foundational concepts in database design theory including functional dependencies, normal forms, and decompositions. It covers topics such as the goals of normalization including removing redundancy and expressing constraints, rules for functional dependencies including splitting/combining and transitivity, identifying keys and prime attributes, and using closure tests to determine if a functional dependency is implied by a set of given dependencies. The overall document provides an introduction to key theoretical concepts for systematically improving database schemas through normalization.

Design Theory for Relational DBs:

Functional Dependencies,
Decompositions, Normal Forms
Introduction to Databases
Manos Papagelis

Thanks to Ryan Johnson, John Mylopoulos, Arnold Rosenbloom and Renee Miller for material in these slides

Database Design Theory

• Guides systematic improvements to database schemas
• General idea:
– Express constraints on the data
– Use these to decompose the relations
• Ultimately, get a schema that is in a “normal form”
– guarantees certain desirable properties
– “normal” in the sense of conforming to a standard
• The process of converting a schema to a normal form is called
normalization


Goal #1: remove redundancy

• Consider this schema
Student Name | Student Email  | Course | Instructor
Xiao         | xiao@gmail     | CSC333 | Smith
Xiao         | xiao@gmail     | CSC444 | Brown
Jaspreet     | jaspreet@gmail | CSC333 | Smith

• What if…
– Xiao changes email addresses? (update anomaly)
– Xiao drops CSC444? (deletion anomaly)
– Need to create a new course, CSC222 (insertion anomaly)

Multiple relations => exponentially worse

Goal #2: expressing constraints

• Consider the following sets of schemas:
Students(utorid, name, email)
vs.
Students(utorid, name)
Emails(utorid, address)
• Consider also:
House(street, city, value, owner, propertyTax)
vs.
House(street, city, value, owner)
TaxRates(city, value, propertyTax)

Dependencies, constraints are domain-dependent

Overview
• Part I: Functional Dependencies
• Part II: Decompositions
• Part III: Normal Forms

PART I:
FUNCTIONAL DEPENDENCIES

Functional dependencies
• Let X, Y be sets of attributes from relation R
• X -> Y is an assertion about tuples in R
– Any tuples in R which agree in all attributes of X must also agree in all
attributes of Y
• “X functionally determines Y”
– Or, “The values of attributes Y are a function of those in X”
– Not necessarily an easy function to compute, mind you
=> Consider X -> h, where h is the hash of attributes in X
• Notational conventions
– “a”, “b”, “c” – specific attributes
– “A”, “B”, “C” – sets of (unnamed) attributes
– abc -> def – same as {a,b,c} -> {d,e,f}

Most common to see singletons (X -> y or abc -> d)
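
The definition above can be checked mechanically on a concrete set of tuples. The following is a minimal sketch, assuming rows are stored as Python dictionaries; the helper name fd_holds and the sample rows (taken from the student table earlier in the slides) are illustrative. Passing the check on one instance does not prove the FD holds in general.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if no two rows agree on lhs but differ on rhs."""
    seen = {}  # maps each lhs value combination to the rhs values first seen with it
    for row in rows:
        key = tuple(row[a] for a in lhs)
        val = tuple(row[a] for a in rhs)
        if seen.setdefault(key, val) != val:
            return False  # same X values, different Y values: X -> Y is violated
    return True


rows = [
    {"name": "Xiao", "email": "xiao@gmail", "course": "CSC333", "instructor": "Smith"},
    {"name": "Xiao", "email": "xiao@gmail", "course": "CSC444", "instructor": "Brown"},
    {"name": "Jaspreet", "email": "jaspreet@gmail", "course": "CSC333", "instructor": "Smith"},
]

print(fd_holds(rows, ["email"], ["name"]))    # True
print(fd_holds(rows, ["email"], ["course"]))  # False
```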

Rules and principles about FDs

• Rules
– The splitting/combining rule
– Trivial FDs
– The transitive rule
• Algorithms related to FDs
– the closure of a set of attributes of a relation
– a minimal basis of a relation

The Splitting/Combining rule of FDs

• Attributes on right independent of each other
– Consider a,b,c -> d,e,f
– “Attributes a, b, and c functionally determine d, e, and f”
=> No mention of d relating to e or f directly
• Splitting rule (Useful to split up right side of FD)
– abc -> def becomes abc -> d, abc -> e and abc -> f
• No safe way to split left side
– abc -> def is NOT the same as ab -> def and c -> def!
• Combining rule (Useful to combine right sides):
– if abc -> d, abc -> e, abc -> f holds, then abc -> def holds
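
As a rough illustration of the two rules, the sketch below represents each attribute as a single character and each FD as a (left side, right side) pair of strings. The function names split_rhs and combine_rhs are made up for this example.

```python
from collections import defaultdict


def split_rhs(fds):
    """Splitting rule: X -> de becomes X -> d and X -> e."""
    return sorted((lhs, a) for lhs, rhs in fds for a in rhs)


def combine_rhs(singleton_fds):
    """Combining rule: X -> d and X -> e become X -> de (same left side only)."""
    grouped = defaultdict(str)
    for lhs, a in singleton_fds:
        grouped[lhs] += a
    return {lhs: "".join(sorted(set(rhs))) for lhs, rhs in grouped.items()}


fds = [("abc", "def")]
singletons = split_rhs(fds)
print(singletons)               # [('abc', 'd'), ('abc', 'e'), ('abc', 'f')]
print(combine_rhs(singletons))  # {'abc': 'def'}
```

Note that neither function ever touches the left side, in line with the warning that there is no safe way to split it.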

Splitting FDs – example

• Consider the relation and FD
– EmailAddress(user, domain, firstName, lastName)
– user, domain -> firstName, lastName
• The following hold
– user, domain -> firstName
– user, domain -> lastName
• The following do NOT hold!
– user -> firstName, lastName
– domain -> firstName, lastName

Gotcha: “doesn’t hold” = “not all tuples” != “all tuples not”

Trivial FDs
• Not all functional dependencies are useful
– A -> A always holds
– abc -> a also always holds (right side is subset of left side)
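• An FD is “trivial” when its right side is a subset of its left side; an attribute that appears on both sides is a trivial part of the FD
– Simplify by removing L ∩ R from R (L, R = left and right sides)
abc -> ad becomes abc -> d
– Or, in singleton form, delete trivial FDs
abc -> a and abc -> d becomes just abc -> d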

Transitive rule
• The transitive rule holds for FDs
– Consider the FDs: a -> b and b -> c; then a -> c holds
– Consider the FDs: ad -> b and b -> cd; then ad -> cd holds, or
just ad -> c (d is already on the left side, so ad -> d is trivial)

Identifying functional dependencies

• FDs are domain knowledge
– Intrinsic features of the data you’re dealing with
– Something you know (or assume) about the data
• Database engine cannot identify FDs for you
– Designer must specify them as part of schema
– DBMS can only enforce FDs when told to
• DBMS cannot safely “optimize” FDs either
– It has only a finite sample of the data
– An FD constrains the entire domain

Coincidence or FD?
ID   | Email             | City     | Country | Surname
1983 | [email protected] | Toronto  | Canada  | Fairgrieve
8624 | [email protected] | London   | Canada  | Samways
9141 | [email protected] | Winnipeg | Canada  | Samways
1204 | [email protected] | Aachen   | Germany | Lakemeyer

• What if we try to infer FDs from the data?

– ID -> email, city, country, surname
– email -> city, country, surname
– city -> country
– surname -> country

Domain knowledge required to validate FDs
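
To see how easily a sample suggests coincidental FDs, the sketch below tests every single-attribute pair X -> y on the rows above. It is only an illustration: the Email column is left out because its values are not legible in this copy of the slides, and the helper name holds_in_sample is an assumption.

```python
from itertools import permutations

rows = [
    {"ID": 1983, "City": "Toronto", "Country": "Canada", "Surname": "Fairgrieve"},
    {"ID": 8624, "City": "London", "Country": "Canada", "Surname": "Samways"},
    {"ID": 9141, "City": "Winnipeg", "Country": "Canada", "Surname": "Samways"},
    {"ID": 1204, "City": "Aachen", "Country": "Germany", "Surname": "Lakemeyer"},
]


def holds_in_sample(rows, x, y):
    """True if, in this sample only, equal x values always come with equal y values."""
    mapping = {}
    for r in rows:
        if mapping.setdefault(r[x], r[y]) != r[y]:
            return False
    return True


# Every ordered pair of distinct columns that happens to hold in the sample.
found = [(x, y) for x, y in permutations(rows[0], 2) if holds_in_sample(rows, x, y)]
for x, y in found:
    print(f"{x} -> {y}")
```

The output includes Surname -> Country and City -> ID, which hold in these four rows but are unlikely to be true of the domain.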

Keys and FDs

• Consider relation R with attributes A
• Superkey
– Any S ⊆ A s.t. S -> A
=> Any subset of A which determines all remaining attributes in A
• Candidate key (or key)
– C ⊆ A s.t. C -> A and X -> A does not hold for any X ⊂ C
=> A superkey which contains no other superkeys
=> Remove any attribute and you no longer have a key
• Primary key
– The candidate key we use to identify the relation
=> Always exists, only one allowed, doesn’t matter which C we use
• Prime attribute
– Attribute x s.t. ∃ candidate key C with x ∈ C (attribute that participates in at least one key)
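
The definitions above translate into a brute-force search: try attribute subsets from smallest to largest and keep those that determine everything and contain no smaller key. This is a sketch under assumptions, not an efficient method. It relies on attribute closure (covered in the closure-test slides of this part), uses single-character attribute names, and borrows the relation R(A, B, C, D, E) with FDs AB -> C, A -> D, D -> E, AC -> B as sample input.

```python
from itertools import combinations


def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def candidate_keys(attributes, fds):
    """Minimal attribute sets whose closure is the whole relation."""
    keys = []
    for size in range(1, len(attributes) + 1):
        for combo in combinations(sorted(attributes), size):
            s = set(combo)
            if any(k <= s for k in keys):
                continue  # contains a smaller key: a superkey, but not minimal
            if closure(s, fds) == set(attributes):
                keys.append(s)
    return keys


fds = [("AB", "C"), ("A", "D"), ("D", "E"), ("AC", "B")]
keys = candidate_keys("ABCDE", fds)
prime = set().union(*keys)  # attributes that appear in at least one key

print([sorted(k) for k in keys])  # [['A', 'B'], ['A', 'C']]
print(sorted(prime))              # ['A', 'B', 'C']
```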

FD: relaxes the concept of a “key”

• Functional dependency: X -> Y
• Superkey: X -> R
• A superkey must include all remaining attributes
of the relation on the RHS (Right-Hand-Side)
• An FD can involve just a subset of them
• Example:
Houses(street, city, value, owner, tax)
– street,city -> value,owner,tax (both FD and key)
– city,value -> tax (FD only)

Cyclic functional dependencies?

• Attributes on right side of one FD may appear
on left side of another!
– Simple example: assume relation (A, B) & FDs: A -> B, B -> A
– What does this say about A and B?
• Example
– studentID -> email, email -> studentID

Geometric view of FDs

• Let D be the domain of tuples in R
– Every possible tuple is a point in D
• FD X on R restricts tuples in R to a subset of D
– Points in D which violate X cannot be in R
• Example: D(x,y,z)
– xy -> z
=> z = abs(x) + abs(y)
– z -> x,y
=> x=y=abs(z)/2
[Figure: sample points of D plotted against the two FDs: (-1,-1,2), (1,1,0), (0,0,1), (1,1,2), (1,1,-2), (0,0,0), (2,2,-4), (2,2,4), (1,-1,-2), (3,2,1), (1,2,3)]

Inferring functional dependencies

• Problem
– Given FDs X1 -> a1, X2 -> a2, etc.
– Does some FD Y -> B (not given) also hold?
• Consider the dependencies
A -> B, B -> C
Does A -> C hold?

Intuitively, A -> C also holds

The given FDs entail (imply) it (transitivity rule)

How to prove it in the general case?

Closure test for FDs

• Given attribute set A and FD set F
– Denote AF+ as the closure of A relative to F
=> AF+ = set of all attributes functionally determined by A under F (given or implied)
• Computing the [transitive] closure of A
– Start: AF+ = A, F’ = F
– While ∃ X ∈ F’ s.t. LHS(X) ⊆ AF+ :
AF+ = AF+ U RHS(X)
F’ = F’ - X
– At end: A -> B holds ⟺ B ⊆ AF+
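
The loop above can be sketched in a few lines. The version below mirrors the slide's steps (a working copy F’ from which each used FD is removed); the function names closure and entails and the string encoding of FDs are assumptions made for the example.

```python
def closure(attrs, fds):
    """Closure of attrs under fds; each FD is a (lhs, rhs) pair of attribute strings."""
    result = set(attrs)    # Start: AF+ = A
    remaining = list(fds)  # F' = F
    progress = True
    while progress:
        progress = False
        for fd in remaining:
            lhs, rhs = fd
            if set(lhs) <= result:    # LHS(X) is contained in AF+
                result |= set(rhs)    # AF+ = AF+ U RHS(X)
                remaining.remove(fd)  # F' = F' - X
                progress = True
                break                 # restart the scan over the shrunken F'
    return result


def entails(fds, lhs, rhs):
    """lhs -> rhs follows from fds exactly when rhs lies inside the closure of lhs."""
    return set(rhs) <= closure(lhs, fds)


fds = [("A", "B"), ("B", "C")]
print(sorted(closure("A", fds)))  # ['A', 'B', 'C']
print(entails(fds, "A", "C"))     # True
print(entails(fds, "C", "A"))     # False
```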

Closure test – example

• Consider R(a,b,c,d,e,f)
with FDs ab -> c, ac -> d, c -> e, ade -> f
• Find A+ if A = ab or find {a,b}+

{a,b} => (ab -> c) {a,b,c} => (ac -> d) {a,b,c,d} => (c -> e) {a,b,c,d,e} => (ade -> f) {a,b,c,d,e,f}

{a,b}+={a,b,c,d,e,f} or ab -> cdef -- ab is a candidate key!

Example : Closure Test

R(A, B, C, D, E)
F: AB -> C, A -> D, D -> E, AC -> B

X  | XF+
A  | {A, D, E}
AB | {A, B, C, D, E}
AC | {A, C, B, D, E}
B  | {B}
D  | {D, E}

Is AB -> E entailed by F? Yes

Is D -> C entailed by F? No

Result: XF+ allows us to determine all FDs of the form X -> Y entailed by F

Discarding redundant FDs

• Minimal basis: opposite extreme from closure
• Given a set of FDs F, want to minimize F’ s.t.
– F’ ⊆ F+ (F’ contains only FDs given or implied by F)
– F’ entails X ∀ X ∈ F
• Properties of a minimal basis F’
– RHS is always singleton
– If any FD is removed from F’, the result is no longer a basis (it no longer entails F)
– If we remove one or more attributes from the LHS of any X ∈ F’,
the result is no longer a basis

Constructing a minimal basis

• Straightforward but time-consuming
1. Split all RHS into singletons
2. ∀ X ∈ F’, test whether J = (F’-X)+ is still equivalent to F+

=> If yes, drop X; if not, removing X would make F’ too small

3. ∀ X ∈ F’, ∀ i ∈ LHS(X), let LHS(X’)=LHS(X)-i
Test whether (F’-X+X’)+ is still equivalent to F+
=> If yes, replace X with X’; if not, X’ would make F’ too big (it entails FDs not in F+)
4. Repeat (2) and (3) until neither makes progress
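
The four steps can be sketched as follows. This is one possible reading of the procedure, not the slides' own code: attributes are single characters, the equivalence tests are done with attribute closures, and the names closure and minimal_basis are assumptions. Different processing orders can yield different (equally valid) minimal bases.

```python
def closure(attrs, fds):
    """All attributes determined by attrs; each FD is (lhs attribute set, single rhs attribute)."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, a in fds:
            if set(lhs) <= result and a not in result:
                result.add(a)
                changed = True
    return result


def minimal_basis(fds):
    # Step 1: split right sides into singletons (trivial FDs are dropped here).
    basis = []
    for lhs, rhs in fds:
        for a in rhs:
            fd = (frozenset(lhs), a)
            if a not in lhs and fd not in basis:
                basis.append(fd)
    changed = True
    while changed:  # Step 4: repeat until neither step makes progress
        changed = False
        # Step 2: drop any FD that the remaining FDs already imply.
        for fd in list(basis):
            others = [g for g in basis if g != fd]
            if fd[1] in closure(fd[0], others):
                basis = others
                changed = True
        # Step 3: drop left-side attributes that are not needed.
        for i, (lhs, a) in enumerate(basis):
            for attr in sorted(lhs):
                smaller = lhs - {attr}
                if smaller and a in closure(smaller, basis):
                    basis[i] = (smaller, a)
                    lhs = smaller
                    changed = True
        basis = list(dict.fromkeys(basis))  # remove duplicates created by step 3
    return basis


F = [("A", "AC"), ("B", "ABC"), ("D", "ABC")]
for lhs, a in minimal_basis(F):
    print("".join(sorted(lhs)), "->", a)
```

For the FD set F = {A->AC, B->ABC, D->ABC} this prints A -> C, B -> A and D -> B.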

Minimal Basis: Example

• Relation R: R(A, B, C, D)
• Defined FDs:
– F = {A->AC, B->ABC, D->ABC}

Find the minimal Basis M of F

Minimal Basis: Example (cont.)

1st Step
– H = {A->A, A->C, B->A, B->B, B->C, D->A, D->B, D->C}
2nd Step
– A->A, B->B: can be removed as trivial
– A->C: can’t be removed, as no other FD has A on its LHS
– B->A: can’t be removed, because with J=H-{B->A}, B+=BC (A is lost)
– B->C: can be removed, because with J=H-{B->C}, B+=ABC
– D->A: can be removed, because with J=H-{D->A}, D+=DBAC
– D->B: can’t be removed, because with J=H-{D->B}, D+=DC (B is lost)
– D->C: can be removed, because with J=H-{D->C}, D+=DBAC
Step outcome => H = {A->C, B->A, D->B}

Minimal Basis: Example (cont.)

3rd Step
– H doesn’t change as all LHS in H are single attributes
4th Step
– H doesn’t change

Minimal Basis: M = H = {A->C, B->A, D->B}

Minimal Basis: Example 2

• Relation R: R(A, B, C)
• Defined FDs:
– A->B, A->C, B->C, B->A, C->A, C->B
– AB->C, AC->B, BC->A
– A->BC
– A->A
• Possible Minimal Bases:
– {A->B, B->A, B->C, C->B} or
– {A->B, B->C, C->A}
– …

PART II:
SCHEMA DECOMPOSITION

FDs and redundancy

• Given relation R and FDs F
– R often exhibits anomalies due to redundancy
– F identifies many (not all) of the underlying problems
• Idea
– Use F to identify “good” ways to split relations
– Split R into 2+ smaller relations having less redundancy
– Split up F into subsets which apply to the new relations
(compute the projection of functional dependencies)

Schema decomposition
• Given relation R and FDs F
– Split R into Ri s.t. ∀i Ri ⊆ R (no new attributes)
– Split F into Fi s.t. ∀i F entails Fi (no new FDs)
– Fi involves only attributes in Ri
• Caveat: entirely possible to lose information
– F+ may contain an FD X which is not in (∪i Fi)+
=> Decomposition lost some FDs
– Possible to have R ⊂ ⋈i Ri
=> Decomposition lost some relationships
• Goal: minimize anomalies without losing info
We’ll revisit information loss later

Splitting relations – example

• Consider the following relation:
Student Name | Student Email  | Course | Instructor
Xiao         | xiao@gmail     | CSC333 | Smith
Xiao         | xiao@gmail     | CSC444 | Brown
Jaspreet     | jaspreet@gmail | CSC333 | Smith
• One possible decomposition
– Students(email, name)
Taking(studentEmail, courseName)
Courses(name, instructor)

Gotcha: lossy join decomposition

• Consider a relation with one more tuple
Student Name | Student Email  | Course | Instructor
Xiao         | xiao@gmail     | CSC333 | Smith
Xiao         | xiao@gmail     | CSC444 | Brown
Jaspreet     | jaspreet@gmail | CSC333 | Smith
Mary         | mary@gmail     | CSC444 | Rosenburg

• Students ⋈ Taking ⋈ Courses has bogus tuples!

– Mary is not taking Brown’s section of CSC444
– Xiao is not in Rosenburg’s section of CSC444
Why did this happen? How to prevent it?
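
The bogus tuples can be reproduced directly. The sketch below projects the four-tuple relation onto the three schemas from the decomposition slide and joins the projections back together with plain set comprehensions; the variable names are illustrative.

```python
# Original relation: (student name, student email, course, instructor)
rows = {
    ("Xiao", "xiao@gmail", "CSC333", "Smith"),
    ("Xiao", "xiao@gmail", "CSC444", "Brown"),
    ("Jaspreet", "jaspreet@gmail", "CSC333", "Smith"),
    ("Mary", "mary@gmail", "CSC444", "Rosenburg"),
}

# Projections onto Students(email, name), Taking(email, course), Courses(name, instructor)
students = {(e, n) for n, e, c, i in rows}
taking = {(e, c) for n, e, c, i in rows}
courses = {(c, i) for n, e, c, i in rows}

# Natural join of the three projections
joined = {
    (n, e, c, i)
    for e, n in students
    for e2, c in taking if e2 == e
    for c2, i in courses if c2 == c
}

bogus = joined - rows  # tuples the join invents
for t in sorted(bogus):
    print(t)
```

Every original tuple survives the round trip, but the join also returns two tuples that were never in the relation: Mary with Brown and Xiao with Rosenburg, both for CSC444.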

Information loss with decomposition

• Decompose R into S and T
– Consider FD a->b, with a only in S and b only in T
• FD loss
– Attributes a and b no longer in same relation
=> Must join T and S to enforce a->b (expensive)
• Join loss
– LHS and RHS no longer in same relation, no other connection
– Neither (S ∩ T) -> S nor (S ∩ T) -> T in F+
=> Joining T and S produces bogus tuples (irreparable)
• In our example:
– ({email,course} ∩ {course,instructor}) = {course}
– course -/-> instructor and course -/-> email
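
The join-loss condition for a split into two relations can be tested with an attribute closure. The following is a sketch assuming FDs are given as pairs of attribute-name tuples; the function name lossless_binary and the FD email -> name are assumptions added for the example.

```python
def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def lossless_binary(s, t, fds):
    """True if (S ∩ T) -> S or (S ∩ T) -> T follows from fds."""
    common_closure = closure(set(s) & set(t), fds)
    return set(s) <= common_closure or set(t) <= common_closure


s = {"email", "course"}
t = {"course", "instructor"}
fds = [(("email",), ("name",))]  # assumed FD; course determines nothing here

print(lossless_binary(s, t, fds))  # False

# If each course had exactly one instructor, the same split would be safe.
fds_one_instructor = fds + [(("course",), ("instructor",))]
print(lossless_binary(s, t, fds_one_instructor))  # True
```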

Projecting FDs
• Once we’ve split a relation we have to refactor
our FDs to match
– Each FD must only mention attributes from one relation
• Similar to geometric projection
– Many possible projections (depends on how we slice it)
– Keep only the ones we need (minimal basis)

FD projection algorithm
• Start with Fi = Ø
• For each subset X of Ri
– Compute X+
– For each attribute a in X+
• If a is in Ri
– add X -> a to Fi

• Compute the minimal basis of Fi

• Projection is expensive
– Suppose R1 has n attributes
– How many subsets of R1 are there?
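
A direct sketch of the algorithm above, assuming single-character attribute names. It enumerates all 2^n subsets of Ri, which is why projection is expensive. Two simplifications are assumptions of this sketch: trivial FDs (a already in X) are skipped, and the final minimal-basis step is left out.

```python
from itertools import combinations


def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def project_fds(ri, fds):
    """FDs X -> a with X ⊆ Ri and a ∈ Ri that follow from fds."""
    ri = sorted(set(ri))
    projected = []                             # Start with Fi = Ø
    for size in range(len(ri) + 1):
        for x in combinations(ri, size):       # each subset X of Ri
            for a in sorted(closure(x, fds)):  # each attribute a in X+
                if a in ri and a not in x:     # a is in Ri (and X -> a is not trivial)
                    projected.append(("".join(x), a))
    return projected


fds = [("A", "B"), ("B", "C")]
print(project_fds("AC", fds))  # [('A', 'C')]
print(project_fds("BC", fds))  # [('B', 'C')]
```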

Making projection more efficient

• Ignore trivial dependencies
– No need to add X -> A if A is in X itself
• Ignore trivial subsets
– The empty set or the set of all attributes (both are subsets of
Ri)
• Ignore supersets of X if X+ = R
– They can only give us “weaker” FDs (with more on the LHS)

Example: Projecting FD’s

• ABC with FD’s A->B and B->C
– A+ = ABC; yields A->B, A->C
• We ignore A->A as trivial
• We ignore the supersets of A, AB+ and AC+, because they can only give us
“weaker” FDs (with more on the LHS)
– B+ = BC; yields B->C
– C+ = C; yields nothing.
– BC+ = BC; yields nothing.

Example -- Continued
• Resulting FD’s: A->B, A->C, and B->C
• Projection onto AC : A->C
– Only FD that involves a subset of {A,C}
• Projection on BC: B->C
– Only FD that involves subset of {B, C}


PART III:
NORMAL FORMS

Motivation for normal forms

• Identify a “good” schema
– For some definition of “good”
– Avoid anomalies, redundancy, etc.
• Many normal forms
– 1st
– 2nd
– 3rd
– Boyce-Codd
– ... and several more we won’t discuss…

BCNF ⊂ 3NF ⊂ 2NF ⊂ 1NF (focus on 3NF/BCNF)

1st normal form (1NF)

• No multi-valued attributes allowed
– Imagine storing a list/set of things in an attribute
=> Not really even expressible in RA
• Counterexample
– Course(name, instructor, [student,email]*)
– Redundancy in non-list attributes

Name   | Instructor | Student Name | Student Email
CSCC43 | Johnson    | Xiao         | xiao@gmail
       |            | Jaspreet     | jaspreet@utsc
       |            | Mary         | mary@utsc
CSCD08 | Rosenburg  | Jaspreet     | jaspreet@utsc

2nd normal form (2NF)

• Non-prime attributes depend on whole candidate keys
– Consider non-prime (i.e. not part of any key) attribute ‘a’
– Then ∃ FD X -> a s.t. X is a candidate key, and no proper subset of a candidate key determines ‘a’ (no partial dependencies)
• Counterexample
– Movies(title, year, star, studio, studioAddress, salary)
– FD: title, year -> studio; studio -> studioAddress; title, year, star -> salary
Title         | Year | Star   | Studio    | StudioAddr  | Salary
Star Wars     | 1977 | Hamill | Lucasfilm | 1 Lucas Way | $100,000
Star Wars     | 1977 | Ford   | Lucasfilm | 1 Lucas Way | $100,000
Star Wars     | 1977 | Fisher | Lucasfilm | 1 Lucas Way | $100,000
Patriot Games | 1992 | Ford   | Paramount | Cloud 9     | $2,000,000
Last Crusade  | 1989 | Ford   | Lucasfilm | 1 Lucas Way | $1,000,000

3rd normal form (3NF)

• Non-prime attr. depend only on candidate keys
– Consider FD X -> a
– Either a ∈ X OR X is a superkey OR a is prime (part of a key)
=> No transitive dependencies allowed
• Counterexample:
– studio -> studioAddr
(studioAddr depends on studio which is not a candidate key)

Title         | Year | Studio    | StudioAddr
Star Wars     | 1977 | Lucasfilm | 1 Lucas Way
Patriot Games | 1992 | Paramount | Cloud 9
Last Crusade  | 1989 | Lucasfilm | 1 Lucas Way
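
The three-way condition (a ∈ X, X is a superkey, or a is prime) can be checked for each given FD. The sketch below does this for the counterexample relation, assuming the FDs title, year -> studio and studio -> studioAddr; the function names are made up for the example, and the key search is brute force.

```python
from itertools import combinations


def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def candidate_keys(attrs, fds):
    """Minimal attribute sets whose closure is the whole relation."""
    keys = []
    for size in range(1, len(attrs) + 1):
        for combo in combinations(sorted(attrs), size):
            s = set(combo)
            if not any(k <= s for k in keys) and closure(s, fds) == set(attrs):
                keys.append(s)
    return keys


def third_nf_violations(attrs, fds):
    """Pairs (X, a) from the given FDs that break the 3NF condition."""
    attrs = set(attrs)
    prime = set().union(*candidate_keys(attrs, fds))
    bad = []
    for lhs, rhs in fds:
        is_superkey = closure(lhs, fds) >= attrs
        for a in rhs:
            if a in lhs or is_superkey or a in prime:
                continue  # one of the three allowed cases
            bad.append((tuple(lhs), a))
    return bad


attrs = {"title", "year", "studio", "studioAddr"}
fds = [(("title", "year"), ("studio",)), (("studio",), ("studioAddr",))]
print(third_nf_violations(attrs, fds))  # [(('studio',), 'studioAddr')]
```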

3NF, dependencies, and join loss

• Theorem: always possible to convert a schema to join-lossless,
dependency-preserving 3NF
• Caveat: always possible to create schemas in 3NF for
which these properties do not hold
• FD loss example:
– MovieInfo(title, year, studioName)
– StudioAddress(title, year, studioAddress)
=> Cannot enforce studioName -> studioAddress
• Join loss example:
– Movies(title, year, star)
– StarSalary(star, salary)
=> Movies ⋈ StarSalary yields bogus tuples (irreparable)

Boyce-Codd normal form (BCNF)

• One additional restriction over 3NF
– All non-trivial FDs have superkey LHS
• Counterexample
– CanadianAddress(street, city, province, postalCode)
– Candidate keys: {street, postalCode}, {street, city, province}
– FD: postalCode -> city, province

– Satisfies 3NF: city, province both prime (each is part of a candidate key)

– Violates BCNF: postalCode is not a superkey
=> Possible anomalies involving postalCode

Do we care? How often do postal codes change?
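
The BCNF condition is simpler to test than 3NF because no keys need to be enumerated: a non-trivial FD violates BCNF exactly when the closure of its left side is not the whole relation. The sketch below applies this to CanadianAddress; the second FD is derived from the candidate key {street, city, province}, and the function name bcnf_violations is an assumption.

```python
def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result


def bcnf_violations(attrs, fds):
    """Given FDs that are non-trivial and whose left side is not a superkey."""
    attrs = set(attrs)
    return [
        (lhs, rhs)
        for lhs, rhs in fds
        if not set(rhs) <= set(lhs)         # non-trivial
        and not closure(lhs, fds) >= attrs  # LHS is not a superkey
    ]


attrs = {"street", "city", "province", "postalCode"}
fds = [
    (("postalCode",), ("city", "province")),
    (("street", "city", "province"), ("postalCode",)),
]
print(bcnf_violations(attrs, fds))  # [(('postalCode',), ('city', 'province'))]
```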

Limits of decomposition
• Pick two…
– Lossless join
– Dependency preservation
– Anomaly-free
• 3NF
– Always allows join lossless and dependency preserving
– May allow some anomalies
• BCNF
– Always excludes anomalies
– May give up one of join lossless or dependency preserving

Use domain knowledge to choose 3NF vs. BCNF
