Functional Dependencies and

Normalization for Relational Databases

406.426 Design & Analysis of Database Systems

Jonghun Park
[email protected]
Dept. of Industrial Engineering
Seoul National University
outline
informal design guidelines for relational databases
functional dependencies (FDs)
normal forms based on primary keys
general normal form definitions (for multiple keys)
BCNF (Boyce-Codd Normal Form)

informal measures of quality for relation schema
semantics of the attributes
reducing the redundant values in tuples
reducing the null values in tuples
disallowing the possibility of generating spurious tuples

semantics of the relation attributes
guideline 1: Design a relation schema so that it is easy to explain its
meaning. Do not combine attributes from multiple entity types and
relationship types into a single relation. If a relation schema
corresponds to one entity type or one relationship type, it is
straightforward to explain its meaning.
examples of poor design

redundant information in tuples & update anomalies
one goal of schema design is to minimize the storage space
example:

update anomalies
insertion anomalies
to insert a new employee tuple into EMP_DEPT, we must include either the
attribute values for the department that the employee works for, or nulls
it is difficult to insert a new department that has no employees as yet in the
EMP_DEPT relation
deletion anomalies
if we delete from EMP_DEPT an employee tuple that happens to represent the
last employee working for a particular department, the information
concerning that department is lost
modification anomalies
in EMP_DEPT, if we change the value of one of the attributes of a
particular department, we must update the tuples of all employees who work
in that department
guideline 2: design the base relation schemas so that no insertion, deletion, or
modification anomalies are present in the relations
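
The deletion and modification anomalies above can be reproduced on a toy relation state. This is a minimal sketch: the tuples, and the reduced attribute list used for EMP_DEPT, are invented for illustration and do not come from the slides.

```python
# Hypothetical EMP_DEPT state; all values are invented for illustration.
EMP_DEPT = [
    {"SSN": "111", "ENAME": "Smith", "DNUMBER": 5, "DNAME": "Research"},
    {"SSN": "222", "ENAME": "Wong", "DNUMBER": 5, "DNAME": "Research"},
    {"SSN": "333", "ENAME": "Borg", "DNUMBER": 1, "DNAME": "Headquarters"},
]

def departments(rows):
    """Department facts that can be recovered from the employee tuples."""
    return {(r["DNUMBER"], r["DNAME"]) for r in rows}

# Deletion anomaly: removing the only employee of department 1
# also removes the only record that department 1 exists.
after_delete = [r for r in EMP_DEPT if r["SSN"] != "333"]
lost = departments(EMP_DEPT) - departments(after_delete)
print(lost)  # {(1, 'Headquarters')}

# Modification anomaly: renaming department 5 must touch every
# employee tuple of that department, or the state becomes inconsistent.
rows_to_update = [r for r in EMP_DEPT if r["DNUMBER"] == 5]
print(len(rows_to_update))  # 2
```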

null values in tuples
grouping many attributes together into a fat relation -> if many of the
attributes do not apply to all tuples in the relation, we end up with
many nulls in those tuples
example
if only 10% of employees have individual offices, there is little
justification for including an attribute OFFICE_NUMBER in the
EMPLOYEE relation -> A relation EMP_OFFICES(ESSN,
OFFICE_NUMBER) can be created
guideline 3: as far as possible, avoid placing attributes in a base
relation whose values may frequently be null

generation of spurious tuples
example: consider EMP_LOCS and EMP_PROJ1 instead of
EMP_PROJ
EMP_LOCS: the employee whose name is ENAME works on some
project whose location is PLOCATION

generation of spurious tuples (cont.)
decomposing EMP_PROJ into EMP_LOCS and EMP_PROJ1 is undesirable
because, when we JOIN them back using NATURAL JOIN, we do not get the
correct original information
PLOCATION is the attribute that relates EMP_LOCS and EMP_PROJ1, and
PLOCATION is neither a primary key nor a foreign key in either
EMP_LOCS or EMP_PROJ1
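
A small sketch of how the spurious tuples arise. The sample tuples are invented, and the attribute list of EMP_PROJ1 (SSN, PNUMBER, HOURS, PNAME, PLOCATION) is an assumption, since the slide only spells out EMP_LOCS. The helper functions `project` and `natural_join` are illustrative, not part of any library.

```python
# Hypothetical EMP_PROJ state: two employees on different projects
# that happen to share the same location.
EMP_PROJ = [
    {"SSN": "111", "PNUMBER": 1, "HOURS": 10, "ENAME": "Smith",
     "PNAME": "ProductX", "PLOCATION": "Houston"},
    {"SSN": "222", "PNUMBER": 2, "HOURS": 20, "ENAME": "Wong",
     "PNAME": "ProductY", "PLOCATION": "Houston"},
]

def project(rows, attrs):
    """Relational projection: keep attrs and drop duplicate tuples."""
    seen = {tuple(r[a] for a in attrs) for r in rows}
    return [dict(zip(attrs, t)) for t in sorted(seen, key=repr)]

def natural_join(r, s):
    """Join two relations on every attribute name they share."""
    out = []
    for a in r:
        for b in s:
            common = set(a) & set(b)
            if all(a[c] == b[c] for c in common):
                out.append({**a, **b})
    return out

def as_set(rows):
    """Order-independent view of a relation state."""
    return {frozenset(r.items()) for r in rows}

EMP_LOCS = project(EMP_PROJ, ["ENAME", "PLOCATION"])
EMP_PROJ1 = project(EMP_PROJ, ["SSN", "PNUMBER", "HOURS", "PNAME", "PLOCATION"])

joined = natural_join(EMP_LOCS, EMP_PROJ1)
spurious = as_set(joined) - as_set(EMP_PROJ)
print(len(EMP_PROJ), len(joined), len(spurious))  # 2 4 2
```

The join keeps every original tuple but also pairs each employee name with the other employee's project, because PLOCATION alone does not identify a tuple on either side.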

generation of spurious tuples (cont.)
guideline 4: design relation schemas so that they can be joined with
equality conditions on attributes that are either primary keys or
foreign keys in a way that guarantees that no spurious tuples are
generated

definition
a functional dependency (FD), denoted by X -> Y, between two sets of attributes
X and Y that are subsets of R specifies a constraint on the possible tuples that can
form a relation state r of R
for any two tuples t1 and t2 in r that have t1[X] = t2[X], they must also have
t1[Y] = t2[Y]
the values of the Y component of a tuple in r depend on (or are determined by)
the values of the X component
if X is a candidate key of R, X -> Y for any subset of attributes Y of R
if X -> Y in R, this does not say whether or not Y -> X in R
example
FD1: {SSN, PNUMBER} -> HOURS
FD2: SSN -> ENAME
FD3: PNUMBER -> {PNAME, PLOCATION}
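
The definition translates directly into a check over one relation state. The sketch below uses invented tuples, and `fd_holds` is a hypothetical helper name. Note that a state can only refute an FD: an FD is a constraint on all legal states of R, so passing the check on one state does not prove it.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if no two tuples agree on lhs but differ on rhs."""
    seen = {}
    for t in rows:
        x = tuple(t[a] for a in lhs)
        y = tuple(t[a] for a in rhs)
        # setdefault stores y the first time x is seen, then returns the stored value
        if seen.setdefault(x, y) != y:
            return False
    return True

# Hypothetical relation state (values invented for illustration).
rows = [
    {"SSN": "111", "PNUMBER": 1, "HOURS": 10, "ENAME": "Smith"},
    {"SSN": "111", "PNUMBER": 2, "HOURS": 5, "ENAME": "Smith"},
    {"SSN": "222", "PNUMBER": 1, "HOURS": 10, "ENAME": "Wong"},
]

print(fd_holds(rows, ["SSN"], ["ENAME"]))             # True
print(fd_holds(rows, ["SSN", "PNUMBER"], ["HOURS"]))  # True
print(fd_holds(rows, ["SSN"], ["HOURS"]))             # False
```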

inference rules for FDs
F: the set of functional dependencies that are specified on relation
schema R
F+ (closure of F): the set containing every dependency in F as well
as every dependency that can be inferred from F
example
F = {SSN -> {ENAME, BDATE, ADDRESS, DNUMBER},
DNUMBER -> {DNAME, DMGRSSN}}
SSN -> {DNAME, DMGRSSN}
SSN -> SSN
DNUMBER -> DNAME
notations
F ⊨ X -> Y: X -> Y is inferred from F
{X,Y} -> Z is abbreviated to XY -> Z

well-known inference rules
IR1 (reflexive rule)
if X ⊇ Y, then X -> Y
IR2 (augmentation rule)
{X -> Y} ⊨ XZ -> YZ
IR3 (transitive rule)
{X -> Y, Y -> Z} ⊨ X -> Z
IR4 (decomposition rule)
{X -> YZ} ⊨ X -> Y
IR5 (union rule)
{X -> Y, X -> Z} ⊨ X -> YZ
IR6 (pseudotransitive rule)
{X -> Y, WY -> Z} ⊨ WX -> Z
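
IR4 through IR6 need not be taken as primitive: each can be derived from IR1 through IR3. A short worked derivation, written here as a sketch in LaTeX notation:

```latex
\begin{align*}
\textbf{IR4:}\quad & X \to YZ \ \text{(given)}, \qquad YZ \to Y \ \text{(IR1, since } YZ \supseteq Y\text{)} \\
                   & \Rightarrow\ X \to Y \ \text{(IR3)} \\[4pt]
\textbf{IR5:}\quad & X \to Y \ \text{(given)} \ \Rightarrow\ X \to XY \ \text{(IR2, augmenting with } X;\ XX = X\text{)} \\
                   & X \to Z \ \text{(given)} \ \Rightarrow\ XY \to YZ \ \text{(IR2, augmenting with } Y\text{)} \\
                   & \Rightarrow\ X \to YZ \ \text{(IR3)} \\[4pt]
\textbf{IR6:}\quad & X \to Y \ \text{(given)} \ \Rightarrow\ WX \to WY \ \text{(IR2, augmenting with } W\text{)} \\
                   & WY \to Z \ \text{(given)} \ \Rightarrow\ WX \to Z \ \text{(IR3)}
\end{align*}
```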

closure computation
closure X+: the set of attributes that are functionally determined by X based on F
algorithm
X+ = X
repeat
    oldX+ = X+
    for each FD Y -> Z in F do
        if X+ ⊇ Y, then X+ = X+ ∪ Z
until (X+ = oldX+)
example
F = {SSN -> ENAME, PNUMBER -> {PNAME, PLOCATION}, {SSN,
PNUMBER} -> HOURS}
{SSN}+ = {SSN, ENAME}
{PNUMBER}+ = {PNUMBER, PNAME, PLOCATION}
{SSN, PNUMBER}+ ={SSN, ENAME, PNUMBER, PNAME, PLOCATION,
HOURS}
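
The closure algorithm can be sketched in a few lines of Python. The representation is an assumption made for illustration: each FD is a pair `(lhs, rhs)` of attribute-name sets, and `closure` is a hypothetical helper name.

```python
def closure(attrs, fds):
    """Return X+, the set of attributes functionally determined by attrs."""
    result = set(attrs)
    changed = True
    while changed:                      # repeat ... until X+ stops growing
        changed = False
        for lhs, rhs in fds:
            # apply Y -> Z whenever Y is already inside X+
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

# The FD set F from the example above.
F = [
    ({"SSN"}, {"ENAME"}),
    ({"PNUMBER"}, {"PNAME", "PLOCATION"}),
    ({"SSN", "PNUMBER"}, {"HOURS"}),
]

print(sorted(closure({"SSN"}, F)))      # ['ENAME', 'SSN']
print(sorted(closure({"PNUMBER"}, F)))  # ['PLOCATION', 'PNAME', 'PNUMBER']
print(sorted(closure({"SSN", "PNUMBER"}, F)))
```

The loop makes at most one pass per newly added attribute, so it terminates after at most |R| + 1 passes over F.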

equivalence of sets of FDs
F: a set of FDs
F+: closure of F
the set of all FDs logically implied by F
F is said to cover another set of FDs E if every FD in E is also in F+
F covers E if
for every FD (X -> Y) in E, X+ (w.r.t. F) ⊇ Y
that is, X+ ⊇ Y gives X+ -> Y by the reflexive rule; since X -> X+,
the transitive rule yields X -> Y
two sets of FDs E and F are equivalent if E+ = F+
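
Because F+ can be very large, coverage is tested through attribute closures rather than by enumerating F+. A minimal sketch, assuming FDs are stored as `(lhs, rhs)` pairs of sets; `covers` and `equivalent` are hypothetical helper names, and the FD sets reuse the earlier SSN/DNUMBER example.

```python
def closure(attrs, fds):
    """Attribute closure of attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

def covers(f, e):
    """F covers E if, for every X -> Y in E, Y lies inside X+ w.r.t. F."""
    return all(set(rhs) <= closure(lhs, f) for lhs, rhs in e)

def equivalent(e, f):
    """E and F are equivalent if each covers the other."""
    return covers(e, f) and covers(f, e)

F = [
    ({"SSN"}, {"ENAME", "BDATE", "ADDRESS", "DNUMBER"}),
    ({"DNUMBER"}, {"DNAME", "DMGRSSN"}),
]
# Some FDs inferred from F.
E = [
    ({"SSN"}, {"DNAME", "DMGRSSN"}),
    ({"SSN"}, {"SSN"}),
    ({"DNUMBER"}, {"DNAME"}),
]
# F rewritten with one attribute per right-hand side.
G = [(lhs, {a}) for lhs, rhs in F for a in sorted(rhs)]

print(covers(F, E))      # True
print(covers(E, F))      # False: E cannot derive SSN -> ENAME
print(equivalent(F, G))  # True
```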

minimal sets of FDs
minimal cover of a set of FDs E: a set of FDs F that satisfies the
property that
every FD in E is in F+
the above property is lost if any FD from F is removed
formally, F is minimal if
every FD in F has a single attribute for its rhs
we cannot replace any FD X -> A in F with a FD Y -> A, where Y ⊂ X,
and still have a set of FDs that is equivalent to F
we cannot remove any FD from F and still have a set of FDs that is
equivalent to F

algorithm for finding a minimal cover F for E
set F = E
replace each FD X -> {A1, ..., An} in F by the n functional
dependencies X -> A1, ..., X -> An
for each FD X -> A in F
    for each attribute B ∈ X
        if {F – {X -> A}} ∪ {(X – {B}) -> A} is equivalent to F
            then replace X -> A with (X – {B}) -> A in F
for each remaining FD X -> A in F
    if {F – {X -> A}} is equivalent to F
        then remove X -> A from F
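
A sketch of the minimal-cover algorithm in Python. Two shortcuts are assumptions of this sketch rather than part of the slides: the equivalence test in the third step is replaced by the check that A is in (X – {B})+ w.r.t. F, and the test in the last step by the check that A is in X+ w.r.t. F – {X -> A}. The input set E is invented for illustration.

```python
def closure(attrs, fds):
    """Attribute closure; each FD here is (lhs_set, single_rhs_attribute)."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, a in fds:
            if lhs <= result and a not in result:
                result.add(a)
                changed = True
    return result

def minimal_cover(fds):
    """fds: iterable of (lhs_set, rhs_set). Returns a list of (frozenset, attr)."""
    # Steps 1-2: copy E and split every right-hand side into single attributes.
    f = list(dict.fromkeys((frozenset(l), a) for l, r in fds for a in sorted(r)))
    # Step 3: drop attribute B from a left-hand side when (X - {B}) still determines A.
    for i in range(len(f)):
        for b in sorted(f[i][0]):
            lhs, a = f[i]
            reduced = lhs - {b}
            if a in closure(reduced, f):
                f[i] = (reduced, a)
    # Step 4: drop X -> A when the remaining FDs still imply it.
    i = 0
    while i < len(f):
        lhs, a = f[i]
        rest = f[:i] + f[i + 1:]
        if a in closure(lhs, rest):
            f = rest
        else:
            i += 1
    return f

# Hypothetical input: E = {B -> A, D -> A, AB -> D}.
E = [({"B"}, {"A"}), ({"D"}, {"A"}), ({"A", "B"}, {"D"})]
for lhs, a in minimal_cover(E):
    print(sorted(lhs), "->", a)
```

For this input the result is {D -> A, B -> D}. A set of FDs can have several minimal covers, so the output generally depends on the order in which FDs and attributes are examined.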

normalization of relations
first proposed by Codd
takes a relation schema through a series of tests to certify whether it
satisfies a certain normal form
a process of analyzing the given relation schemas based on their FDs
and primary keys to achieve the desirable properties of (1)
minimizing redundancy, and (2) minimizing the insertion,
deletion, and update anomalies
the process of normalization through decomposition must confirm
the existence of additional properties that the relational schemas
should possess: e.g., nonadditive join property, dependency
preservation property
1NF, 2NF, 3NF, and BCNF: based on the functional dependencies
among the attributes of a relation
4NF, 5NF: based on the concepts of multivalued dependencies and
join dependencies respectively

keys and attributes participating in keys
superkey of a relation schema R = {A1, ..., An}
a set of attributes S ⊆ R with the property that no two tuples t1 and t2 in
any legal relation state r of R will have t1[S] = t2[S]
a key K is a superkey with the additional property that removal of
any attribute from K will cause K not to be a superkey any more
if a relation schema has more than one key, each is called a
candidate key
one of the candidate keys is arbitrarily designated to be the primary
key
an attribute of relation schema R is called a prime attribute of R if it
is a member of some candidate key of R
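
Given the FDs of a schema, superkeys can be recognized through attribute closure: S is a superkey exactly when S+ = R. The brute-force sketch below enumerates subsets by increasing size to find the candidate keys. It is exponential in the number of attributes and only meant for small teaching examples; `candidate_keys` is a hypothetical helper name, and the attribute list of EMP_PROJ is assumed from the FDs used earlier.

```python
from itertools import combinations

def closure(attrs, fds):
    """Attribute closure of attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

def candidate_keys(attrs, fds):
    """Return all minimal superkeys of the schema attrs under fds."""
    attrs = set(attrs)
    keys = []
    for size in range(1, len(attrs) + 1):
        for combo in combinations(sorted(attrs), size):
            s = set(combo)
            if any(k <= s for k in keys):
                continue          # contains a smaller key: superkey but not minimal
            if closure(s, fds) == attrs:
                keys.append(s)
    return keys

EMP_PROJ = {"SSN", "PNUMBER", "HOURS", "ENAME", "PNAME", "PLOCATION"}
F = [
    ({"SSN"}, {"ENAME"}),
    ({"PNUMBER"}, {"PNAME", "PLOCATION"}),
    ({"SSN", "PNUMBER"}, {"HOURS"}),
]

keys = candidate_keys(EMP_PROJ, F)
prime = set().union(*keys)
print(keys)           # a single key made of PNUMBER and SSN
print(sorted(prime))  # ['PNUMBER', 'SSN']
```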

first normal form (1NF)
to disallow multivalued attributes, composite attributes, and their
combinations
the domain of an attribute must include only atomic values and the
value of any attribute in a tuple must be a single value from the
domain of that attribute
example

3 main techniques to achieve 1NF
(1) remove the attribute DLOCATIONS that violates 1NF and place it in a
separate relation DEPT_LOCATIONS along with the primary key DNUMBER of
DEPARTMENT -> generally considered best
(2) expand the key so that there will be a separate tuple in the original
DEPARTMENT relation for each location of a DEPARTMENT -> introduces
redundancy
(3) if a maximum number of values is known: DLOCATION1, DLOCATION2, ...
-> introduces null values

another example: nested relation
EMP_PROJ(SSN, ENAME, {PROJS(PNUMBER, HOURS)})
SSN is the primary key of EMP_PROJ, while PNUMBER is the partial key of
the nested relation
for normalization into 1NF, we remove the nested relation attributes into a new
relation and propagate the primary key into it
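
The unnesting step can be sketched as follows. The sample data is invented, and the names of the two resulting relations (`emp_proj1`, `emp_proj2`) are assumptions made for this illustration.

```python
# Hypothetical nested state of EMP_PROJ(SSN, ENAME, {PROJS(PNUMBER, HOURS)}).
nested = [
    {"SSN": "111", "ENAME": "Smith",
     "PROJS": [{"PNUMBER": 1, "HOURS": 10}, {"PNUMBER": 2, "HOURS": 5}]},
    {"SSN": "222", "ENAME": "Wong",
     "PROJS": [{"PNUMBER": 1, "HOURS": 20}]},
]

def to_1nf(rows):
    """Split the nested relation into two flat relations."""
    # The atomic attributes stay with the primary key SSN.
    emp_proj1 = [{"SSN": r["SSN"], "ENAME": r["ENAME"]} for r in rows]
    # Nested tuples move to a new relation; SSN is propagated into it,
    # so its key becomes {SSN, PNUMBER}.
    emp_proj2 = [{"SSN": r["SSN"], **p} for r in rows for p in r["PROJS"]]
    return emp_proj1, emp_proj2

emp_proj1, emp_proj2 = to_1nf(nested)
print(emp_proj1)
print(emp_proj2)
```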

second normal form (2NF)
an FD X -> Y is a full functional dependency (FFD) if removal of any attribute A
from X means that the dependency does not hold any more
an FD X -> Y is a partial dependency if some attribute A ∈ X can be removed
from X and the dependency still holds
a relation schema R is in 2NF if every nonprime attribute NA in R is fully
functionally dependent on the primary key of R
example: {SSN, PNUMBER} is a primary key for EMP_PROJ
{SSN, PNUMBER} -> ENAME: FFD?
{SSN, PNUMBER} -> PNAME: FFD?
{SSN, PNUMBER} -> PLOCATION: FFD?
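
Whether a nonprime attribute is fully dependent on the key can be checked mechanically: it is only partially dependent if some proper subset of the key already determines it. A sketch under the assumption that the FDs of EMP_PROJ are FD1 to FD3 given earlier; `partial_dependencies` is a hypothetical helper name.

```python
from itertools import combinations

def closure(attrs, fds):
    """Attribute closure of attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

def partial_dependencies(key, nonprime, fds):
    """Map each partially dependent attribute to a proper key subset determining it."""
    found = {}
    for a in sorted(nonprime):
        for size in range(1, len(key)):            # proper, nonempty subsets only
            for sub in combinations(sorted(key), size):
                if a in closure(sub, fds) and a not in found:
                    found[a] = set(sub)
    return found

F = [
    ({"SSN", "PNUMBER"}, {"HOURS"}),        # FD1
    ({"SSN"}, {"ENAME"}),                   # FD2
    ({"PNUMBER"}, {"PNAME", "PLOCATION"}),  # FD3
]
key = {"SSN", "PNUMBER"}
nonprime = {"HOURS", "ENAME", "PNAME", "PLOCATION"}

partial = partial_dependencies(key, nonprime, F)
print(partial)
```

Under these FDs, ENAME, PNAME, and PLOCATION each depend on only part of the key, so none of the three dependencies above is an FFD; only HOURS is fully dependent on {SSN, PNUMBER}.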

converting into 2NF
if a relation schema is not in 2NF, it can be normalized into a
number of 2NF relations in which nonprime attributes are
associated only with the part of the primary key on which they
are fully functionally dependent

third normal form (3NF)
an FD X -> Y in a relation schema R is a transitive dependency if
there is a set of attributes Z that is neither a candidate key nor a
subset of any key of R, and both X -> Z and Z -> Y hold
a relation schema R is in 3NF if it satisfies 2NF and no nonprime
attribute of R is transitively dependent on the primary key
example
SSN -> DMGRSSN is transitively dependent because DNUMBER is a
nonprime attribute, SSN -> DNUMBER and DNUMBER ->
DMGRSSN hold, and DNUMBER is neither a key nor a subset of the
key of EMP_DEPT

general definitions of 2nd and 3rd normal forms
the previous definition of 3NF disallows partial and transitive
dependencies on the primary key to avoid update anomalies
now the partial and full functional dependencies and transitive
dependencies are considered w.r.t. all candidate keys of a relation

general definition of 2NF
prime attribute: an attribute that is part of some candidate key
a relation schema R is in 2NF if every nonprime attribute A in R is
not partially dependent on any key of R

candidate keys: PROPERTY_ID#, {COUNTY_NAME, LOT#}

{COUNTY_NAME, LOT#} -> TAX_RATE: FFD?

general definition of 3NF
definition: a relation schema R is in 3NF if it satisfies the following property:
whenever a nontrivial functional dependency X -> A holds in R,
either (a) X is a superkey of R, or (b) A is a prime attribute of R
an FD X -> A that violates 3NF must violate both (a) and (b)
violating (b) => A is a nonprime attribute
violating (a) => X is not a superset of any key of R
=> X is either nonprime or a proper subset of a key of R
X is nonprime => transitive dependency (i.e., there is a key Y s.t. Y -> X -> A)
X is a proper subset of a key => partial dependency (i.e., for a key Z ⊃ X,
Z -> A is a partial dependency due to the existence of X -> A)
therefore, a relation schema R is in 3NF if for every nonprime
attribute A of R
it is non-transitively dependent on every key of R, and
it is fully functionally dependent on every key of R
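
The general 3NF condition can be tested directly from the definition. The sketch below checks only the FDs listed in F (with right-hand sides split into single attributes), which is enough when testing a whole schema against its own FD set. It assumes EMP_DEPT consists of exactly the attributes mentioned in the earlier SSN/DNUMBER example; `violations_3nf` is a hypothetical helper name.

```python
from itertools import combinations

def closure(attrs, fds):
    """Attribute closure of attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

def candidate_keys(attrs, fds):
    """Brute-force search for minimal superkeys (small schemas only)."""
    attrs, keys = set(attrs), []
    for size in range(1, len(attrs) + 1):
        for combo in combinations(sorted(attrs), size):
            s = set(combo)
            if not any(k <= s for k in keys) and closure(s, fds) == attrs:
                keys.append(s)
    return keys

def violations_3nf(attrs, fds):
    """List (X, A) where X -> A is nontrivial, X is not a superkey, A is nonprime."""
    attrs = set(attrs)
    prime = set().union(*candidate_keys(attrs, fds))
    bad = []
    for lhs, rhs in fds:
        for a in sorted(set(rhs) - set(lhs)):   # nontrivial part only
            if closure(lhs, fds) != attrs and a not in prime:
                bad.append((set(lhs), a))
    return bad

EMP_DEPT = {"ENAME", "SSN", "BDATE", "ADDRESS", "DNUMBER", "DNAME", "DMGRSSN"}
F = [
    ({"SSN"}, {"ENAME", "BDATE", "ADDRESS", "DNUMBER"}),
    ({"DNUMBER"}, {"DNAME", "DMGRSSN"}),
]

print(violations_3nf(EMP_DEPT, F))
```

Both reported violations have DNUMBER on the left-hand side, matching the transitive dependency SSN -> DNUMBER -> DMGRSSN discussed for EMP_DEPT.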

example

FD4: AREA -> PRICE

AREA is not a superkey and PRICE is not a prime attribute
that is, from FD1 and FD2, we know that PRICE is transitively dependent on
each of the candidate keys (PROPERTY_ID#, {COUNTY_NAME, LOT#})
via the nonprime attribute AREA

Boyce-Codd normal form (BCNF)
a relation schema R is in BCNF if whenever a nontrivial functional dependency
X -> A holds in R, then X is a superkey of R
stricter than 3NF: every relation in BCNF is also in 3NF, but a relation in 3NF is
not necessarily in BCNF
example
FD5: AREA -> COUNTY_NAME
{COUNTY_NAME, LOT#} is a candidate key
AREA is not a superkey => FD5 violates BCNF
COUNTY_NAME is a prime attribute => FD5 satisfies 3NF
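
The BCNF test differs from the 3NF test only in dropping the prime-attribute escape clause. The sketch below rests on several assumptions that are not stated on the slide: the relation is taken to have the attributes PROPERTY_ID#, COUNTY_NAME, LOT#, and AREA; each of the two candidate keys listed earlier is taken to determine all other attributes; and FD5 is taken to be AREA -> COUNTY_NAME. `bcnf_violations` is a hypothetical helper name.

```python
def closure(attrs, fds):
    """Attribute closure of attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

def bcnf_violations(attrs, fds):
    """Return the nontrivial FDs X -> Y in fds whose X is not a superkey."""
    attrs = set(attrs)
    return [(set(lhs), set(rhs) - set(lhs)) for lhs, rhs in fds
            if not set(rhs) <= set(lhs) and closure(lhs, fds) != attrs]

# Assumed schema and FDs (see the lead-in).
R = {"PROPERTY_ID#", "COUNTY_NAME", "LOT#", "AREA"}
F = [
    ({"PROPERTY_ID#"}, {"COUNTY_NAME", "LOT#", "AREA"}),
    ({"COUNTY_NAME", "LOT#"}, {"PROPERTY_ID#", "AREA"}),
    ({"AREA"}, {"COUNTY_NAME"}),   # FD5 (assumed)
]

violations = bcnf_violations(R, F)
print(violations)  # [({'AREA'}, {'COUNTY_NAME'})]

# 3NF still holds if every violating right-hand side is prime.
prime = {"PROPERTY_ID#", "COUNTY_NAME", "LOT#"}   # from the listed candidate keys
print(all(rhs <= prime for _, rhs in violations))  # True
```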
