0% found this document useful (0 votes)

9 views25 pages

DBMS 3

The document discusses schema refinement and normalization in database management systems, focusing on the issues caused by redundancy such as update, insertion, and deletion anomalies. It explains the use of functional dependencies to identify and resolve these problems through decomposition into smaller relations, and outlines various normal forms (1NF, 2NF, 3NF, BCNF, etc.) that help ensure data integrity and minimize redundancy. Additionally, it covers reasoning about functional dependencies and the process of normalization to create well-structured relations.

Uploaded by

Lahari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views25 pages

DBMS 3

Uploaded by

Lahari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

DATABASE MANAGEMENT SYSTEMS

UNIT – III

SCHEMA REFINEMENT

NORMAL FORMS
UNIT – III
8

SCHEMA REFINEMENT
Introduction to schema refinement
Functional dependencies
Reasoning about FDs
NORMAL FORMS
1NF, 2NF, 3NF, BCNF
Properties of decompositions, normalization,
schema refinement in database design
Other kinds of dependencies: 4NF, 5NF, DKNF
Case studies
The Evils of Redundancy
9

Redundancy is at the root of several problems associated with

relational schemas:
redundant storage, insert/delete/update anomalies
Integrity constraints, in particular functional dependencies, can be
used to identify schemas with such problems and to suggest
refinements.
Main refinement technique: decomposition (replacing ABCD with,
say, AB and BCD, or ACD and ABD).
Decomposition should be used judiciously:
Is there reason to decompose a relation?
What problems (if any) does the decomposition cause?
INTRODUCTION TO SCHEMA REFINEMENT
10

Problems Caused by Redundancy

Storing the same information redundantly, that is, in more than one
place within a database, can lead to several problems:
Redundant storage: Some information is stored repeatedly.

Update anomalies: If one copy of such repeated data is

updated, an inconsistency is created unless all copies are

similarly updated.
Insertion anomalies: It may not be possible to store some

information unless some other information is stored as well.

Deletion anomalies: It may not be possible to delete some

information without losing some other information as well.

INTRODUCTION TO SCHEMA REFINEMENT
11

Problems Caused by Redundancy (cont.)

Consider a relation obtained by translating a variant of the
Hourly_Emps entity set
Ex: Hourly_Emps(ssn, name, lot, rating, hourly wages, hours worked)
The key for Hourly_Emps is ssn.

In addition, suppose that the hourly wages attribute is

determined by the rating attribute. That is, for a given rating

value, there is only one permissible hourly wages value.
This IC is an example of a functional dependency.

It leads to possible redundancy in the relation Hourly_Emps

Use of Decomposition
12

Intuitively, redundancy arises when a relational schema forces

an association between attributes that is not natural.
Functional dependencies (ICs) can be used to identify such
situations and to suggest revetments to the schema.
The essential idea is that many problems arising from
redundancy can be addressed by replacing a relation with a
collection of smaller relations.
Each of the smaller relations contains a subset of the attributes
of the original relation.
We refer to this process as decomposition of the larger relation
into the smaller relations
Use of Decomposition (cont.)
13

We can deal with the redundancy in Hourly_Emps by

decomposing it into two relations:
Hourly_Emps2(ssn, name, lot, rating, hours worked)
Wages(rating, hourly wages)

rating hourly wages

8 10

5 7
Use of Decomposition (cont.)
14

ssn name lot rating hours worked

123-22-3666 Attishoo 48 8 40

231-31-5368 Smiley 22 8 30

131-24-3650 Smethurst 35 5 30

434-26-3751 Guldu 35 5 32

612-67-4134 Madayan 35 8 40
Problems related to Decomposition
15

Unless we are careful, decomposing a relation schema can

create more problems than it solves.
Two important questions must be asked repeatedly:
1. Do we need to decompose a relation?
2. What problems (if any) does a given decomposition cause?
To help with the rst question, several normal forms have been
proposed for relations.
If a relation schema is in one of these normal forms, we know
that certain kinds of problems cannot arise.
FUNCTIONAL DEPENDENCIES (FDs)
16

A Functional Dependency (FD) X Y (read as X determines

Y) (X ⊆ R, Y ⊆ R) holds over relation R if, for every allowable
instance r of R:
t1 ∈r, t2 ∈r, πX(t1) = πX(t2) implies πY(t1) = πY(t2)

i.e., given two tuples in r, if the X values agree, then the Y values
must also agree. (X and Y are sets of attributes.)
An FD is a statement about all allowable relations.
Must be identified based on semantics of application.

Given some allowable instance r1 of R, we can check if it

violates some FD f, but we cannot tell if f holds over R!
K is a candidate key for R means that K R
However, K R does not require K to be minimal!
FUNCTIONAL DEPENDENCIES (FDs) - Examples
17

Consider the schema:

Student ( studName, rollNo, sex, dept, hostelName, roomNo)

Since rollNois a key, rollNo → {studName, sex, dept, hostelName,

roomNo}
Suppose that each student is given a hostel room exclusively, then
hostelName, roomNo → rollNo
Suppose boys and girls are accommodated in separate hostels,
then hostelName → sex
FDs are additional constraints that can be specified by designers
Trivial / Non - Trivial FDs
18

An FD X →Y where Y ⊆ X
-called a trivial FD, it always holds good

An FD X →Y where Y ⊈ X
-non-trivial FD

An FD X →Y where X ∩Y = Ø
-completely non-trivial FD
FUNCTIONAL DEPENDENCIES (FDs) cont.
19

Example: Constraints on Entity Set

Consider relation obtained from Hourly_Emps:
Hourly_Emps (ssn, name, lot, rating, hrly_wages, hrs_worked)

Notation: We will denote this relation schema by listing the

attributes: SNLRWH
This is really the set of attributes {S, N, L, R, W, H}.

Sometimes, we will refer to all attributes of a relation by using

the relation name. (e.g., Hourly_Emps for SNLRWH)
Some FDs on Hourly_Emps:
ssn is the key: S SNLRWH
rating determines hrly_wages: R W
Wages R W
Example (Contd.) 8 10
Hourly_Emps2 5 7
20

Problems due to R → W :
S N L R H

123-22-3666 Attishoo 48 8 40
Update anomaly: Can
we change W in just 231-31-5368 Smiley 22 8 30
the 1st tuple of 131-24-3650 Smethurst 35 5 30
SNLRWH? 434-26-3751 Guldu 35 5 32
Insertion anomaly: What
612-67-4134 Madayan 35 8 40
if we want to insert an
employee and don’t know S N L R W H
the hourly wage for his 123-22-3666 Attishoo 48 8 10 40
rating?
231-31-5368 Smiley 22 8 10 30
Deletion anomaly: If we
delete all employees with 131-24-3650 Smethurst 35 5 7 30
rating 5, we lose the 434-26-3751 Guldu 35 5 7 32
information about the 612-67-4134 Madayan 35 8 10 40
wage for rating 5!
Constraints on a Relationship Set
21

Suppose that we have entity sets Parts, Suppliers, and

Departments, as well as a relationship set Contracts that involves
all of them. We refer to the schema for Contracts as CQPSD. A
contract with contract id
C species that a supplier S will supply some quantity Q of a part
P to a department D.
We might have a policy that a department purchases at most
one part from any given supplier.
Thus, if there are several contracts between the same supplier
and department,
we know that the same part must be involved in all of them. This
constraint is an FD, DS ! P.
Reasoning about Functional Dependencies (FDs)
22

Given some FDs, we can usually infer additional FDs:

ssn did, did lot implies ssn lot
An FD f is implied by a set of FDs F if f holds whenever all FDs
in F hold.
+
F = closure of F is the set of all FDs that are implied by F.

Armstrong’s Axioms (X, Y, Z are sets of attributes):

Reflexivity: If X ⊆ Y, then Y X
Augmentation: If X Y, then XZ YZ for any Z
Transitivity: If X Y and Y Z, then X Z
These are sound and complete inference rules for FDs!
Reasoning About FDs (Contd.)
23

Couple of additional rules (that follow from AA):

Union: If X → Y and X → Z, then X → YZ
Decomposition: If X → YZ, then X → Y and X → Z

Example: Contracts(cid, sid, jid, did, pid, qty, value), and:

C is the key: C → CSJDPQV
Project purchases each part using single contract:
JP → C
Dept purchases at most one part from a supplier: S
D → P

JP → C, C → CSJDPQV imply JP → CSJDPQV

SD → P implies SDJ → JP
SDJ → JP, JP → CSJDPQV imply SDJ → CSJDPQV
Reasoning About FDs (Contd.)
24

Computing the closure of a set of FDs can be expensive. (Size

of closure is exponential in # attrs!)
Typically, we just want to check if a given FD X → Y is in the
closure of a set of FDs F. An efficient check:
Compute attribute closure of X (denoted X + ) wrt F:
Set of all attributes A such that X → A is in F +
There is a linear time algorithm to compute this.

Check if Y is in X +
Does F = {A → B, B → C, C D →E } imply A → E?
i.e, is A → E in the closure F + ? Equivalently, is E in A+ ?
Closure of a Set of FDs
25

The set of all FDs implied by a given set F of FDs is called the
closure of F and is denoted as F+.

An important question is how we can infer, or compute, the

closure of a given set F of FDs.

The following three rules, called Armstrong's Axioms, can be

applied repeatedly to infer all FDs implied by a set F of FDs.

We use X, Y, and Z to denote sets of attributes over a relation

schema R:
Closure of a Set of FDs (or Armstrong’s Inference Rules)
26

Reflexive Rule:
F ⊨{X →Y | Y ⊆ X} for any X. Trivial FDs
Augmentation Rule:
{X →Y} ⊨ {XZ →YZ}, Z ⊆ R. Here XZ denotes X ⋃ ⋃Z
Transitive Rule:
{X →Y, Y →Z} ⊨ {X →Z}
Armstrong's Axioms are sound in that they generate only FDs in F+
when applied to a set F of FDs.
They are complete in that repeated application of these rules will
generate all FDs in the closure F+.
Closure of a Set of FDs (or Armstrong’s Inference Rules)
27

It is convenient to use some additional rules while reasoning about

F+:
Union or Additive Rule:
{X →Y, X →Z} ⊨ {X →YZ}
Decomposition or Projective Rule:
{X →YZ} ⊨ {X →Y, X →Z}
Pseudo Transitive Rule:
{X →Y, WY →Z} ⊨ {WX →Z}
Attribute Closure
28

If we just want to check whether a given dependency, say, X → Y, is

in the closure of a set F of FDs,
we can do so eciently without computing F+. We rst compute the
attribute closure X+ with respect to F,
which is the set of attributes A such that X → A can be inferred
using the Armstrong Axioms.
The algorithm for computing the attribute closure of a set X of
attributes is
closure = X;
repeat until there is no change: {
if there is an FD U → V in F such that U subset of closure,
then set closure = closure union of V
}
Database Normalization
29

The main goal of Database Normalization is to restructure the

logical data model of a database to:
Eliminate redundancy

Organize data efficiently

Reduce the potential for data anomalies.

Database Normalization definitions
30

How to take a raw collection of data and break it up into

more logical units or tables, in order to reduce the occurrence
of redundant data in the database. This process of reducing
data redundancy is referred to as Normalization.
Normalization is a body of rules addressing analysis and
conversion of data structures into relations that exhibit more
desirable properties of internal consistency, minimal
redundancy and maximum stability.
Database Normalization definitions
31

Normalization is the process by which attributes are grouped

together to form a well-structured relation.
We focused on the characteristics of a good relation:

Analyzing sample relations

Identifying design flaws
And learning how to eliminate them
This is called Normalizing a relation
Normalization is a process of decomposing relations to
produce smaller, well-structured relations.
Normalization is a tool to validate and improve a logical
design, so that it satisfies certain constraints that avoid
unnecessary duplication of data.

Tableau Best Practices
No ratings yet
Tableau Best Practices
81 pages
Unit V: Rating Hourly Wages
No ratings yet
Unit V: Rating Hourly Wages
13 pages
L11 PPT IVSem
No ratings yet
L11 PPT IVSem
18 pages
FD
No ratings yet
FD
3 pages
550 Lecture13
No ratings yet
550 Lecture13
24 pages
Functional Dependencies: R&G Chapter 19
No ratings yet
Functional Dependencies: R&G Chapter 19
16 pages
Functional Dependencies: CS 186, Spring 2006, Lecture 21 R&G Chapter 19
No ratings yet
Functional Dependencies: CS 186, Spring 2006, Lecture 21 R&G Chapter 19
17 pages
DB04 FDs Rules
No ratings yet
DB04 FDs Rules
31 pages
Unit05 DBMS
No ratings yet
Unit05 DBMS
49 pages
Functional Dependencies and Normalization For Relational Databases
No ratings yet
Functional Dependencies and Normalization For Relational Databases
41 pages
F U-4 PDF
No ratings yet
F U-4 PDF
48 pages
Week3 Lecture
No ratings yet
Week3 Lecture
41 pages
FD Lecture21norm
No ratings yet
FD Lecture21norm
17 pages
CAS CS 460/660 Introduction To Database Systems Functional Dependencies and Normal Forms
No ratings yet
CAS CS 460/660 Introduction To Database Systems Functional Dependencies and Normal Forms
38 pages
Functional Dependencies & Normalization For Relational Dbs
No ratings yet
Functional Dependencies & Normalization For Relational Dbs
76 pages
MODULE4
No ratings yet
MODULE4
69 pages
4 Normalisation
No ratings yet
4 Normalisation
97 pages
Relational Database Design
No ratings yet
Relational Database Design
79 pages
Unit 3 DBMS R23
No ratings yet
Unit 3 DBMS R23
24 pages
Unit 4 - PDF
No ratings yet
Unit 4 - PDF
6 pages
AD Chap3
No ratings yet
AD Chap3
45 pages
Relational Database Design Functional Dependencies
No ratings yet
Relational Database Design Functional Dependencies
27 pages
Ch15 FDs and Normalization
No ratings yet
Ch15 FDs and Normalization
48 pages
DB Normalization Part1
No ratings yet
DB Normalization Part1
71 pages
Lecture 17-19 Functional Dependencies and Normalization
No ratings yet
Lecture 17-19 Functional Dependencies and Normalization
59 pages
Chapter 5 - Database
No ratings yet
Chapter 5 - Database
45 pages
9 Design Theory
No ratings yet
9 Design Theory
47 pages
Module 4 Dbms Student
No ratings yet
Module 4 Dbms Student
51 pages
06 - DB Design - 01
No ratings yet
06 - DB Design - 01
40 pages
Dbms Unit-5 Notes
100% (2)
Dbms Unit-5 Notes
27 pages
CH 14 FDs and Normalization PDF
No ratings yet
CH 14 FDs and Normalization PDF
55 pages
Unit-3 Dbms Odd Sem 2020-2021
No ratings yet
Unit-3 Dbms Odd Sem 2020-2021
53 pages
Database Management Systems: Introduction To Schema Refinement
No ratings yet
Database Management Systems: Introduction To Schema Refinement
28 pages
Unit 3 Notes
No ratings yet
Unit 3 Notes
78 pages
Functional Dependencies and Normalization4
No ratings yet
Functional Dependencies and Normalization4
86 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
34 pages
Chapter 14
No ratings yet
Chapter 14
54 pages
Normalization: Repetition of Information Inability To Represent Certain Information Loss of Information
No ratings yet
Normalization: Repetition of Information Inability To Represent Certain Information Loss of Information
39 pages
MIS - Lec 11 - FDs-Anomalies
No ratings yet
MIS - Lec 11 - FDs-Anomalies
26 pages
Normalization
No ratings yet
Normalization
51 pages
IT 220 Unit 4 Relational-Database-Design
No ratings yet
IT 220 Unit 4 Relational-Database-Design
56 pages
Normal Is at Ion 1
100% (1)
Normal Is at Ion 1
30 pages
Chapter 15: Basics of Functional Dependencies and Normalization For Relational Databases
No ratings yet
Chapter 15: Basics of Functional Dependencies and Normalization For Relational Databases
65 pages
ch4dbms FDand Nor
No ratings yet
ch4dbms FDand Nor
73 pages
Schema Refinement and Normalization: Reasoning About Fds (Review) Rules of Inference (Review)
No ratings yet
Schema Refinement and Normalization: Reasoning About Fds (Review) Rules of Inference (Review)
5 pages
Dbms Unit III Normalforms
No ratings yet
Dbms Unit III Normalforms
20 pages
Unit 3
No ratings yet
Unit 3
28 pages
6 Normalization
No ratings yet
6 Normalization
72 pages
Winter Semester 2023-24 - CSE2007 - ETH - AP2023246001166 - 2024-02-29 - Reference-Material-I
No ratings yet
Winter Semester 2023-24 - CSE2007 - ETH - AP2023246001166 - 2024-02-29 - Reference-Material-I
32 pages
663b2c77317db99de578cb46 Chapter 3
No ratings yet
663b2c77317db99de578cb46 Chapter 3
73 pages
Lecture 16
No ratings yet
Lecture 16
24 pages
Dbms Unit III
No ratings yet
Dbms Unit III
14 pages
Database Design Theory: Introduction To Databases CSCC43 Winter 2011 Ryan Johnson
No ratings yet
Database Design Theory: Introduction To Databases CSCC43 Winter 2011 Ryan Johnson
10 pages
Functional Dependencies: Hemani Parikh Lecturer, CE dept.,BIT
No ratings yet
Functional Dependencies: Hemani Parikh Lecturer, CE dept.,BIT
17 pages
Unit 3
No ratings yet
Unit 3
32 pages
Lec 06
No ratings yet
Lec 06
30 pages
Schema Refinement: Book: Chapter 19
No ratings yet
Schema Refinement: Book: Chapter 19
34 pages
En Database Principle-C6 Normalization Part1 TL
No ratings yet
En Database Principle-C6 Normalization Part1 TL
44 pages
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Limits and Continuity (Calculus) Engineering Entrance Exams Question Bank
From Everand
Limits and Continuity (Calculus) Engineering Entrance Exams Question Bank
Mohmmad Khaja Shareef
No ratings yet
Week - 2 Assignment - Introduction To AI Programming
No ratings yet
Week - 2 Assignment - Introduction To AI Programming
3 pages
Srinu Logbook
No ratings yet
Srinu Logbook
54 pages
SDD Assignment 240
No ratings yet
SDD Assignment 240
7 pages
Running Head:: Data Mining 1
No ratings yet
Running Head:: Data Mining 1
7 pages
Functional Dependency
No ratings yet
Functional Dependency
23 pages
Implementation Guide For Aerial Applicationof Fire Retardant
No ratings yet
Implementation Guide For Aerial Applicationof Fire Retardant
69 pages
Sample Paper 14 IP
No ratings yet
Sample Paper 14 IP
9 pages
Course 7 Week 1 Glossary - DA Terms and Definitions
No ratings yet
Course 7 Week 1 Glossary - DA Terms and Definitions
21 pages
Inforbright IEE Architecture Overview
No ratings yet
Inforbright IEE Architecture Overview
14 pages
ISA 3.0 E-Learning Assessment Test
No ratings yet
ISA 3.0 E-Learning Assessment Test
24 pages
Recommendation System in Business Intelligence Solutions For Grocery Shops Challenges and Perspective
No ratings yet
Recommendation System in Business Intelligence Solutions For Grocery Shops Challenges and Perspective
5 pages
COMP1638 Revision
No ratings yet
COMP1638 Revision
24 pages
Online MachineLearningUsing Python
No ratings yet
Online MachineLearningUsing Python
3 pages
Class Practical 3 - Chapter 3 Questions
No ratings yet
Class Practical 3 - Chapter 3 Questions
3 pages
7 Business Intelligence Lifecycle 03-01-2025
No ratings yet
7 Business Intelligence Lifecycle 03-01-2025
8 pages
Share Plex
No ratings yet
Share Plex
2 pages
Write A Query To Find The Addresses
No ratings yet
Write A Query To Find The Addresses
2 pages
DDM Lab Manual 2024-2025
No ratings yet
DDM Lab Manual 2024-2025
54 pages
2022 FRSecure CISSP Mentor Program - 2022 - Class Four
No ratings yet
2022 FRSecure CISSP Mentor Program - 2022 - Class Four
122 pages
SQL Commands Notes
No ratings yet
SQL Commands Notes
11 pages
Niroosha H CV
No ratings yet
Niroosha H CV
2 pages
Document1 PDF
No ratings yet
Document1 PDF
1 page
Fundamental Factors Affecting Quality
No ratings yet
Fundamental Factors Affecting Quality
10 pages
Unit 2
No ratings yet
Unit 2
22 pages
Introduction and Conceptual Modeling: Sarat Saharia, Deptt. of CSE, Tezpur University
No ratings yet
Introduction and Conceptual Modeling: Sarat Saharia, Deptt. of CSE, Tezpur University
22 pages
Chapater 1 Data Mining 2025
No ratings yet
Chapater 1 Data Mining 2025
7 pages
Analysis Services Asallproducts Allversions
No ratings yet
Analysis Services Asallproducts Allversions
4,774 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
28 pages
DS Unit-5
No ratings yet
DS Unit-5
50 pages

DBMS 3

Uploaded by

DBMS 3

Uploaded by

DATABASE MANAGEMENT SYSTEMS

Redundancy is at the root of several problems associated with

Problems Caused by Redundancy

Update anomalies: If one copy of such repeated data is

updated, an inconsistency is created unless all copies are

information unless some other information is stored as well.

information without losing some other information as well.

Problems Caused by Redundancy (cont.)

In addition, suppose that the hourly wages attribute is

determined by the rating attribute. That is, for a given rating

It leads to possible redundancy in the relation Hourly_Emps

Intuitively, redundancy arises when a relational schema forces

We can deal with the redundancy in Hourly_Emps by

rating hourly wages

ssn name lot rating hours worked

Unless we are careful, decomposing a relation schema can

A Functional Dependency (FD) X Y (read as X determines

Given some allowable instance r1 of R, we can check if it

Consider the schema:

Since rollNois a key, rollNo → {studName, sex, dept, hostelName,

Example: Constraints on Entity Set

Notation: We will denote this relation schema by listing the

Sometimes, we will refer to all attributes of a relation by using

Suppose that we have entity sets Parts, Suppliers, and

Given some FDs, we can usually infer additional FDs:

Armstrong’s Axioms (X, Y, Z are sets of attributes):

Couple of additional rules (that follow from AA):

Example: Contracts(cid, sid, jid, did, pid, qty, value), and:

JP → C, C → CSJDPQV imply JP → CSJDPQV

Computing the closure of a set of FDs can be expensive. (Size

An important question is how we can infer, or compute, the

The following three rules, called Armstrong's Axioms, can be

We use X, Y, and Z to denote sets of attributes over a relation

It is convenient to use some additional rules while reasoning about

If we just want to check whether a given dependency, say, X → Y, is

The main goal of Database Normalization is to restructure the

Organize data efficiently

Reduce the potential for data anomalies.

How to take a raw collection of data and break it up into

Normalization is the process by which attributes are grouped

Analyzing sample relations

You might also like