0% found this document useful (0 votes)

16 views27 pages

RDBMS Unit3 Informaldesign Guidelines

Informal gd

Uploaded by

yoyo36685

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views27 pages

RDBMS Unit3 Informaldesign Guidelines

Informal gd

Uploaded by

yoyo36685

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 27

NORMALIZATION: DATABASE DESIGN

THEORY

https://www.youtube.com/watch?v=NFk9sDJk50U

https://www.youtube.com/watch?v=gInecSg-36Y
Contents

• Informal design guidelines for relation schemas

• Functional Dependencies
• Normal Forms Based on Primary Keys
Informal design guidelines for relation
schemas
Four informal guidelines that may be used as measures to determine the quality of
relation schema design:
1. Making sure that the semantics of the attributes is clear in the schema
2. Reducing the redundant information in tuples
3. Reducing the NULL values in tuples
4. Disallowing the possibility of generating spurious tuples
1. Semantics of the Relation Attributes
Semantics, specifies how to interpret the attribute values stored in a tuple of the
relation-in other words, how the attribute values in a tuple relate to one another.

• Whenever we group attributes to form relation, we assume certain meaning

associated with attributes.
• This meaning we call “SEMANTICS”.
• It tells how to interpret the values stored.
• Easier the semantics, better the relation schema would be.
• The DNUMBER attribute is a foreign key that represents an implicit
relationship between EMPLOYEE and DEPARTMENT.

• The ease with which the meaning of a relation's attributes can be explained
is an informal measure of how well the relation is designed.

• In DEPT_LOCATIONS and WORKS_ON, the schema

DEPT_LOCATIONS represents a multi-valued attribute of
DEPARTMENT, where as WORKS_ON represents an M:N relationship
between EMPLOYEE and PROJ ECT

• Hence, all the relation schemas may be considered as easy to explain and
hence good from the standpoint of having clear semantics.
• We can thus formulate the following informal design guideline.
GUIDELINE 1
• Design a relation schema so that it is easy to explain its meaning.
• Do not combine attributes from multiple entity types and relationship types
into a single relation.
There is nothing wrong logically with these two relations, they are considered
poor designs because they violate Guideline 1 by mixing attributes from
distinct real-world entities
2. Redundant Information in Tuples and Update
Anomalies

One goal of schema design is to minimize the storage space used by the base
relations.

• Mixing attributes of multiple entities may cause problems

• Information is stored redundantly wasting storage
• Another serious problem with using the relations in as base relations is the
problem of update anomalies.
• These can be classified into insertion anomalies, deletion anomalies, and
modification anomalies.

Insertion Anomalies:
An Insert Anomaly occurs when certain attributes cannot be inserted into the
database without the presence of other attributes.

Insertion anomalies can be differentiated into two types, illustrated by the

following examples based on the EMP_DEPT relation.

1. To insert a new employee tuple into EMP_DEPT, we must include either

the attribute values for the department that the employee works for, or
nulls.
2. It is difficult to insert a new department that has no employees as yet in
the EMP_DEPT relation.
we can't add a new course unless we have at least one student enrolled on the course.
Deletion Anomalies:
A Delete Anomaly exists when certain attributes are lost because of the
deletion of other attributes.

• If we delete from EMP_DEPT an employee tuple that happens to represent

the last employee working for a particular department, the information
concerning that department is lost from the database

Consider what happens if Student S30 is the last student to leave the course -
All information about the course is lost.
Modification Anomalies:

An Update Anomaly exists when one or more instances of duplicated data is

updated, but not all.

• In EMP_DEPT, if we change the value of one of the attributes of a

particular department-say, the manager of department 5-we must update the
tuples of all employees who work in that department; otherwise, the
database will become inconsistent.

Consider Jones moving address - you need to update all instances of Jones's
address.
Based on the preceding three anomalies, we can state the guideline that
follows:

GUIDELINE 2

• Design the base relation schemas so that no insertion, deletion, or

modification anomalies are present in the relations.

• If any anomalies are present, note them clearly and make sure that the
.
programs that update the database will operate correctly
3. Null Values in Tuples
• In some schema designs, we may group many attributes together into a
"fat" relation (More no. of attributes in a single relation where not all
attributes are totally functionally dependent on prime attribute).
• For Example: In a Student Relation, a student having multiple phone
numbers say phno1,phno2 and phno3. Only few students may have more
than 2 phone nos. so rest of the students will keep that attribute value as a
blank or NULL so we should try to avoid it.
• Another example: Department having multiple locations where not all the
department have more than one location so rest of the tuple values will be
filled with NULL
• If many of the attributes do not apply to all tuples in the relation, we end up
with many nulls in those tuples.
• For Example.: If Apartment no. is there in a relation and if you are not
living in a apartment then the value for that attribute will end up with
NULL as it is not applicable to you.
GUIDELINE 3:

• As far as possible, avoid placing attributes in a base

relation whose values may frequently be null.
• If nulls are unavoidable, make sure that they apply in
exceptional cases only and do not apply to a majority of
tuples in the relation.
4. Generation of Spurious Tuples
•A spurious tuple is, basically, a record in a database that gets created when two
tables are joined badly.

•Spurious tuple means a Generation of an extra tuple without a notice. We should

avoid it.

•Decomposition in a Relation will be based on a Primary key.

•Split the relation based on Non-Primary key results in a generation of Spurious

tuples or Incorrect Information.

For Example:
Let us consider two relation schema
Emp_Locs(ename, plocation)
Emp_proj1(eno, pnumber, hours, pname, plocation)
• If we attempt a natural join operation on above relation schema, the result
produces many more tuples than the original set of tuples.
• Additional tuples that were not there in Emp_proj1 are called spurious tuples
because they represent wrong information which is not valid.
The two relations EMP_PROJ1 and EMP_LOCS as the base relations of EMP_PROJ1,
is not a good schema design.

Problem is if a Natural Join is performed on above two relations it produces more

tuples than origin set of tuple in EMP_PROJ1 based on non-key attributes Hours and
Pname.

These additional tuples that were on it present in EMP_PROJ1 are called Spurious
Tuples because they represented spurious or wrong information that are not valid.

This is because the Plocation attribute which is used for joining the two relations is
neither a primary key, nor a foreign key in either EMP_LOC and EMP_PROJ1.
GUIDELINE 4

• Design relation schemas so that they can be joined with equality conditions
on attributes that are either primary keys or foreign keys in a way that
guarantees that no spurious tuples are generated.

• Avoid relations that contain matching attributes that are not (foreign key,
primary key) combinations, because joining on such attributes may
produce spurious tuples.
Functional Dependencies
Normal Forms Based on Primary Keys

Normalization of Relations

The normalization process, as first proposed by Codd (1972). Codd proposed three
main normal forms, which he called first, second, third normal form and Boyce-
Codd normal form (BCNF- an extension of 3NF).

All these normal forms are based on functional dependencies among the attributes
of a relation.

Later, a fourth normal form (4NF) and a fifth normal form (5NF) were proposed,
based on the concepts of multivalued dependencies and join dependencies,
respectively;
• Normalization of data can be considered a process of analyzing
the given relation schemas based on their FDs and primary keys to
achieve the desirable properties of
(1) minimizing redundancy and
(2) minimizing the insertion, deletion, and update anomalies.

• It can be considered as a “filtering” or “purification” process to

make the design have successively better quality.

Features of Good Relational Design and Schema Refinement 1
No ratings yet
Features of Good Relational Design and Schema Refinement 1
25 pages
Unit 2 InformalDesignGuidelines-1
No ratings yet
Unit 2 InformalDesignGuidelines-1
20 pages
DBMS Notes Unit-III - For Students
No ratings yet
DBMS Notes Unit-III - For Students
90 pages
Functional Dependencies and Normalization For Relational Databases
100% (2)
Functional Dependencies and Normalization For Relational Databases
11 pages
Relational Database Design
No ratings yet
Relational Database Design
17 pages
Log 20210514104403
No ratings yet
Log 20210514104403
105 pages
Student Attendance System
No ratings yet
Student Attendance System
11 pages
Solutions DatabaseSystemConcepts 7thed
No ratings yet
Solutions DatabaseSystemConcepts 7thed
193 pages
SQL Class 12 PPT Study
No ratings yet
SQL Class 12 PPT Study
34 pages
NORMALIZATION
No ratings yet
NORMALIZATION
51 pages
DBT Teaching Guidelines
50% (2)
DBT Teaching Guidelines
4 pages
Unit 6 - Normalization
No ratings yet
Unit 6 - Normalization
10 pages
Normalization
No ratings yet
Normalization
175 pages
4 DBMS Module-IV
No ratings yet
4 DBMS Module-IV
12 pages
DBMS Module4
No ratings yet
DBMS Module4
124 pages
DBMS Module4 Notes
No ratings yet
DBMS Module4 Notes
124 pages
80 SQL Interview Questions and Answers
No ratings yet
80 SQL Interview Questions and Answers
20 pages
Module 4 - Normalization
No ratings yet
Module 4 - Normalization
141 pages
Lecture 8 1493715884
No ratings yet
Lecture 8 1493715884
138 pages
Module-4 Normalization: Database Design Theory DBMS (18CS53)
No ratings yet
Module-4 Normalization: Database Design Theory DBMS (18CS53)
24 pages
DBMS M4 - Ktunotes - in
No ratings yet
DBMS M4 - Ktunotes - in
114 pages
Functional Dependencies and Normilization
No ratings yet
Functional Dependencies and Normilization
60 pages
Unit - 3
No ratings yet
Unit - 3
92 pages
Chapter 4-Functional Dependancy and Normalization
No ratings yet
Chapter 4-Functional Dependancy and Normalization
86 pages
Example Report College E-Outing
No ratings yet
Example Report College E-Outing
96 pages
My Normalization Chapter
No ratings yet
My Normalization Chapter
76 pages
Chapter14 - Revised
No ratings yet
Chapter14 - Revised
60 pages
Chapter 4 - Database Design - (Normalization)
No ratings yet
Chapter 4 - Database Design - (Normalization)
43 pages
Chapter 14 Slides
No ratings yet
Chapter 14 Slides
58 pages
1 - Dbms Module 4 PPT 1
No ratings yet
1 - Dbms Module 4 PPT 1
64 pages
DB113-1 05 Normalization
No ratings yet
DB113-1 05 Normalization
48 pages
DBMS Module - 04
No ratings yet
DBMS Module - 04
33 pages
Chapter# 14 Database Design Theory and Normalization
No ratings yet
Chapter# 14 Database Design Theory and Normalization
54 pages
5-Review of DBMS Techniques - Normalization-09-01-2024
No ratings yet
5-Review of DBMS Techniques - Normalization-09-01-2024
62 pages
FDMS - Chapter Four
No ratings yet
FDMS - Chapter Four
62 pages
05 - Relational Database Design - Week 05
No ratings yet
05 - Relational Database Design - Week 05
37 pages
7 Normalization For Relational Databases
No ratings yet
7 Normalization For Relational Databases
38 pages
DBMS Demo
No ratings yet
DBMS Demo
25 pages
FDS Chapter 5 Database Design and Normalization Part 1 STUDENT
No ratings yet
FDS Chapter 5 Database Design and Normalization Part 1 STUDENT
27 pages
Informal Guidelines
No ratings yet
Informal Guidelines
56 pages
Schema Refinement (Normalization) in DBMS
No ratings yet
Schema Refinement (Normalization) in DBMS
39 pages
SQL Assignment 3
No ratings yet
SQL Assignment 3
26 pages
Topic 6 Database Design
No ratings yet
Topic 6 Database Design
54 pages
15 05 Normalisasi
No ratings yet
15 05 Normalisasi
48 pages
DBMS Module 04
No ratings yet
DBMS Module 04
33 pages
Normalization of Database Tables
No ratings yet
Normalization of Database Tables
35 pages
Design A Database
No ratings yet
Design A Database
65 pages
Soal Ganda
No ratings yet
Soal Ganda
18 pages
Dbms 2nd Ia Question Bank
No ratings yet
Dbms 2nd Ia Question Bank
28 pages
CH - 5 FD and Normalization
No ratings yet
CH - 5 FD and Normalization
44 pages
Different Types of Data Visualization in Data Science - NareshIT
No ratings yet
Different Types of Data Visualization in Data Science - NareshIT
19 pages
MM 3
No ratings yet
MM 3
14 pages
Hibernate Annotations: Reference Guide
No ratings yet
Hibernate Annotations: Reference Guide
25 pages
20240628152931D6667 - 006. Schema Refinement
No ratings yet
20240628152931D6667 - 006. Schema Refinement
30 pages
ICT Assignment
No ratings yet
ICT Assignment
27 pages
Chapter Five
No ratings yet
Chapter Five
35 pages
M3 Imp
No ratings yet
M3 Imp
13 pages
Part4 - Ch9 - Functional Dependencies and Normalization
No ratings yet
Part4 - Ch9 - Functional Dependencies and Normalization
26 pages
DBMS UNIT 4 - Class
No ratings yet
DBMS UNIT 4 - Class
14 pages
DBMS - Unit 4
No ratings yet
DBMS - Unit 4
27 pages
DBMS Module 3 Study Notes
No ratings yet
DBMS Module 3 Study Notes
10 pages
Ch7 Functional Dependencies and Normalization
No ratings yet
Ch7 Functional Dependencies and Normalization
23 pages
Lecture 1
No ratings yet
Lecture 1
3 pages
Quiz 4 - 1
No ratings yet
Quiz 4 - 1
6 pages
This Approach Is Not Very Popular in Practice Because It Suffers From The
No ratings yet
This Approach Is Not Very Popular in Practice Because It Suffers From The
6 pages
Normalization PDF
No ratings yet
Normalization PDF
29 pages
Unit V:Normalization: Normalization: Relational Database Design Pitfalls, Denormalized Data, Decomposition
No ratings yet
Unit V:Normalization: Normalization: Relational Database Design Pitfalls, Denormalized Data, Decomposition
30 pages
DBMS Q.Bank
No ratings yet
DBMS Q.Bank
11 pages
Privileges in SQL:: Allows Read Access To Relation, or The Ability To Query
No ratings yet
Privileges in SQL:: Allows Read Access To Relation, or The Ability To Query
29 pages
Updated Front
No ratings yet
Updated Front
3 pages
Sertifikat - MUHAMMAD ELDWIN PASARIBU - Database Design & Programming With SQL
No ratings yet
Sertifikat - MUHAMMAD ELDWIN PASARIBU - Database Design & Programming With SQL
3 pages
Relational 1
No ratings yet
Relational 1
5 pages
5.relational DB Design
No ratings yet
5.relational DB Design
30 pages
Normalization: Dr. M. Brindha Assistant Professor Department of CSE NIT, Trichy-15
No ratings yet
Normalization: Dr. M. Brindha Assistant Professor Department of CSE NIT, Trichy-15
47 pages
DS Department Assignment
No ratings yet
DS Department Assignment
3 pages
Relational Database Design: Guideline1 - Semantics of The Attributes: Design A Relation Schema So That It Is
No ratings yet
Relational Database Design: Guideline1 - Semantics of The Attributes: Design A Relation Schema So That It Is
20 pages
Query 1: Retrieve List of All Databases: SP - Helpdb
No ratings yet
Query 1: Retrieve List of All Databases: SP - Helpdb
30 pages
Shruti Raj
No ratings yet
Shruti Raj
1 page
Updated Front
No ratings yet
Updated Front
1 page
Normalization PDF
No ratings yet
Normalization PDF
29 pages
Assignment 2
No ratings yet
Assignment 2
1 page
UML Diagram
No ratings yet
UML Diagram
1 page
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
No ratings yet
Semantics of The Relation Attributes: Each Tuple in A Relation Should Represent One Entity or Relationship Instance
36 pages
DBMS Viva Questions
No ratings yet
DBMS Viva Questions
4 pages
Unit 9 Functional Dependencies and Normalization For Relational Databases
No ratings yet
Unit 9 Functional Dependencies and Normalization For Relational Databases
20 pages
06a - Normalization
No ratings yet
06a - Normalization
8 pages
Ict235lecture6 PDF
No ratings yet
Ict235lecture6 PDF
6 pages
Unit Iv Data Normalization: Semantics of Attributes Should Be Easy To Interpret
No ratings yet
Unit Iv Data Normalization: Semantics of Attributes Should Be Easy To Interpret
14 pages
#Ye Galat Hai Tell Me Why?
No ratings yet
#Ye Galat Hai Tell Me Why?
3 pages
CSC 313 Past Questions Answer
No ratings yet
CSC 313 Past Questions Answer
6 pages
Navathe Chapter 6 Solution
No ratings yet
Navathe Chapter 6 Solution
1 page
Database Design 2
No ratings yet
Database Design 2
7 pages
Excel In 7 Days : Master Excel Features & Formulas. Become A Pro From Scratch In Just 7 Days With Step-By-Step Instructions, Clear Illustrations, And Practical Examples
From Everand
Excel In 7 Days : Master Excel Features & Formulas. Become A Pro From Scratch In Just 7 Days With Step-By-Step Instructions, Clear Illustrations, And Practical Examples
Paul Slatkin
No ratings yet
101 Most Popular Excel Formulas: 101 Excel Series, #1
From Everand
101 Most Popular Excel Formulas: 101 Excel Series, #1
John Michaloudis
4/5 (5)

RDBMS Unit3 Informaldesign Guidelines

Uploaded by

RDBMS Unit3 Informaldesign Guidelines

Uploaded by

NORMALIZATION: DATABASE DESIGN

• Informal design guidelines for relation schemas

• Whenever we group attributes to form relation, we assume certain meaning

• In DEPT_LOCATIONS and WORKS_ON, the schema

• Mixing attributes of multiple entities may cause problems

Insertion anomalies can be differentiated into two types, illustrated by the

1. To insert a new employee tuple into EMP_DEPT, we must include either

• If we delete from EMP_DEPT an employee tuple that happens to represent

An Update Anomaly exists when one or more instances of duplicated data is

• In EMP_DEPT, if we change the value of one of the attributes of a

• Design the base relation schemas so that no insertion, deletion, or

• As far as possible, avoid placing attributes in a base

•Spurious tuple means a Generation of an extra tuple without a notice. We should

•Decomposition in a Relation will be based on a Primary key.

•Split the relation based on Non-Primary key results in a generation of Spurious

Problem is if a Natural Join is performed on above two relations it produces more

• It can be considered as a “filtering” or “purification” process to

You might also like