Functional Dependencies and Normalization For Relational Databases

The document discusses normalization of relational databases. It defines normalization as the process of analyzing relation schemas based on functional dependencies and primary keys to minimize redundancy and anomalies. The document describes four normal forms: first normal form requires single-valued attributes, second normal form eliminates partial dependencies, third normal form removes transitive dependencies, and Boyce-Codd normal form requires every determinant to be a superkey. Examples illustrate how to decompose relations to satisfy each normal form.

Uploaded by

cp54609

Unit 4

Functional Dependencies and Normalization for Relational Databases

Definition of a relational schema:

 A relation schema defines the design and structure of a relation. It consists of
the relation name and a set of attributes (also called fields or columns).
Every attribute has an associated domain.

Informal Design Guidelines for Relational Schemas

1. Semantics of the relational attributes
2. Reducing the redundant values in tuples
3. Minimising the null values in tuples
4. Avoiding the generation of spurious (fake) tuples

1. Semantics of the relational attributes

 Whenever we group attributes to form a relation, we assume that a certain
meaning is associated with those attributes.
 This meaning is called the "semantics" of the relation.
 It tells us how to interpret the values stored in each tuple.
 The easier the semantics are to explain, the better the relation schema.

In other words, the attributes of a relation schema should belong together, and
each attribute should relate to the others in a way that can be stated clearly.

Example:

A student's name relates to that student's USN number.

Design guideline 1:

 Design a relation schema so that its meaning is easy to understand and explain.
 Do not combine attributes drawn from different entity types and relationship
types into a single relation.

2. Reducing the redundant values in tuples

 Mixing attributes of multiple entities in one relation may cause problems:

 Information is stored redundantly, wasting storage.
 Update anomalies arise:
    Insertion anomalies
    Deletion anomalies
    Modification anomalies

Consider a single relation that stores each student together with the Dept No
and Dept Name of the student's department.

 Redundancy: if a department has N students, its Dept No and Dept Name values
are repeated N times.
 Insertion anomaly: a new department that has no students yet cannot be
inserted without placing nulls in the student attributes.
 Deletion anomaly: if we delete the last student of a department, all
information about that department is lost.
 Modification anomaly: if we change the value of one of a department's
attributes, we must update the tuples of all the students belonging to that
department; otherwise the database becomes inconsistent.
 Note: design the schema so that no insertion, deletion, or modification
anomalies can occur.
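
The modification and deletion anomalies can be reproduced on a few rows. This is only a sketch: the relation layout (usn, student_name, dept_no, dept_name), the student names, and the department values are all hypothetical and do not come from these notes.

```python
# Hypothetical combined relation: (usn, student_name, dept_no, dept_name).
student_dept = [
    ("U01", "Anil", 10, "CSE"),
    ("U02", "Bina", 10, "CSE"),
    ("U03", "Chetan", 20, "ECE"),
]

# Modification anomaly: renaming department 10 in only one tuple
# leaves the database with two different names for the same department.
partially_updated = [("U01", "Anil", 10, "Computer Science")] + student_dept[1:]
names_for_dept_10 = {name for _, _, dept_no, name in partially_updated if dept_no == 10}

# Deletion anomaly: removing the last student of department 20
# also removes every trace of that department.
after_delete = [row for row in student_dept if row[0] != "U03"]
remaining_depts = {dept_no for _, _, dept_no, _ in after_delete}

print(sorted(names_for_dept_10))  # two names for one department
print(sorted(remaining_depts))    # department 20 is gone
```

Storing departments in a relation of their own, referenced from the student relation by Dept No, would avoid both effects.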

3. Reducing null values in tuples

NULL values appear if:
 The attribute does not apply to the tuple.
 The attribute value for this tuple is unknown.
 The value is known but not recorded.

Having many attributes with a lot of NULL values wastes storage and leads to
problems in JOIN operations and aggregate functions.

GUIDELINE
 As far as possible, avoid placing attributes in a base relation whose values
are frequently null. If nulls are unavoidable, make sure they apply only in
exceptional cases and not to the majority of tuples.

4. Avoiding the generation of spurious (fake) tuples

 A relation should be decomposed so that the resulting relations share a
primary key or foreign key. Splitting on a non-key attribute produces spurious
tuples, that is, incorrect information, when the parts are joined back together.

GUIDELINE
 Design relation schemas so that they can be joined with equality conditions on
primary key/foreign key attributes, which guarantees that no spurious tuples
are generated.
 Do not have relations that contain matching attributes other than PK/FK
combinations.
 If such relations are unavoidable, do not join them on those attributes.
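
The effect of splitting on a non-key attribute can be seen in a small sketch. The relation (emp_id, emp_name, dept_city), its rows, and the helper name project are hypothetical and chosen only for illustration; the natural join is written out by hand.

```python
# Hypothetical relation: (emp_id, emp_name, dept_city); emp_id is the primary key.
employees = {
    (1, "Asha", "Pune"),
    (2, "Ravi", "Pune"),
    (3, "Meena", "Delhi"),
}

def project(rows, indexes):
    """Projection: keep only the given column positions, dropping duplicates."""
    return {tuple(row[i] for i in indexes) for row in rows}

# Bad split: the two parts share only dept_city, which is not a key.
id_city = project(employees, (0, 2))    # (emp_id, dept_city)
name_city = project(employees, (1, 2))  # (emp_name, dept_city)

# Natural join on the shared attribute dept_city.
bad_join = {
    (emp_id, name, city)
    for (emp_id, city) in id_city
    for (name, city2) in name_city
    if city == city2
}
spurious = bad_join - employees

# Good split: the two parts share emp_id, the primary key.
id_name = project(employees, (0, 1))    # (emp_id, emp_name)
good_join = {
    (emp_id, name, city)
    for (emp_id, name) in id_name
    for (emp_id2, city) in id_city
    if emp_id == emp_id2
}

print(sorted(spurious))        # tuples that were never in the original relation
print(good_join == employees)  # the key-based split joins back exactly
```

Joining on dept_city pairs every Pune employee id with every Pune employee name, which is where the fake tuples come from.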

Functional Dependency

 A functional dependency (FD) describes how one attribute (or set of
attributes) of a relation determines another.
 A functional dependency is denoted by an arrow →. The functional dependency
of Y on X is written X → Y and is read "X determines Y": any two tuples that
agree on X must also agree on Y.
 Functional dependencies help you maintain the quality of data in the database.
 Functional dependencies play a vital role in telling a good database design
from a bad one.

Or

A functional dependency is a relationship that exists when one attribute
uniquely determines another attribute.

Example

The following example makes functional dependency easier to understand.

We have a <Department> table with two attributes, DeptId and DeptName.

DeptId = Department ID
DeptName = Department Name

DeptId is the primary key. Here, DeptId uniquely identifies the DeptName
attribute: once you know the DeptId, there is exactly one matching department
name.

DeptId    DeptName
001       Finance
002       Marketing
003       HR

Therefore, DeptName is functionally dependent on DeptId:

DeptId -> DeptName
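
Whether a given set of rows is consistent with a functional dependency can be checked mechanically. The sketch below uses the <Department> rows from the example; the function name fd_holds and the extra "Accounts" row are assumptions added for illustration.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if the FD lhs -> rhs holds in this relation instance.

    rows: list of dicts (one per tuple); lhs, rhs: lists of attribute names.
    """
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        value = tuple(row[a] for a in rhs)
        # The same left-hand value must always map to the same right-hand value.
        if seen.setdefault(key, value) != value:
            return False
    return True

department = [
    {"DeptId": "001", "DeptName": "Finance"},
    {"DeptId": "002", "DeptName": "Marketing"},
    {"DeptId": "003", "DeptName": "HR"},
]
print(fd_holds(department, ["DeptId"], ["DeptName"]))  # True

# Hypothetical extra row: the same DeptId with a different name breaks the FD.
broken = department + [{"DeptId": "001", "DeptName": "Accounts"}]
print(fd_holds(broken, ["DeptId"], ["DeptName"]))  # False
```

A check like this can only refute a dependency. An FD is a rule about every legal state of the relation, so it comes from the meaning of the attributes, not from one sample of rows.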

Advantages of Functional Dependency

 Identifying functional dependencies helps avoid data redundancy, so the same
data is not repeated at multiple locations in the database.
 It helps you maintain the quality of data in the database.
 It helps you define the meanings and constraints of the database.
 It helps you identify bad designs.
 It helps you find the facts regarding the database design.

Normalization
Defining Normalization:
Normalization is the process of analysing a given set of relation schemas, based on
their functional dependencies and primary keys, to achieve desirable properties such as:
1. Minimizing redundancy
2. Minimizing insertion, deletion and update anomalies

Categories of Normal Forms

1. First Normal Form (1NF)

For a table to be in the First Normal Form, it should follow these 4 rules:
 It should have only single-valued (atomic) attributes/columns.
 Values stored in a column should be of the same domain.
 All the columns in a table should have unique names.
 The order in which data is stored does not matter.

 A relation is in 1NF if every attribute is atomic, i.e., it contains no
multi-valued or composite attribute. A composite or multi-valued attribute
violates 1NF.
 To fix this, we can create a new row for each value of the multi-valued
attribute.

Example:
Let's take an example of a relational table <EmployeeDetail> that contains the
details of the employees of a company.
 Here, Employee Phone Number is a multi-valued attribute, so this relation is
not in 1NF.

Solution:

 To convert this table into 1NF, we store each Employee Phone Number in a row
of its own, repeating the other attribute values of that employee.
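
The conversion to 1NF can be sketched as follows. The rows, the attribute spellings (EmployeeCode, EmployeeName, EmployeePhone), and the function name to_first_normal_form are hypothetical; they stand in for the <EmployeeDetail> table and are not its actual data.

```python
# Hypothetical rows: the phone column holds several values, so the table is not in 1NF.
employee_detail = [
    {"EmployeeCode": 101, "EmployeeName": "John", "EmployeePhone": ["98765", "12345"]},
    {"EmployeeCode": 102, "EmployeeName": "Ryan", "EmployeePhone": ["55555"]},
]

def to_first_normal_form(rows, multi_valued):
    """Emit one row per value of the multi-valued attribute."""
    flat = []
    for row in rows:
        for value in row[multi_valued]:
            new_row = dict(row)             # copy so the input rows stay untouched
            new_row[multi_valued] = value   # a single atomic value per row
            flat.append(new_row)
    return flat

flat_rows = to_first_normal_form(employee_detail, "EmployeePhone")
for row in flat_rows:
    print(row)
```

After the conversion, the employee code alone no longer identifies a row; in this sketch the key becomes the combination of employee code and phone number.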

Second Normal Form (2NF)

For a relational table to be in second normal form, it must satisfy the following rules:

1. The table must be in first normal form.

2. It must not contain any partial dependency, i.e., every non-prime attribute must
be fully functionally dependent on the primary key (more precisely, on every
candidate key) rather than on only a part of it.

If a partial dependency exists, we can split the table, moving the partially dependent
attributes to another table together with the part of the key that determines them.

Let us take the <EmployeeProjectDetail> table as an example to understand what a
partial dependency is and how to normalize a table to second normal form.

In this table, the prime attributes are Employee Code and Project ID. The table has
partial dependencies because Employee Name is determined by Employee Code alone
and Project Name is determined by Project ID alone. Thus, the table violates 2NF.

Solution

To remove the partial dependencies and normalize the table into second normal
form, we can decompose <EmployeeProjectDetail> into three tables:
<EmployeeDetail>, <ProjectDetail> and <EmployeeProject>.

Each of these tables satisfies both rules of 2NF: it is in 1NF, and every non-prime
attribute is fully dependent on the primary key.
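
Finding the partial dependencies and the resulting tables can be sketched in a few lines. The function name partial_dependencies is hypothetical, attribute names are written without spaces, and the sketch assumes a single candidate key and only the two dependencies named in the example.

```python
def partial_dependencies(key, fds):
    """List FDs whose left side is a proper subset of the key and whose
    right side contains non-prime attributes (attributes outside the key)."""
    key = frozenset(key)
    found = []
    for lhs, rhs in fds:
        lhs, rhs = frozenset(lhs), frozenset(rhs)
        non_prime = rhs - key
        if lhs < key and non_prime:   # "<" is the proper-subset test on sets
            found.append((sorted(lhs), sorted(non_prime)))
    return found

key = {"EmployeeCode", "ProjectID"}
fds = [
    ({"EmployeeCode"}, {"EmployeeName"}),
    ({"ProjectID"}, {"ProjectName"}),
]
violations = partial_dependencies(key, fds)

# One table per partial dependency, plus a table that keeps the full key.
tables = [lhs + rhs for lhs, rhs in violations] + [sorted(key)]
for table in tables:
    print(table)
```

The three attribute lists correspond to <EmployeeDetail>, <ProjectDetail> and <EmployeeProject>. Any attribute that depends on the whole key would stay in the last table, which this simplified sketch does not model.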

Third Normal Form (3NF)

The normalization of 2NF relations to 3NF involves the elimination of transitive
dependencies.

A functional dependency X -> Z is said to be transitive if there is a set of
attributes Y for which the following three conditions hold:

 X -> Y
 Y does not -> X
 Y -> Z

For a relational table to be in third normal form, it must satisfy the following rules:

1. The table must be in the second normal form.

2. No non-prime attribute is transitively dependent on the primary key.

3. For each non-trivial functional dependency X -> Z, at least one of the
following conditions holds:

 X is a super key of the table.
 Z is a prime attribute of the table.

If a transitive dependency exists, we can split the table, moving the transitively
dependent attributes to a new table along with a copy of the determinant.

Let us take the <EmployeeDetail> table as an example to understand what a
transitive dependency is and how to normalize a table to third normal form.

This table is not in 3NF because it contains the transitive dependency
Employee Code -> Employee City:

 Employee Code -> Employee Zipcode
 Employee Zipcode -> Employee City

Also, Employee Zipcode is not a super key and Employee City is not a prime attribute.

To remove the transitive dependency and normalize the table into third normal
form, we can decompose <EmployeeDetail> into two tables, <EmployeeDetail> and
<EmployeeLocation>. Both are in 2NF and neither has a transitive dependency, so
both are in 3NF.
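
The decomposition can be tried out on a few rows to confirm that nothing is lost. The rows below are hypothetical sample data, and the tuple layout (Employee Code, Employee Name, Employee Zipcode, Employee City) is an assumption about the table's columns.

```python
# Hypothetical rows: (EmployeeCode, EmployeeName, EmployeeZipcode, EmployeeCity).
employee_detail = [
    (101, "John", "110033", "Delhi"),
    (102, "Ryan", "400001", "Mumbai"),
    (103, "Sara", "110033", "Delhi"),
]

# Decompose along Employee Zipcode -> Employee City.
employee = sorted({(code, name, zipcode) for code, name, zipcode, _ in employee_detail})
employee_location = sorted({(zipcode, city) for _, _, zipcode, city in employee_detail})

# Rejoin on Employee Zipcode, the key of the new table, to confirm nothing is lost.
city_of = dict(employee_location)
rejoined = [(code, name, zipcode, city_of[zipcode]) for code, name, zipcode in employee]

print(employee_location)                    # each zipcode/city pair is stored once
print(rejoined == sorted(employee_detail))  # True: the join restores the original rows
```

The city "Delhi" appears twice in the original rows but only once in the location table, which is the redundancy that 3NF removes.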

Boyce–Codd Normal Form (BCNF)

Boyce-Codd Normal Form is a stricter version of 3NF: it imposes an additional
constraint.

For a relational table to be in Boyce-Codd normal form, it must satisfy the following
rules:

1. The table must be in the third normal form.

2. For every non-trivial functional dependency X -> Y, X is a superkey of the table.
In particular, X cannot be a non-prime attribute when Y is a prime attribute,
which 3NF would still allow.

A superkey is a set of one or more attributes that can uniquely identify a row in a
database table.
Let us take the <EmployeeProjectLead> table as an example to understand
how to normalize a table to BCNF.

This table satisfies all the normal forms up to 3NF, but it violates BCNF.
The candidate key of the table is {Employee Code, Project ID}. In the
non-trivial functional dependency Project Leader -> Project ID,
Project ID is a prime attribute but Project Leader is a non-prime
attribute, so the determinant is not a superkey. This is not allowed
in BCNF.

To convert the table into BCNF, we decompose it into two tables,
<EmployeeProject> and <ProjectLead>.
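
The BCNF rule can be tested with an attribute-closure computation, which finds everything a set of attributes determines. This is a sketch only: the function names closure and bcnf_violations are hypothetical, attribute names are written without spaces, and the first dependency is inferred from the stated candidate key.

```python
def closure(attrs, fds):
    """Attribute closure: every attribute determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            # If we already have the whole left side, we gain the right side.
            if set(lhs) <= result and not set(rhs) <= result:
                result |= set(rhs)
                changed = True
    return result

def bcnf_violations(attributes, fds):
    """Return the non-trivial FDs whose left side is not a superkey."""
    bad = []
    for lhs, rhs in fds:
        trivial = set(rhs) <= set(lhs)
        is_superkey = closure(lhs, fds) >= set(attributes)
        if not trivial and not is_superkey:
            bad.append((lhs, rhs))
    return bad

attributes = ["EmployeeCode", "ProjectID", "ProjectLeader"]
fds = [
    (("EmployeeCode", "ProjectID"), ("ProjectLeader",)),  # inferred from the candidate key
    (("ProjectLeader",), ("ProjectID",)),
]
print(bcnf_violations(attributes, fds))
```

Only Project Leader -> Project ID is reported: the closure of Project Leader contains Project Leader and Project ID but not Employee Code, so Project Leader is not a superkey.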
