0% found this document useful (0 votes)

28 views8 pages

Data Normalization

Uploaded by

Julfikar asif

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views8 pages

Data Normalization

Uploaded by

Julfikar asif

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Data Normalization

Functional Dependency

Functional dependency (FD) is set of constraints between two attributes in a relation. Functional
dependency says that if two tuples have same values for attributes A1, A2,..., An then those two
tuples must have to have same values for attributes B1, B2, ..., Bn. Functional dependency is
represented by arrow sign (→), that is X→Y, where X functionally determines Y. The left hand
side attributes determines the values of attributes at right hand side.

Transitivity rule: Same as transitive rule in algebra, if a → b holds and b → c holds then a → c
also hold. a → b is called as a functionally determines b.

We might say that Salesperson Number defines Salesperson Name. If I give you a Salesperson
Number, you can give me back the one and only name that goes with it. These defining
associations are commonly written with a right-pointing arrow like this:
Salesperson Number →Salesperson Name
In the more formal terms of functional dependencies, the attribute on the left side is referred to as
the determinant attribute. This is because its value determines the value of the attribute on the
right side. Conversely, we also say that the attribute on the right is functionally dependent on the
attribute on the left.
Figure 4-23 Salesperson entity attributes.
Salesperson Number
Salesperson Name
Commission
Percentage
Year of Hire
Department
Number
Manager Name
Product Number
Product Name
Unit Price
Quantity

Page 1 of 8 Lecture – 6 (Normalization)

Normalization

If a database design is not perfect it may contain anomalies, which are like a bad dream for
database itself. Managing a database with anomalies is next to impossible. Data normalization
is a methodology for organizing attributes into tables so that redundancy among the nonkey
attributes is eliminated. Each of the resultant tables deals with a single data focus, which is just
another way of saying that each resultant table will describe a single entity type or a single
many-to-many relationship. Furthermore, foreign keys will appear exactly where they are
needed. In other words, the output of the data normalization process is a properly structured
relational database

 Update anomalies: if data items are scattered and are not linked to each other properly,
then there may be instances when we try to update one data item that has copies of it
scattered at several places, few instances of it get updated properly while few are left with
there old values. This leaves database in an inconsistent state.
 Deletion anomalies: we tried to delete a record, but parts of it left undeleted because of
unawareness, the data is also saved somewhere else.
 Insert anomalies: we tried to insert data in a record that does not exist at all.

Normalization is a method to remove all these anomalies and bring database to consistent state
and free from any kinds of anomalies.

Here are three additional points to remember:

1. Once the attributes are arranged in third normal form (and if none of the exception conditions
is present), the group of tables that they comprise is, in fact, a well-structured relational database
with no data redundancy.
2. A group of tables is said to be in a particular normal form if every table in the group is in that
normal form.
3. The data normalization process is progressive. If a group of tables is in second normal form, it
is also in first normal form. If the tables are in third normal form, they are also in second normal
form.

Understanding Unnormalized Data or zero normal form

The table in Figure 4-25 is unnormalized. The table has four records, one for each salesperson.
But since each salesperson has sold several products and there is only one record for each
salesperson, several attributes of each record must have multiple values. For example, the record
for salesperson 137 has three product numbers, 19440, 24013, and 26722, in its Product Number
attribute because salesperson 137 has sold all three of those products. Having such multivalued
attributes is not permitted and so this table is unnormalized.

Normalizing to First Normal Form

In the first normal form, each attribute value is atomic, that is, no attribute is multivalued. The
table in Figure 4-26 is the first normal form representation of the data. The attributes under
consideration have been listed in one table, and a primary key has been established. In this

Page 2 of 8 Lecture – 6 (Normalization)

definition of normal forms, the requirement for a primary key is not listed as part of any normal
form, but is considered an assumed requirement of the initial E-R diagramming process.
As the sample data in Figure 4-27 shows, the number of records has increased compared to the
unnormalized representation. Every attribute of every record has just one value. The multivalued
attributes from Figure 4-25 are eliminated.

Normalizing to Second Normal Form

Page 3 of 8 Lecture – 6 (Normalization)

More formally, second normal form does not allow partial functional dependencies where data
is dependent on part of the primary key. That is, in a table in second normal form, every nonkey
attribute must be fully functionally dependent on the entire key of that table. In plain language, a
nonkey attribute cannot depend on only part of the key, the way that Salesperson Name, Product
Name, and most of the other nonkey attributes of Figure 4-26 violate this restriction.

Normalizing to Third Normal Form

In third normal form, nonkey attributes are not allowed to define other nonkey attributes.
Stated more formally, third normal form does not allow transitive dependencies in which one
nonkey attribute is functionally dependent on another.

Page 4 of 8 Lecture – 6 (Normalization)

Example 1 Student Database
0NF: Un normalized data with multivalued attributes.

1NF:
Remove multivalued attributes
Student database (Student_ID, Student_Name, Batch, Advisor, Department_Name,
Department_Head, Course_No, Course_Title)

2NF:
Remove partial functional dependencies. data is dependent on part of the primary key.
Student (Student _ID, Student_Name, Batch, Advisor, Department_Name, Department_Head)
Student_Course (Student_ID,Course_No, Course_Title)

3NF:
Remove transitive dependencies
Student (Student _ID, Student_Name, Batch, Department_Name)
Advisor ( Batch, Advisor)
Department (Department_Name, Department_Head)
Student_Course ( Student_ID, Course_ID)
Course (Course_ID, Course_Title)

Example 2 Employee Database

1NF
Employee Database ( Empoyee_ID, Employee_Name, Mobile, Department_Name,
Department_Location, Project_ID, Project_Name )

2NF
Employee (Empoyee_ID, Employee_Name, Mobile, Department_Name, Department_Location)
Project (Project_ID, Project_Name, Employee_ID)

3NF
Employee (Empoyee_ID, Employee_Name, Mobile,Department_ID)
Department (Department_ID, Department_Name, Department_Location)
Project (Project_ID, Project_Name, Employee_ID)

Boyce Codd Normal Form (BCNF)

When a relation has more than one candidate key, anomalies may result even though the relation
is in 3NF. 3NF does not deal satisfactorily with the case of a relation with overlapping candidate
keys. –i.e. composite candidate keys with at least one attribute in common.
•BCNF is based on the concept of a determinant.
–A determinant is any attribute (simple or composite) on which some other attribute is fully
functionally dependent.
•A relation is in BCNF is, and only if, every determinant is a candidate key.
The theory
•Consider the following relation and determinants.
R(a,b,c,d)

Page 5 of 8 Lecture – 6 (Normalization)

a,c -> b,d
a,d -> b
•To be in BCNF, all valid determinants must be a candidate key. In the relation R, a,c->b,d is the
determinate used, so the first determinate is fine.
•a,d->b suggests that a,d can be the primary key, which would determine b. However this would
not determine c. This is not a candidate key, and thus R is not in BCNF.
Example 1
Appointment Table
Patient No Patient Name Appointment Id Time Doctor

1 Jhon 0 09:00 Zorro

2 Kerr 0 9:00 Killer
3 Adam 1 10:00 Zorro
4 Robert 0 13:00 Killer
5 Zane 1 14:00 Zorro

Two possible keys

•DB(Patno, PatName, appNo, time, doctor)
•Determinants:
–Patno-> PatName
–Patno, appNo-> Time, doctor
–Time -> appNo

•Two options for 1NF primary key selection:

–DB(Patno, PatName, appNo, time, doctor) (example 1a)
–DB(Patno, PatName, appNo, time, doctor) (example 1b)

Example 1a
•DB(Patno, PatName, appNo, time, doctor)
•No repeating groups, so in 1NF
•2NF –eliminate partial key dependencies:
–DB(Patno, appNo, time, doctor)
–R1(Patno, PatName)
•3NF –no transient dependences so in 3NF
•Now try BCNF.

BCNF Every determinant is a candidate key

DB(Patno, appNo, time, doctor)
R1(Patno, PatName)
•Is determinant a candidate key?
–Patno-> PatName
Patno is present in DB, but not PatName, so
irrelevant.
-Patno, appNo-> Time, doctor
All LHS and RHS present so relevant. Is this a candidate
key? Patno,appNoIS the key, so this is a candidate key.

Page 6 of 8 Lecture – 6 (Normalization)

–Time -> appNo
Time is present, and so is appNo, so relevant. Is this a
candidate key? If it was then we could rewrite DB as:
DB(Patno, appNo, time, doctor)
This will not work, so not BCNF.

Rewrite to BCNF
•DB(Patno, appNo, time, doctor)
R1(Patno, PatName)
•BCNF: rewrite to
DB(Patno, time, doctor)
R1(Patno, PatName)
R2(time, appNo)
•time is enough to work out the appointment number of a patient. Now BCNF is satisfied, and
the final relations shown are in BCNF

Example 1b
•DB(Patno, PatName, appNo, time, doctor)
•No repeating groups, so in 1NF
•2NF –eliminate partial key dependencies:
–DB(Patno, time, doctor)
–R1(Patno, PatName)
–R2(time, appNo)
•3NF –no transient dependences so in 3NF
•Now try BCNF.

BCNF Every determinant is a candidate key

DB(Patno, time, doctor)
R1(Patno, PatName)
R2(time, appNo)
•Is determinant a candidate key?
–Patno-> PatName
Patnois present in DB, but not PatName, irrelevant.
–Patno, appNo-> Time, doctor
Not all LHS present so not relevant
–Time -> appNo
Time is present, but not appNo, so not relevant.
–Relations are in BCNF.

Summary -Example 1
This example has demonstrated three things:
•BCNF is stronger than 3NF, relations that are in 3NF are not necessarily inBCNF
•BCNF is needed in certain situations to obtain full understanding of the data model
•there are several routes to take to arrive at the same set of relations in BCNF.
–Unfortunately there are no rules as to which route will be the easiest one to take.

Page 7 of 8 Lecture – 6 (Normalization)

Summary:
1NF: no attribute is multivalued
2NF: no partial functional dependencies
3NF: no transitive dependencies
4NF (Boyce CoddNormal Form (BCNF)): no multi-valued dependencies.

What are the Benefits of Database Normalization?

Improved data integrity!
No INSERT or UPDATE anomalies.
Decreased storage requirements!
No redundant data stored.
Faster search performance!
Smaller file for table scans.
More directed searching.

Page 8 of 8 Lecture – 6 (Normalization)

Dbms Chapter 6 Normalization
No ratings yet
Dbms Chapter 6 Normalization
2 pages
DB_Lecture_9&10
No ratings yet
DB_Lecture_9&10
50 pages
CMPG311_SU5-CH7
No ratings yet
CMPG311_SU5-CH7
37 pages
Dbms Theory Notes Unit IV
No ratings yet
Dbms Theory Notes Unit IV
73 pages
376420_LEC06_Normalization_Up
No ratings yet
376420_LEC06_Normalization_Up
51 pages
7. Normalization
No ratings yet
7. Normalization
30 pages
Chapter6_NormalizationDatabaseTables_Part4 (2)
No ratings yet
Chapter6_NormalizationDatabaseTables_Part4 (2)
38 pages
Normalizekiit PDF
No ratings yet
Normalizekiit PDF
68 pages
Normalization
No ratings yet
Normalization
57 pages
Chapter 5-T323 Introduction to the Relational Database
No ratings yet
Chapter 5-T323 Introduction to the Relational Database
37 pages
ADBMS Lec4
No ratings yet
ADBMS Lec4
35 pages
Normalization docx (Autosaved)
No ratings yet
Normalization docx (Autosaved)
33 pages
Database Design With Normalization
No ratings yet
Database Design With Normalization
30 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
28 pages
Chapter 4 DB
No ratings yet
Chapter 4 DB
30 pages
Topic 07
No ratings yet
Topic 07
56 pages
7 Normalization SAR
No ratings yet
7 Normalization SAR
33 pages
Normalization of Database Tables
No ratings yet
Normalization of Database Tables
23 pages
Redundancy Dependency Loss of Information
No ratings yet
Redundancy Dependency Loss of Information
61 pages
Normalization
No ratings yet
Normalization
30 pages
Lecture 6 Normalization
No ratings yet
Lecture 6 Normalization
54 pages
Functional Dependency and Normalization
No ratings yet
Functional Dependency and Normalization
10 pages
Normalization: ITM 692 Sanjay Goel
No ratings yet
Normalization: ITM 692 Sanjay Goel
34 pages
DATABASE NOTES Database Normalization
No ratings yet
DATABASE NOTES Database Normalization
13 pages
Normalization and Denormalization
No ratings yet
Normalization and Denormalization
44 pages
Quiz 8
No ratings yet
Quiz 8
4 pages
lesson10 Normalization
No ratings yet
lesson10 Normalization
8 pages
Normalization
No ratings yet
Normalization
13 pages
DBS Normalization
No ratings yet
DBS Normalization
30 pages
Normalization in DBMS11
No ratings yet
Normalization in DBMS11
12 pages
IM Module 3, Lesson 3
No ratings yet
IM Module 3, Lesson 3
51 pages
Topic 6- Normalization
No ratings yet
Topic 6- Normalization
13 pages
What is Normalization
No ratings yet
What is Normalization
8 pages
VII. Normalización
No ratings yet
VII. Normalización
16 pages
Normalization 1
No ratings yet
Normalization 1
10 pages
DBMS Normalization
No ratings yet
DBMS Normalization
18 pages
Normalization
No ratings yet
Normalization
48 pages
Unit 4
No ratings yet
Unit 4
19 pages
Normalization of Database Tables
100% (1)
Normalization of Database Tables
59 pages
Unit3-Part2-Normalization-Normal Forms
No ratings yet
Unit3-Part2-Normalization-Normal Forms
20 pages
Normalization of Database Tables: Examples of Functional Dependencies
No ratings yet
Normalization of Database Tables: Examples of Functional Dependencies
5 pages
ASSIGNMENT NORMALIZATION_Phoenix
No ratings yet
ASSIGNMENT NORMALIZATION_Phoenix
8 pages
Lec 5 Normalization
No ratings yet
Lec 5 Normalization
25 pages
IM 101_Fundamentals of Database Systems_Unit 8
No ratings yet
IM 101_Fundamentals of Database Systems_Unit 8
27 pages
Chapter 6 - Normalization of Database Tables
No ratings yet
Chapter 6 - Normalization of Database Tables
23 pages
Unit 3 1
No ratings yet
Unit 3 1
11 pages
Dbms Normalization
No ratings yet
Dbms Normalization
5 pages
MYSQL DAY - 20 (Normalization)
No ratings yet
MYSQL DAY - 20 (Normalization)
13 pages
Normalization and Normal Form
No ratings yet
Normalization and Normal Form
11 pages
Normalization of Database Tables
No ratings yet
Normalization of Database Tables
21 pages
Ch04 Normalization
No ratings yet
Ch04 Normalization
23 pages
Systems Analysis and Design 9th Edition Kendall Test Bank download
100% (2)
Systems Analysis and Design 9th Edition Kendall Test Bank download
20 pages
Protocolo Modbus - VSD Adv & GCS
No ratings yet
Protocolo Modbus - VSD Adv & GCS
93 pages
Dbms Assignment ON Normalization: Submitted By, R.Kiruba Sankar
No ratings yet
Dbms Assignment ON Normalization: Submitted By, R.Kiruba Sankar
10 pages
Helpdesk Flowchart
No ratings yet
Helpdesk Flowchart
3 pages
CP R81 CLI ReferenceGuide
No ratings yet
CP R81 CLI ReferenceGuide
1,673 pages
Normalization in SQL Server
No ratings yet
Normalization in SQL Server
11 pages
Cloud Computing
No ratings yet
Cloud Computing
88 pages
CIS Ubuntu Linux 20.04 LTS Benchmark v2.0.1
No ratings yet
CIS Ubuntu Linux 20.04 LTS Benchmark v2.0.1
941 pages
Assignment No. 3: ND ST
No ratings yet
Assignment No. 3: ND ST
11 pages
KDC 248 U
No ratings yet
KDC 248 U
22 pages
IBM Documentation CLG
No ratings yet
IBM Documentation CLG
53 pages
Media and Information Literacy 2
No ratings yet
Media and Information Literacy 2
46 pages
Java and Mathematica
No ratings yet
Java and Mathematica
4 pages
10.Amazon Web Services - Lambda
No ratings yet
10.Amazon Web Services - Lambda
5 pages
FRST Tutorial - How To Use Farbar Recovery Scan Tool - Malware Removal Guides and Tutorials
No ratings yet
FRST Tutorial - How To Use Farbar Recovery Scan Tool - Malware Removal Guides and Tutorials
48 pages
E - Procurement System
No ratings yet
E - Procurement System
13 pages
Expat Manual ABN AMRO - tcm18-145135
No ratings yet
Expat Manual ABN AMRO - tcm18-145135
11 pages
SI Check 000
No ratings yet
SI Check 000
17 pages
FIT3046 Operating Environments: Week 1 Lecture Study Guide 1
No ratings yet
FIT3046 Operating Environments: Week 1 Lecture Study Guide 1
43 pages
Unit1 LoadBal
No ratings yet
Unit1 LoadBal
20 pages
Zeyad Rashad CV - Architect
No ratings yet
Zeyad Rashad CV - Architect
1 page
Computer Science Practical File Printed
100% (1)
Computer Science Practical File Printed
71 pages
Coe Solution Design Guideline Oracle Field Service
No ratings yet
Coe Solution Design Guideline Oracle Field Service
11 pages
Contents: Ledger Master (Create/Edit/Delete) 1. Create A Regular Leger . 02-06 2. Create A Party Leger ... 07
No ratings yet
Contents: Ledger Master (Create/Edit/Delete) 1. Create A Regular Leger . 02-06 2. Create A Party Leger ... 07
9 pages
Java Gson + JSON Tutorial With Examples: Gson Jar To Resolve Dependency
No ratings yet
Java Gson + JSON Tutorial With Examples: Gson Jar To Resolve Dependency
10 pages
Internship Presentation
No ratings yet
Internship Presentation
9 pages
Analog Devices - Integrated, High Power Solutions For Xilinx FPGAs
No ratings yet
Analog Devices - Integrated, High Power Solutions For Xilinx FPGAs
16 pages
AST 0066570 Three Tiers of SSO White Paper 3 12
No ratings yet
AST 0066570 Three Tiers of SSO White Paper 3 12
8 pages
DN-70182 Manual English 20160224
No ratings yet
DN-70182 Manual English 20160224
21 pages
Data Access Control 6.8
No ratings yet
Data Access Control 6.8
2 pages
F1F9 1
No ratings yet
F1F9 1
2 pages
Why Do You Need To Scale Data in KNN: 3 Answers
No ratings yet
Why Do You Need To Scale Data in KNN: 3 Answers
1 page
CCNP Aag
No ratings yet
CCNP Aag
1 page
The Art of R Programming: A Tour of Statistical Software Design
From Everand
The Art of R Programming: A Tour of Statistical Software Design
Norman Matloff
4/5 (30)
ADVANCED DATA STRUCTURES FOR ALGORITHMS: Mastering Complex Data Structures for Algorithmic Problem-Solving (2024)
From Everand
ADVANCED DATA STRUCTURES FOR ALGORITHMS: Mastering Complex Data Structures for Algorithmic Problem-Solving (2024)
VIOLET CASTRO
No ratings yet
SQL Programming & Database Management For Absolute Beginners SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL
From Everand
SQL Programming & Database Management For Absolute Beginners SQL Server, Structured Query Language Fundamentals: "Learn - By Doing" Approach And Master SQL
William Sullivan
5/5 (3)
Design And Analysis Of Algorithm
From Everand
Design And Analysis Of Algorithm
Bhupendra Mandloi
No ratings yet
A General Introduction to Data Analytics
From Everand
A General Introduction to Data Analytics
João Moreira
No ratings yet
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
From Everand
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
Fouad Sabry
No ratings yet

Data Normalization

Uploaded by

Data Normalization

Uploaded by

Data Normalization

Page 1 of 8 Lecture – 6 (Normalization)

Here are three additional points to remember:

Understanding Unnormalized Data or zero normal form

Normalizing to First Normal Form

Page 2 of 8 Lecture – 6 (Normalization)

Normalizing to Second Normal Form

Page 3 of 8 Lecture – 6 (Normalization)

Normalizing to Third Normal Form

Page 4 of 8 Lecture – 6 (Normalization)

Example 2 Employee Database

Boyce Codd Normal Form (BCNF)

Page 5 of 8 Lecture – 6 (Normalization)

1 Jhon 0 09:00 Zorro

Two possible keys

•Two options for 1NF primary key selection:

BCNF Every determinant is a candidate key

Page 6 of 8 Lecture – 6 (Normalization)

BCNF Every determinant is a candidate key

Page 7 of 8 Lecture – 6 (Normalization)

What are the Benefits of Database Normalization?

Page 8 of 8 Lecture – 6 (Normalization)

You might also like