0% found this document useful (0 votes)

133 views16 pages

Database Normalization 10

This document discusses database normalization. It defines normalization as a process of analyzing relations for anomalies and correcting them by splitting relations into new, related relations. The document outlines several normal forms (1NF, 2NF, 3NF, BCNF) and describes how normalization addresses issues like insertion, deletion, and update anomalies by reducing data redundancy. It provides examples to illustrate the normalization process and definition of first normal form.

Uploaded by

VinodKumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

133 views16 pages

Database Normalization 10

Uploaded by

VinodKumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 16

DATABASE NORMALIZATION

What Youll Learn

This section of notes covers the process of database normalization in which relations
(tables) created from theconversion of the E-R model are analyzed for potential flaws
(anomalies) and these flaws are corrected. The following specific topics are covered:
The Relational Model
Functional Dependencies
Keys and Uniqueness
Modification Anomalies
Normalization Process
First Normal Form
Second Normal Form
Third Normal Form
Boyce-Codd Normal Form
Fourth Normal Form
Fifth Normal Form
Domain/Key Normal Form
De-Normalization
All-In-One Example of normalization.
More Normalization Exercises to try

o
o
o
o
o
o
o

Textbook Resources
Connolly, Begg,
Holowczak

Ch. 8

Database
Systems:
Conolly&Begg
5th Ed: 13 and 14
6th Ed: 14 and 15

Rob/Coronel Elmasri/Navathe Kroenke Hoffer, Pre

(5th ed)
(3rd) ed.
(7th ed.) McFadden

Ch. 4

Ch. 14 and 15

Ch. 5

Ch. 5 and A
B

The Relational Model

As a reminder, the database development process we are following has the these steps:
1.
2.

Gather user/business requirements.

Develop the conceptual E-R Model (shown as an E-R Diagram) based on the
user/business requirements.
3.
Convert the E-R Model to a set of relations in the (logical) relational model
4.
Normalize the relations to remove any anomalies.
5.

Implement the database by creating a table for each normalized relation in a

relational database management system.

What is Normalization?

Normalization is a process in which we systematically examine relations

for anomalies and, when detected, remove those anomalies by splitting up the relation
into two new, related, relations.
Normalization is an important part of the database development process: Often
during normalization, the database designers get their first real look into how the data
are going to interact in the database.
Finding problems with the database structure at this stage is strongly preferred to
finding problems further along in the development process because at this point it is
fairly easy to cycle back to the conceptual model (Entity Relationship model) and make
changes.
Normalization can also be thought of as a trade-off between data redundancy and
performance. Normalizing a relation reduces data redundancy but introduces the need
for joins when all of the data is required by an application such as a report query.
Recall, the Relational Model consists of the elements: relations, which are made up of
attributes.

1.
2.
3.
4.
5.
6.

A relation is a set of attributes with values for each attribute such that:
Each attribute (column) value must be a single value only.
All values for a given attribute (column ) must be of the same data type.
Each attribute (column) name must be unique.
The order of attributes (columns) is insignificant
No two tuples (rows) in a relation can be identical.
The order of the tuples (rows) is insignificant.
From our discussion of E-R Modeling, we know that an Entity typically
corresponds to a relation and that the Entitys attributes become attributes of the relation.
We also discussed how, depending on the relationships between entities, copies
of attributes (the identifiers) were placed in related relations as foreign keys.

The next step is to identify functional dependencies within each relation. Click on
the __Next Page link below to learn more about the normalization process.

Functional Dependencies

A Functional Dependency describes a relationship between attributes within a

single relation.
An attribute is functionally dependent on another if we can use the value of one
attribute to determine the value of another.
Example: Employee_Name is functionally dependent on Social_Security_Number
because Social_Security_Number can be used to uniquely determine the value of
Employee_Name.
We use the arrow symbol to indicate a functional dependency.
X Y is read X functionally determines Y
Here are a few more examples:
Student_ID Student_Major
Student_ID, CourseNumber, Semester Grade
Course_Number, Section Professor, Classroom, NumberOfStudents
SKU Compact_Disk_Title, Artist
CarModel, Options, TaxRate Car_Price

The attributes listed on the left hand side of the are called determinants.
One can read A B as, A determines B. Or more specifically: Given a value for A, we
can uniquely determine one value for B.

Keys and Uniqueness

Key: One or more attributes that uniquely identify a tuple (row) in a relation.
The selection of keys will depend on the particular application being considered.
In most cases the key for a relation will already be specified during the conversion
from the E-R model to a set of relations.
Users can also offer some guidance as to what would make an appropriate key.
Recall that no two relations should have exactly the same values, thus a
candidate key would consist of all of the attributes in a relation.

A key functionally determines a tuple (row). So one functional dependency that

can always be written is:
The Key All other attributes

Modification Anomalies
Once our E-R model has been converted into relations, we may find that some
relations are not properly specified. There can be a number of problems:
o
Deletion Anomaly: Deleting one fact or data point from a relation results

in other information being lost.

o
Insertion Anomaly: Inserting a new fact or tuple into a relation requires
we have information from two or more entities this situation might not be feasible.
o
Update Anomaly: Updating one fact in a relation requires us to update
multiple tuples.

Anomaly Example 1

Here is an example to illustrate these anomalies: Consider a very common

CUSTOMER relation:
CUSTOMER(CustomerID, CustomerName, Street, City, State,
PostalCode)
In the United States, the PostalCode (or ZipCode) references a specific City and
State so one might have data such as:
CustomerID

Name

Street

City

State PostalCo

C101

Bill Smith

123 First St.

New Brunswick

07101

C102

Mary Green

11 Birch St.

Old Bridge

07066

C103

Ted Jones

3 Academy St.

Old Bridge

07066

C104

Sally Taylor

446 First Ave.

New Brunswick

07101

C105

Mary Miller

44 Toga Ct.

Farmingdale

11735

Insertion Anomaly: What happens if we go to add a new Customer: C106, Joe

Feldman, 99 Ninth St., Springfield, NJ
What we know about Joe is that he lives in Springfield, NJ (one fact) but we may not
know his PostalCode.
We will need to get that additional fact (the fact that the PostalCode for Springfield, NJ is
07081.

Deletion Anomaly: What happens if we delete customer C105: Then we not only
remove the customer information but we also remove (lose) the fact that Farmingdale,
NY has postal code 11735.
Modification Anomaly: It is possible that when a town grows in population, the zip
code will be split into two (or more) new zip codes.
For example, if Old Bridge, NJ splits its zip code, then we will have to update many
different tuples even though we are only changing one fact about Old Bridges zip code.

Anomaly Example 2

Here is another example to illustrate anomalies: A company has a Purchase

Order form:

Our dutiful consultant creates the E-R Model directly matching the purchase

order:

When we follow the steps to convert to a set of relations this results in two
relations (keys are underlined):
PO_HEADER (PO_Number, PODate, Vendor, Ship_To, ...)

LINE_ITEMS (PO_Number, ItemNum, PartNum, Description, Price,

Qty)

Consider some sample data for the LINE_ITEMS relation:

PO_Number

ItemNum

PartNum

Description

Price

O101

I01

P99

Plate

$3.00

O101

I02

P98

Cup

$1.00

O101

I03

P77

Bowl

$2.00

O102

I01

P99

Plate

$3.00

O102

I02

P77

Bowl

$2.00

O103

I01

P33

Fork

$2.50

What are some of the problems with this relation ?

What happens if we want to add the fact that Order O103 has quantity 5 of
part P99 ?

2.
3.

What happens when we delete item I02 from Order O101 ?

What happens if we want to change the price of the Plate (P99)?
These problems occur because the relation in question contains data about 2 or
more themes.
Typical way to solve these anomalies is to split the relation in to two or more
relations This is part of theProcess called Normalization discussed next.
On the next page we will formally define the Normalization Process.

Normalization Process

o
o
o
o
o
o
o

1.
2.
3.
4.
5.

Relations can fall into one or more categories (or classes) called Normal Forms
Normal Form: A class of relations free from a certain set of modification
anomalies.
Normal forms are given names such as:
First normal form (1NF)
Second normal form (2NF)
Third normal form (3NF)
Boyce-Codd normal form (BCNF)
Fourth normal form (4NF)
Fifth normal form (5NF)
Domain-Key normal form (DK/NF)
These forms are cumulative. A relation in Third normal form is also in 2NF and
1NF.
The Normalization Process for a given relation consists of:
Specify the Key of the relation
Specify the functional dependencies of the relation.
Sample data (tuples) for the relation can assist with this step.
Apply the definition of each normal form (starting with 1NF).
If a relation fails to meet the definition of a normal form, change the relation (most often
by splitting the relation into two new relations) until it meets the definition.
Re-test the modified/new relations to ensure they meet the definitions of each normal
form.
In the next set of notes, each of the normal forms will be defined along with an example
of the normalization steps.

First Normal Form (1NF)

A relation is in first normal form if it meets the definition of a relation:

1.
2.

Each attribute (column) value must be a single value only.

All values for a given attribute (column ) must be of the same type.

3.
4.

Each attribute (column) name must be unique.

The order of attributes (columns) is insignificant

5.
6.

No two tuples (rows) in a relation can be identical.

The order of the tuples (rows) is insignificant.
If you have a key defined for the relation, then you can meet the unique row requirement.
Example relation in 1NF (note that key attributes are underlined):
STOCKS (Company, Symbol, Headquarters, Date, Close_Price)

Company

Symbol

Headquarters

Date

Close Price

Microsoft

MSFT

Redmond, WA

09/07/2013

23.96

Microsoft

MSFT

Redmond, WA

09/08/2013

23.93

Microsoft

MSFT

Redmond, WA

09/09/2013

24.01

Oracle

ORCL

Redwood Shores, CA

09/07/2013

24.27

Oracle

ORCL

Redwood Shores, CA

09/08/2013

24.14

Oracle

ORCL

Redwood Shores, CA

09/09/2013

24.33

Note that the key (which consists of the Symbol and the Date) can uniquely determine the
Company, headquarters and Close Price of the stock. Here was assume that Symbol must be
unique but Company, Headquarters, Date and Price are not unique

Second Normal Form (2NF)

A relation is in second normal form (2NF) if all of its non-key attributes are
dependent on all of the key.
Another way to say this: A relation is in second normal form if it is free from
partial-key dependencies
Relations that have a single attribute for a key are automatically in 2NF.

This is one reason why we often use artificial identifiers (non-composite keys) as

keys.

In the example below, Close Price is dependent on Company, Date

The following example relation is not in 2NF:
STOCKS (Company, Symbol, Headquarters, Date, Close_Price)

Company

Symbol

Headquarters

Date

Microsoft

MSFT

Redmond, WA

09/07/2013

23.96

Microsoft

MSFT

Redmond, WA

09/08/2013

23.93

Microsoft

MSFT

Redmond, WA

09/09/2013

24.01

Oracle

ORCL

Redwood Shores, CA

09/07/2013

24.27

Oracle

ORCL

Redwood Shores, CA

09/08/2013

24.14

Oracle

ORCL

Redwood Shores, CA

09/09/2013

24.33

To start the normalization process, list the functional dependencies (FD):

FD1: Symbol, Date Company, Headquarters, Close Price
FD2: Symbol Company, Headquarters

Consider that Symbol, Date Close Price.

So we might use Symbol, Date as our key.

However we also see that: Symbol Headquarters

This violates the rule for 2NF in that a part of our key key determines a non-key
attribute.
Another name for this is a Partial key dependency. Symbol is only a part of the
key and it determines a non-key attribute.
Also, consider the insertion and deletion anomalies.
One Solution: Split this up into two new relations:
COMPANY (Company, Symbol, Headquarters)

STOCK_PRICES (Symbol, Date, Close_Price)

Close Pri

At this point we have two new relations in our relational model. The original
STOCKS relation we started with is removed form the model.

Sample data and functional dependencies for the two new relations:
COMPANY Relation:
Company

Symbol

Headquarters

Microsoft

MSFT

Redmond, WA

Oracle

ORCL

Redwood Shores, CA

FD1: Symbol Company, Headquarters

STOCK_PRICES relation:
Symbol

Date

Close Price

MSFT

09/07/2013

23.96

MSFT

09/08/2013

23.93

MSFT

09/09/2013

24.01

ORCL

09/07/2013

24.27

ORCL

09/08/2013

24.14

ORCL

09/09/2013

24.33

FD1: Symbol, Date Close Price

In checking these new relations we can confirm that they meet the definition of
1NF (each one has well defined unique keys) and 2NF (no partial key dependencies).

Third Normal Form (3NF)

A relation is in third normal form (3NF) if it is in second normal form and it

contains no transitive dependencies.
Consider relation R containing attributes A, B and C. R(A, B, C)
If A B and B C then A C
Transitive Dependency: Three attributes with the above dependencies.
Example: At CUNY:

Course_Code Course_Number, Section

Course_Number, Section Classroom, Professor

Consider one of the new relations we created in the STOCKS example for 2nd
normal form:
Company

Symbol

Headquarters

Microsoft

MSFT

Redmond, WA

Oracle

ORCL

Redwood Shores, CA

The functional dependencies we can see are:

FD1: Symbol
Company
FD2: Company Headquarters
so therefore:
Symbol Headquarters

This is a transitive dependency.

What happens if we remove Oracle?
We loose information about 2 different facts.

The solution again is to split this relation up into two new relations:
STOCK_SYMBOLS(Company, Symbol)

COMPANY_HEADQUARTERS(Company, Headquarters)

This gives us the following sample data and FD for the new relations
Company

Symbol

Microsoft

MSFT

Oracle

ORCL

FD1: Symbol Company

Company
Microsoft

Headquarters
Redmond, WA

Oracle

FD1:

Redwood Shores, CA

Company

Headquarters

Again, each of these new relations should be checked to ensure they meet the
definition of 1NF, 2NF and now 3NF.

Boyce-Codd Normal Form (BCNF)

A relation is in BCNF if every determinant is a candidate key.
Recall that not all determinants are keys.
Those determinants that are keys we initially call candidate keys.
Eventually, we select a single candidate key to be the key for the relation.
Consider the following example:
Funds consist of one or more Investment Types.
Funds are managed by one or more Managers
Investment Types can have one more Managers
Managers only manage one type of investment.
Relation: FUNDS (FundID, InvestmentType, Manager)

o
o
o
o

FundID

InvestmentType

Manager

Common Stock

Smith

Municipal Bonds

Jones

Common Stock

Green

Growth Stocks

Brown

Common Stock

Smith

FD1:
FD2:
FD3:

FundID, InvestmentType Manager

FundID, Manager
InvestmentType
Manager
InvestmentType

In this case, the combination FundID and InvestmentType form a candidate

key because we can use FundID,InvestmentType to uniquely identify a tuple in the
relation.

Similarly, the combination FundID and Manager also form a candidate

key because we can use FundID, Manager to uniquely identify a tuple.
Manager by itself is not a candidate key because we cannot use Manager alone
to uniquely identify a tuple in the relation.
Is this relation FUNDS(FundID, InvestmentType, Manager) in 1NF, 2NF or 3NF ?
Given we pick FundID, InvestmentType as the Primary Key: 1NF for sure.
2NF because all of the non-key attributes (Manager) is dependant on all of the key.
3NF because there are no transitive dependencies.

o
o
o
2.

However consider what happens if we delete the tuple with FundID 22. We loose
the fact that Brown manages the InvestmentType Growth Stocks.
Therefore, while FUNDS relation is in 1NF, 2NF and 3NF, it is in BCNF because
not all determinants (Manager in FD3) are candidate keys.
The following are steps to normalize a relation into BCNF:
List all of the determinants.
See if each determinant can act as a key (candidate keys).
For any determinant that is not a candidate key, create a new relation from
the functional dependency. Retain the determinant in the original relation.
For our example:
FUNDS (FundID, InvestmentType, Manager)

o The determinants are:

o FundID, InvestmentType
o FundID, Manager
Manager
o Which determinants can act as keys ?
o FundID, InvestmentType YES
o FundID, Manager YES
Manager NO
o Create a new relation from the functional dependency:
MANAGERS(Manager, InvestmentType)
FUND_MANAGERS(FundID, Manager)
In this last step, we have retained the determinant Manager in the original relation
MANAGERS.

3. Each of the new relations sould be checked to ensure they meet the definitions of 1NF,
2NF, 3NF and BCNF

Fourth Normal Form (4NF)

A relation is in fourth normal form if it is in BCNF and it contains no multivalued

dependencies.
Multivalued Dependency: A type of functional dependency where the

determinant can determine more than one value.

More formally, there are 3 criteria:

1.
There must be at least 3 attributes in the relation. call them A, B, and C,
for example.
2.
Given A, one can determine multiple values of B.
Given A, one can determine multiple values of C.
3.

B and C are independent of one another.

Book example:
Student has one or more majors.
Student participates in one or more activities.
StudentID

Major

Activities

100

CIS

Baseball

100

CIS

Volleyball

100

Accounting

Baseball

100

Accounting

Volleyball

200

Marketing

Swimming

FD1: StudentID Major

FD2: StudentID Activities
Portfolio ID

Stock Fund

Bond Fund

1.
2.
3.
4.

999

Janus Fund

Municipal Bonds

999

Janus Fund

Dreyfus Short-Intermediate Municipal Bond Fund

999

Scudder Global Fund

Municipal Bonds

999

Scudder Global Fund

Dreyfus Short-Intermediate Municipal Bond Fund

888

Kaufmann Fund

T. Rowe Price Emerging Markets Bond Fund

A few characteristics:
No regular functional dependencies
All three attributes taken together form the key.
Latter two attributes are independent of one another.
Insertion anomaly: Cannot add a stock fund without adding a bond fund
(NULL Value). Must always maintain the combinations to preserve the meaning.
Stock Fund and Bond Fund form a multivalued dependency on Portfolio ID.
PortfolioID

Stock Fund
PortfolioID

Bond Fund

Resolution: Split into two tables with the common key:

Portfolio ID

Stock Fund

999

Janus Fund

999

Scudder Global Fund

888

Kaufmann Fund

Portfolio ID

Bond Fund

999

Municipal Bonds

999

Dreyfus Short-Intermediate Municipal Bond Fund

888

T. Rowe Price Emerging Markets Bond Fund

Fifth Normal Form (5NF)

Also called Projection Join Normal form.

There are certain conditions under which after decomposing a relation, it cannot
be reassembled back into its original form.
We dont consider these issues here.

CrossRef Help Site
No ratings yet
CrossRef Help Site
168 pages
Institutional Repositories
No ratings yet
Institutional Repositories
51 pages
CrossRefSchemaDocumentation4 1 0
No ratings yet
CrossRefSchemaDocumentation4 1 0
220 pages
Power BI Vs Excel: Which One Is Better - JBK Academy
No ratings yet
Power BI Vs Excel: Which One Is Better - JBK Academy
9 pages
Term Project - Week 8 - (17%) :: 5 No Fields Beyond Those in The Report / Spreadsheet Are Needed
No ratings yet
Term Project - Week 8 - (17%) :: 5 No Fields Beyond Those in The Report / Spreadsheet Are Needed
12 pages
Student Data
No ratings yet
Student Data
28 pages
Digital Object Identifier: Uses. The DOI Is A Persistent Identifier of Intellec
No ratings yet
Digital Object Identifier: Uses. The DOI Is A Persistent Identifier of Intellec
3 pages
Student Tracking System: Pagadian City Science High School
No ratings yet
Student Tracking System: Pagadian City Science High School
1 page
Learning Contract Intern
No ratings yet
Learning Contract Intern
3 pages
Student Tracking System
No ratings yet
Student Tracking System
2 pages
Aspire Advising Guide
No ratings yet
Aspire Advising Guide
102 pages
Lesson 6
No ratings yet
Lesson 6
34 pages
Summarising and Analysing Data
No ratings yet
Summarising and Analysing Data
10 pages
Import Export OJS
No ratings yet
Import Export OJS
9 pages
Exp19 Excel Ch03 CapAssessment Movies Instructions
No ratings yet
Exp19 Excel Ch03 CapAssessment Movies Instructions
4 pages
BTM 200 Mini Case
No ratings yet
BTM 200 Mini Case
10 pages
Class Five
No ratings yet
Class Five
85 pages
DOI and URL Flowchart: APA Publication Manual (See Also Pp. 188-192)
No ratings yet
DOI and URL Flowchart: APA Publication Manual (See Also Pp. 188-192)
1 page
Internship Handbook
No ratings yet
Internship Handbook
24 pages
DOI Handbook - Applications
No ratings yet
DOI Handbook - Applications
6 pages
Rethinking The Institutional Repository
No ratings yet
Rethinking The Institutional Repository
14 pages
Management Operations Management Internship Learning Objectives
No ratings yet
Management Operations Management Internship Learning Objectives
2 pages
Learner 1: Student Self Monitoring Log
No ratings yet
Learner 1: Student Self Monitoring Log
3 pages
Database Development Workbook
No ratings yet
Database Development Workbook
39 pages
Career Horoscope PDF
No ratings yet
Career Horoscope PDF
2 pages
College Course Manager1
No ratings yet
College Course Manager1
6 pages
Excel Xapplication Capstone Exercise001
0% (1)
Excel Xapplication Capstone Exercise001
2 pages
Excel 2G Inventory Instructions
No ratings yet
Excel 2G Inventory Instructions
4 pages
Microsoft Excel 2019 Vba and Macros PDF
No ratings yet
Microsoft Excel 2019 Vba and Macros PDF
2 pages
DBMS Normalization
No ratings yet
DBMS Normalization
15 pages
Module 3
No ratings yet
Module 3
55 pages
Excel 1 Lab Exercises PDF
No ratings yet
Excel 1 Lab Exercises PDF
8 pages
Workshop 03 - S1 - 2020 - Solutions For Business Statistics
No ratings yet
Workshop 03 - S1 - 2020 - Solutions For Business Statistics
13 pages
Stats Formulas
No ratings yet
Stats Formulas
54 pages
Jing PDF Tutorial Template
No ratings yet
Jing PDF Tutorial Template
9 pages
MS Access Tutorial
No ratings yet
MS Access Tutorial
147 pages
Performance Tracker '22
No ratings yet
Performance Tracker '22
34 pages
Module 2 Assignment
0% (1)
Module 2 Assignment
6 pages
MySQL Complete Guide
100% (4)
MySQL Complete Guide
199 pages
Wywla Internship Workbook
No ratings yet
Wywla Internship Workbook
20 pages
Using Microsoft Excel For Data Processing: Practical Work
No ratings yet
Using Microsoft Excel For Data Processing: Practical Work
14 pages
EX2013 Capstone Level3 Instructions
0% (2)
EX2013 Capstone Level3 Instructions
5 pages
Planning Digital Libraries
No ratings yet
Planning Digital Libraries
14 pages
Green Bridge Excel Data Analytics Training Courseware
No ratings yet
Green Bridge Excel Data Analytics Training Courseware
227 pages
Student Progress Tracker - Class - , - Year Level - , - Sem # - , - School Year
No ratings yet
Student Progress Tracker - Class - , - Year Level - , - Sem # - , - School Year
13 pages
OCP - SQL&PL - SQL (Vol2)
No ratings yet
OCP - SQL&PL - SQL (Vol2)
348 pages
Final Excel Assignment
100% (1)
Final Excel Assignment
4 pages
Create An Excel UserForm
No ratings yet
Create An Excel UserForm
11 pages
DBMS Lab Manual
No ratings yet
DBMS Lab Manual
51 pages
Internship Proposal Learning Agreement
No ratings yet
Internship Proposal Learning Agreement
4 pages
SQL3
No ratings yet
SQL3
6 pages
Normalization and Its Types
100% (2)
Normalization and Its Types
12 pages
Introduction To Data Tables and Data Table Exercises: Tools For Excel Modelling
No ratings yet
Introduction To Data Tables and Data Table Exercises: Tools For Excel Modelling
25 pages
Ejercicio Base Datos Clientes
No ratings yet
Ejercicio Base Datos Clientes
7 pages
Chapter 5 - Database Management System - Pure Lecture
No ratings yet
Chapter 5 - Database Management System - Pure Lecture
26 pages
Lab Assignment 2 Ms Excel 2 Instructions
No ratings yet
Lab Assignment 2 Ms Excel 2 Instructions
2 pages
SQL Lab Manual
50% (2)
SQL Lab Manual
53 pages
Excel On Steroids Tips and Tricks Vol1
No ratings yet
Excel On Steroids Tips and Tricks Vol1
31 pages
Types of Relationships - Chapter 10. Table Relationships - Part II - The Design Process - Database Design For Mere Mortals - SQL - ETutorials
No ratings yet
Types of Relationships - Chapter 10. Table Relationships - Part II - The Design Process - Database Design For Mere Mortals - SQL - ETutorials
18 pages
Peer Graded Assignment Data Analytics
No ratings yet
Peer Graded Assignment Data Analytics
7 pages
Overview of Forms, Form Controls, and Activex Controls On A Worksheet
No ratings yet
Overview of Forms, Form Controls, and Activex Controls On A Worksheet
8 pages
Excel Intermediate-Advanced Practice Activities
No ratings yet
Excel Intermediate-Advanced Practice Activities
4 pages
DBMS Unit1
No ratings yet
DBMS Unit1
47 pages
Index: Online Banking
No ratings yet
Index: Online Banking
59 pages
ER Diagram
No ratings yet
ER Diagram
86 pages
Dbms Unit 1-1
No ratings yet
Dbms Unit 1-1
87 pages
Chapter 8
No ratings yet
Chapter 8
26 pages
DMS Assignment
No ratings yet
DMS Assignment
17 pages
Fall 2019 Excel Project #1 - Instructions
No ratings yet
Fall 2019 Excel Project #1 - Instructions
7 pages
TSQL2012
No ratings yet
TSQL2012
178 pages
Student Internship Evaluation Form
No ratings yet
Student Internship Evaluation Form
4 pages
Uml
No ratings yet
Uml
10 pages
From Enterprise Models To Dimensional Models - A Methodology For Data Warehouse and Data Mart Design
No ratings yet
From Enterprise Models To Dimensional Models - A Methodology For Data Warehouse and Data Mart Design
12 pages
Excel User Forms Tips 2
No ratings yet
Excel User Forms Tips 2
8 pages
DB 03
No ratings yet
DB 03
24 pages
DBMSNotes
No ratings yet
DBMSNotes
17 pages
New Microsoft Word Document4
No ratings yet
New Microsoft Word Document4
1 page
Chapter 3 Relational Database - Logical Design
No ratings yet
Chapter 3 Relational Database - Logical Design
44 pages
Database Constraints: What Are Keys?
No ratings yet
Database Constraints: What Are Keys?
8 pages
Test 02 - Attempt Review
No ratings yet
Test 02 - Attempt Review
8 pages
Can A Country Print Money and Get Rich
No ratings yet
Can A Country Print Money and Get Rich
1 page
ECSE - CBP Final
No ratings yet
ECSE - CBP Final
17 pages
SQL Queries
No ratings yet
SQL Queries
4 pages
Chapter 2. Service Contracts
No ratings yet
Chapter 2. Service Contracts
12 pages
Five
No ratings yet
Five
1 page
Five
No ratings yet
Five
1 page
Database Normalization 7
No ratings yet
Database Normalization 7
12 pages
Databases December 2015 Sample Examination Paper: Answer ALL Questions. Clearly Cross Out Surplus Answers
No ratings yet
Databases December 2015 Sample Examination Paper: Answer ALL Questions. Clearly Cross Out Surplus Answers
6 pages
Functional Dependency
No ratings yet
Functional Dependency
9 pages
Practical Assignment 1
No ratings yet
Practical Assignment 1
5 pages
Practical 1
No ratings yet
Practical 1
10 pages
hw4 Answerkey
No ratings yet
hw4 Answerkey
6 pages
Problem Statement
No ratings yet
Problem Statement
6 pages
Database Normalization Explained in Simple English
No ratings yet
Database Normalization Explained in Simple English
5 pages
Eliminating Redundant Storing Related Information: 1. First Normal Form (1NF)
No ratings yet
Eliminating Redundant Storing Related Information: 1. First Normal Form (1NF)
3 pages
BEAM Reference Card
No ratings yet
BEAM Reference Card
2 pages
New Microsoft Word Document5
No ratings yet
New Microsoft Word Document5
1 page
Choosing Between Floating and Fixed Rates:: Example
No ratings yet
Choosing Between Floating and Fixed Rates:: Example
1 page
New Microsoft Word Document2
No ratings yet
New Microsoft Word Document2
1 page
Activity - 5 Updated
No ratings yet
Activity - 5 Updated
26 pages
Excel Proficiency Test
No ratings yet
Excel Proficiency Test
3 pages

Database Normalization 10

Uploaded by

Database Normalization 10

Uploaded by

DATABASE NORMALIZATION

What Youll Learn

Rob/Coronel Elmasri/Navathe Kroenke Hoffer, Pre

The Relational Model

Gather user/business requirements.

Implement the database by creating a table for each normalized relation in a

Normalization is a process in which we systematically examine relations

A Functional Dependency describes a relationship between attributes within a

Keys and Uniqueness

A key functionally determines a tuple (row). So one functional dependency that

in other information being lost.

Here is an example to illustrate these anomalies: Consider a very common

123 First St.

446 First Ave.

Insertion Anomaly: What happens if we go to add a new Customer: C106, Joe

Here is another example to illustrate anomalies: A company has a Purchase

LINE_ITEMS (PO_Number, ItemNum, PartNum, Description, Price,

Consider some sample data for the LINE_ITEMS relation:

What are some of the problems with this relation ?

What happens when we delete item I02 from Order O101 ?

First Normal Form (1NF)

A relation is in first normal form if it meets the definition of a relation:

Each attribute (column) value must be a single value only.

Each attribute (column) name must be unique.

No two tuples (rows) in a relation can be identical.

Second Normal Form (2NF)

In the example below, Close Price is dependent on Company, Date

To start the normalization process, list the functional dependencies (FD):

Consider that Symbol, Date Close Price.

However we also see that: Symbol Headquarters

STOCK_PRICES (Symbol, Date, Close_Price)

FD1: Symbol Company, Headquarters

FD1: Symbol, Date Close Price

Third Normal Form (3NF)

A relation is in third normal form (3NF) if it is in second normal form and it

Course_Code Course_Number, Section

The functional dependencies we can see are:

This is a transitive dependency.

FD1: Symbol Company

Boyce-Codd Normal Form (BCNF)

FundID, InvestmentType Manager

In this case, the combination FundID and InvestmentType form a candidate

Similarly, the combination FundID and Manager also form a candidate

o The determinants are:

Fourth Normal Form (4NF)

A relation is in fourth normal form if it is in BCNF and it contains no multivalued

determinant can determine more than one value.

More formally, there are 3 criteria:

B and C are independent of one another.

FD1: StudentID Major

Dreyfus Short-Intermediate Municipal Bond Fund

Scudder Global Fund

Scudder Global Fund

Dreyfus Short-Intermediate Municipal Bond Fund

T. Rowe Price Emerging Markets Bond Fund

Resolution: Split into two tables with the common key:

Scudder Global Fund

Dreyfus Short-Intermediate Municipal Bond Fund

T. Rowe Price Emerging Markets Bond Fund

Fifth Normal Form (5NF)

Also called Projection Join Normal form.

You might also like