Normalization Lecture
Normalization Lecture
What is Normalization
Database normalization is the
process of removing redundant data
from the tables to improve storage
efficiency, data integrity, and
scalability.
Normalization allows us to organize
data so that it:
• Allows faster access (dependencies
make sense)
• Reduced space (less redundancy)
Data redundancy and update
anomalies
Tables that contain redundant
information may potentially suffer from
update anomalies.
b1
b2 a1
b3
Example: functional
dependence
All sales representatives in a given
pay class have the same commission
rate.
In this table:
• CustomerID and ProdID depend on the
OrderID and no other column (good)
• Stated another way, “If you know the OrderID,
you know the CustID and the ProdID”
So: OrderID CustID, ProdID
OrderID CustID ProdID Price Quantity Total
1 1001 AB-111 50 1,000 50,000
2 1002 AB-111 60 500 30,000
3 1001 ZA-245 35 100 3,500
4 1003 MB-153 82 25 2,050
5 1004 ZA-245 42 10 420
6 1002 ZA-245 40 50 2,000
7 1001 AB-111 75 100 7,500