Database Normalization
Database Normalization
Comprehensive Report
on Database Normalization
Prepared By:
Osan Salah Jamil
Zhwan Othman Majid
Supervised By:
T. Payman rahem
(2024 – 2025)
Table of Contents
Table of Contents ................................................................................................ 1
Introduction ......................................................................................................... 3
1. Atomicity:.................................................................................................. 4
1. Insertion Anomaly:.................................................................................... 4
1. Banking Systems...................................................................................... 6
2. E-Commerce ............................................................................................ 6
3. Healthcare ................................................................................................ 6
Advantages....................................................................................................... 7
1
• Improved Query Performance .................................................................. 7
Challenges ....................................................................................................... 7
• Complexity ............................................................................................... 7
• Performance Trade-offs............................................................................ 7
Conclusion .......................................................................................................... 8
References .......................................................................................................... 9
2
Introduction
Database normalization is a fundamental concept in relational database design,
introduced by Edgar Codd in the 1970s. It aims to reduce data redundancy,
prevent data anomalies, and ensure the consistency and integrity of the data.
3
Key Principles of Normalization
Normalization works based on a few key principles:
4
Repeating Groups and Atomicity
In First Normal Form (1NF), one of the main goals is to eliminate repeating
groups. A repeating group occurs when multiple values for a single attribute
(column) are stored in one record. This violates the principle of atomicity, which
dictates that each column should hold a single, indivisible value.
Example: Consider a table where each student can have multiple subjects.
Storing all subjects in one cell (e.g., "Math, Science, English") violates 1NF.
Instead, we split this into separate rows for each subject, maintaining atomicity
and ensuring the data is properly structured.
Normalization Forms
First Normal Form (1NF)
• Ensures that every field contains atomic data (no multiple values or
repeating groups).
o Example: A table containing multiple subjects per student should be
split into separate rows for each subject.
5
Functional Dependency
Functional dependency refers to a relationship where one attribute (or set of
attributes) uniquely determines another. For example, in a table where each
employee has a unique ID, the employee’s ID determines their name, position,
and salary.
• In 2NF, we ensure that non-key attributes depend on the entire primary key.
If a non-key attribute depends only on part of a composite key, it violates
2NF.
• In 3NF, we remove transitive dependencies, where non-key attributes
depend on other non-key attributes.
Real-World Applications
Normalization is crucial in various industries:
6
Advantages and Challenges
Advantages
• Storage Efficiency: Reduces unnecessary data storage by eliminating
redundancy.
• Improved Query Performance: Simplifies data retrieval, especially in large
databases.
• Data Integrity: Prevents errors caused by inconsistent data.
Challenges
• Complexity: Over-normalization can create a highly fragmented database
structure.
• Performance Trade-offs: Denormalization might be needed for query
performance at scale, which can reintroduce redundancy.
7
Conclusion
Database normalization is essential for organizing relational databases by
minimizing redundancy, preventing anomalies, and ensuring data integrity. By
applying normalization forms such as 1NF, 2NF, 3NF, and BCNF, we optimize the
database structure, which is crucial for the smooth functioning of systems in
sectors like banking, e-commerce, and healthcare.
As database systems evolve, newer models like NoSQL and NewSQL challenge
traditional normalization practices, but understanding these techniques remains
key for database administrators and developers. Ultimately, normalization ensures
efficient, consistent, and reliable data storage, making it a cornerstone of modern
database design.
8
References
1. Database System Concepts by Abraham Silberschatz, Henry Korth, and S.
Sudarshan. This book provides comprehensive coverage of relational
database theory and includes extensive examples on normalization.