Normalization
Normalization
Normalization
Student’s name
Institution
NORMALIZATION 2
Normalization can be defined as a unique technique that organizes data in a given database. It
tend to decompose tables hence eliminating data redundancy and some other undesirable
characteristics. Mostly these undesirable characteristics include: Update, Deletion and Insertion.
form, and this is done by eliminating duplicated data from a relation table (Chapple, M., 2011)
It ensures that data is logically kept, i.e. all data dependencies are sensible.
It is difficult to manipulate and update the database without data loss when normalization is
absent. Anomalies like Insertion and Deletion tend to be more frequent if the database is not
normalized (Codd, E. F., 1982). To better understand the concept of normalization, let us
Updation characteristic: In order to update the address of a teacher who has occur more than
once in the table, it will force us to update the address column in all the other rows, if not, the
Insertion characteristic: In the event that a new admission has to be made, we have a teacher
REG. NO., name and address of a teacher, but if the teacher is not willing to teach any subjects
yet then we will have to insert or put a NULL, this leads to an insertion Anamoly (Lee, H.,
1995).
Deletion Anamoly: Say the teacher REG. NO. 19 has been given only one subject and he drops
it temporarily, deleting that row will make the entire teacher record to be deleted along with it.
NORMALIZATION FORMS.
According to the rule of first normal form, there must be no two rows that contains a repetitive
collection of data, that is, every column set must have its own unique value, in that, multiple
columns cannot be used to search the same row. This means that each table should be organized
into distinctive rows, and every row must have a primary key that classifies it as unique
Usually, the primary key is a single column. However, a combination of more than one column
can be made to establish a single primary key. For instance, let’s consider a table which is not in
1NF:
KEVIN 34 PHYSICS
JORNARA 37 PHYSICS
As the first normal rule states, a row must not contain a column that has more than one item
saved, say separated by commas. Instead of doing so, it must separate such kind of data into
CASSIDY 35 CHEMISTRY
CASSIDY 35 PHYSICS
KEVIN 34 PHYSICS
JORNARA 37 PHYSICS
The use of 1NF increases data redundancy since there will be more columns with similar
According to the second Normal rule, there should be no partial dependency of a column on a
primary key. This means that for a given table which has a concatenated primary key, then each
and every column in the table which is not part of the primary key must depend entirely on the
However, if a column depends on one part only of the concatenated key, then the table is said to
In the above First Normal Form, there are a total of two rows for CASSIDY so as to incorporate
the two subjects he has opted. This is searchable and follows 1NF though an inefficient use of
space. Also, in the above example, the candidate key is (TEACHER, SUBJECT), but age of
teacher solely depends on the teacher column and which is inaccurate according to 2NF. Second
Normal Form is achieved by splitting the subjects into independent components, and use the
teacher names as foreign keys (Kung, H. J., & Hui-Lien, T., 2006)
TEACHER AGE
CASSIDY 35
KEVIN 34
JORNARA 37
In the above teacher table, the candidate key is the teacher column since all other columns
TEACHER SUBJECT
CASSIDY CHEMISTRY
CASSIDY PHYSICS
KEVIN PHYSICS
JORNARA PHYSICS
(Teacher, Subject) is the candidate key in the new subject table. As clearly shown, both the
tables above qualifies to be Second Normal Form thus they will never suffer from an update
Anomalies.
The 3NF states that every attribute of a table which is non-prime must be dependent on the
primary key. In simple terms, it means that the transitive functional dependency of a table should
be removed while the table remains in Second Normal form (Codd, E., 1982).
In the table above, the primary key is the TEACHER REG. NO. Street, city and state are all
dependent on the Zip. This dependency between zip and the other fields is what is called
NORMALIZATION 7
transitive dependency. To make it in 3NF, we have to move the other fields’ i.e. street, city and
state to another new table where zip will be the primary key.
NO.
Removing transitive dependency has the advantage of reducing duplicated data and improving
on data integrity
NORMALIZATION 8
References.
1. Bahmani, A. H., Naghibzadeh, M., & Bahmani, B. (2008, May). Automatic database
2. Biskup, J., Dayal, U., & Bernstein, P. A. (1979, May). Synthesizing independent database
5. Diederich, J., & Milton, J. (1988). New methods and fast algorithms for database
normalization).
NORMALIZATION 9