0% found this document useful (0 votes)

35 views

CO3-Notes-Database Design and Normalization

The document discusses the process of database design, which involves 6 main steps: 1) Requirements analysis to understand user needs 2) Conceptual design using entity-relationship modeling 3) Logical design translating the conceptual model into database tables and columns 4) Schema refinement through normalization to reduce data redundancy and inconsistencies 5) Physical design implementing the logical design in a database system 6) Application and security design to optimize performance and protect the database

Uploaded by

Nani Yagneshwar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views

CO3-Notes-Database Design and Normalization

Uploaded by

Nani Yagneshwar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

CO-3

Database Design
Design is the process of creating and planning the construction of a system, or environment. It
involves identifying the needs and constraints of the project, developing a concept or idea, and
refining it through iteration until a final design is achieved.

In the Context of Database Design, it involves creating a structured and organized approach to storing
and managing data. The goal of database design is to create a database that is efficient, effective, and
easy to use. Database systems are designed to manage large amounts of information that are typically
related to the operations of an organization or enterprise. The information stored in a database is often
used to support the activities of the organization, whether it is for internal operations or to provide
services to customers or clients.

Good Database Design helps organizations avoid the problems and achieve the benefits through
efficient data retrieval and manipulation, accurate and secure data, and easy maintenance and updates.
Overall, Good Database Design is essential for organizations that want to effectively manage their
data and avoid the consequences of a bad design.

The following six steps has to be followed during its design process

1. Requirements Analysis: In requirement analysis for database design, the main goal is to
understand the needs and expectations of the stakeholders for the database system to be developed.
The following are the steps involved in requirement analysis for database design:

• Identify stakeholders: The first step is to identify the stakeholders who will be using the
database system.
• Gather requirements: Once the stakeholders are identified, the next step is to conduct
interviews or surveys to gather information about their requirements and expectations for the
system.
• Define requirements: The information gathered from the stakeholders can then be used to
create a list of requirements for the database system.

After gathering the requirements these requirements are organized and represented using appropriate
tools and are given as input to the conceptual database design phase.

2. Conceptual Database Design: Specifications are converted into ER-Model or Any other similarly
high-level conceptual database design model. The conceptual database design is the first stage of
database design. ER-Model provides a simple description of the data. It is a high-level view of the
entire database that describes what the database should contain and how the data should be related to
each other. This design is usually presented in an Entity-Relationship (ER) diagram. The main focus
is on the overall structure and relationships between entities. Once the requirement specifications are
converted into ER-Model it is given as input to the logical database design phase.

3. Logical Database Design Schema: The logical database design is an important stage of database
design. It focuses on converting the conceptual design into a detailed logical model that can be
implemented in a database management system (DBMS). The main focus is on defining the data
elements, their relationships, and the data constraints. This design is usually presented in the form of
tables, columns, and relationships.

Logical database design means that ER diagrams are now converted into actual relational database
schemas, and these relational database schemas are given as input to the schema refinement phase.

4. Schema Refinement: Database designed based on the E-R model may have some amount of
• Inconsistency
• Uncertainty
• Redundancy

Refinement process is called Normalization. Defined as a step-by-step process of decomposing a

complex relation into a simple relation. The formal process that can be followed to achieve a good
database design. It is also used to check that an existing design is of good quality. The different stages
of normalization are known as “normal forms”. Guidelines that may be used as measures to determine
the quality of relation schema design:

• Guideline:1: Making sure that the semantics of the attributes is clear in the schema
• Guideline:2: Reducing the redundant information in tuples
• Guideline:3: Reducing the NULL values in tuples
• Guideline:4: Disallowing the possibility of generating spurious tuple

Guideline:1 Making sure that the semantics of the attributes is clear in the relations
The semantics of a relation refers to its meaning resulting from the interpretation of attribute values in
a tuple. Design a relation schema so that it is easy to explain its meaning. Do not combine attributes
from multiple entity types and relationship types into a single relation. Attributes of different entities
(EMPLOYEEs, DEPARTMENTs, PROJECTs) should not be mixed in the same relation. Only
foreign keys should be used to refer to other entities.
Guideline:2 Redundant Information in Tuples and Update Anomalies

• Wastes storage: Grouping attributes into relation schemas has a significant effect on storage
space.
• Problems with update anomalies: Storing natural joins of base relations leads to an additional
problem referred to as update anomalies. These can be classified into insertion anomalies,
deletion anomalies, and modification anomalies.
• Insertion anomalies: To insert a new employee tuple into EMP_DEPT, we must include either
the attribute values for the department that the employee works for, or NULLs (if the
employee does not work for a department as yet). For example, to insert a new tuple for an
employee who works in department number 5, we must enter all the attribute values of
department 5 correctly so that they are consistent with the corresponding values for
department 5 in other tuples in EMP_DEPT. It is difficult to insert a new department that has
no employees as yet in the EMP_DEPT relation.
• Deletion anomalies: If we delete from EMP_DEPT an employee tuple that happens to
represent the last employee working for a particular department, the information concerning
that department is lost inadvertently from the database. This problem does not occur in the
database because DEPARTMENT tuples are stored separately.
• Modification anomalies: In EMP_DEPT, if we change the value of one of the attributes of a
particular department—say, the manager of department 5—we must update the tuples of all
employees who work in that department; otherwise, the database will become inconsistent

Design the base relation schemas so that no insertion, deletion, or modification anomalies are present
in the relations. If any anomalies are present, note them clearly and make sure that the programs that
update the database will operate correctly.

Guideline:3 Reducing the NULL values in tuples:

As far as possible, avoid placing attributes in a base relation whose values may frequently be NULL.
If NULLs are unavoidable, make sure that they apply in exceptional cases only and do not apply to a
majority of tuples in the relation.

Reasons for nulls:

• The attribute does not apply to this tuple. For example, Visa_status may not apply to U.S.
students.
• The attribute value for this tuple is unknown. For example, the Date_of_birth may be
unknown for an employee.

Guideline:4 Disallowing the possibility of generating spurious tuple:

• Bad designs for a relational database may result in erroneous results for certain JOIN
operations.
• The "lossless join" property is used to guarantee meaningful results for join operations.

Design relation schemas so that they can be joined with equality conditions on attributes that are
appropriately related (primary key, foreign key) pairs in a way that guarantees that no spurious tuples
are generated. Avoid relations that contain matching attributes that are not (foreign key, primary key)
combinations because joining on such attributes may produce spurious tuples.

5. Physical Database Design: The physical database design is the third and final stage of database
design. It focuses on implementing the logical design in a specific database management system by
defining the physical database schema. This includes defining the storage structures, access methods,
indexes, and other physical parameters. The main focus is on how the database will be physically
implemented on a specific platform.

6. Application and Security Design: Application and security design for database management
systems (DBMS), there are several key considerations that must be taken into account. Here are some
important points to keep in mind:

• Authentication and authorization: Implement robust authentication and authorization

mechanisms to ensure that only authorized users can access the database. This can include
password policies, two-factor authentication, and access control lists.
• Logging and monitoring: Implement logging and monitoring to detect unusual activity or
unauthorized access attempts.
• Auditing: Perform regular audits of the database to ensure compliance with security policies
and regulations.
• Performance optimization: When designing an application that uses a DBMS, it's important to
optimize performance to ensure that the application runs smoothly and efficiently. This may
involve using indexing and other performance tuning techniques to speed up database queries.
• Backup and recovery: Establish regular backup and recovery procedures to ensure that data is
not lost in case of a system failure or data breach.

Functional Dependencies
A Functional Dependency is a relationship between or among attributes of a relation. For example,
if we know the value of Customer Account no then we can find the value of Customer Balance, if
this is true then we can say that Customer balance is functional dependent on Customer Account
no.

AccountNo → Balance

As another example:

ISBN → Title

Let X and Y are two attributes of a relation and given the value of X, if there is only one value of Y
corresponding to it then Y is said to be functionally dependent on X and this is indicated by the
notation:

X →Y

It means:

➢ Y is functionally dependent on X.
➢ X determines Y
➢ X is called determinant or attributes in the left side of the arrow are called determinants.

A B C D
a1 b1 c1 d1
a1 b2 c1 d2
a2 b2 c2 d2
a2 b3 c2 d3
a3 b4 c2 d4

A → C is satisfied but C→ A is not satisfied.

Consider the relation schema:

Emp_Proj (EmpId,Pnumber,Hours,Ename,Pname,Plocation)

1) EmpId → Ename (since value of an employee Id uniquely determines the employee

name.)

2) Pnumber → { Pname,Location}
3) {EmpId,Pnumber} → Hours

Functional Dependencies may also be based on composite attributes for example:

X, Z→Y

It means that there is only one value of Y corresponding to the given values of X, Z.

Armstrong’s Axioms or Inference Axioms

Or
Inference rules for Functional Dependencies:

Suppose we have F, a set of functional dependencies. To determine whether a FD X→ Y is

logically implied by F, we use a set of rules or axioms. Let R is a relation and W, X, Y , Z are
attributes or subsets of attributes in R:

1) Reflexivity or Reflexive Rule: If Y⊆X, then X →Y. This axiom says indicates that a
given set of attributes the set itself determines any of its subsets.

2) Augmentation Rule: If X →Y then XZ →YZ. We can augment the left side of the FD or
both sides conveniently with one or more attributes but the axiom does not allow
augmenting the right side alone.

3) Transitivity Rule: If X →Y and Y →Z then X →Z.

4) Union Rule: If X →Y and X →Z then X →YZ.

5) Decomposition Rule: X →YZ then X →Y and X →Z.

6) Pseudo Transitivity Rule: If X →Y and YZ →W then XZ→W.

Normalization
The basic objective of Normalization is to reduce redundancy, which means information is to be
stored only once. Storing information several times leads to the insertion, update and deletion
anomalies, wastage of storage space and increase in the total size of the data stored

Normalization of data can be considered a process of analyzing the given relation schemas based on
their FDs and primary keys to achieve the desirable properties of:

(1) Minimizing redundancy and Minimizing the insertion, deletion, and update anomalies

Why Relations are Normalized?

Student_Course Relation:

StudentNo StudentName Address CourseNo CourseName Instructor

85001 Mukul Sec-G CP302 Database Mishra
85001 Mukul Sec-G CP303 Communication Tripathi
85001 Mukul Sec-G CP304 Software Engg Khan
85005 Vipul Sec-A CP302 Database Mishra

Primary Key—(StudentNo,Courseno)

There are following undesirable features or anomalies:

1) Repetition of Information: A lot of information is being repeated. StudentNo, name,

address etc are being repeated often.

2) Insertion Anomalies: It is the inability to represent certain information. Since primary

key is (StudentNo, CourseNo). Any new tuple to be inserted in the relation must have a
value for the primary key since a key may have not null value So we cannot insert the No
and name of a new course in the database until a student enrolls in the course. Similarly
information about a new student cannot be inserted in the database until the student
enrolls in the course.

3) Updation Anomalies: If we want to change the value of one or more attributes of a

particular course in the relation, for example, the Instructor for course no CP302, we
must update all the tuples containing CP302 enrollment. If this modification is not carried
out properly, the database will become inconsistent.

4) Deletion Anomalies: It is a loss of useful information means useful information may be

lost when a tuple is deleted. For example, if we delete the tuple corresponding to student
85001 doing course CP304, we will lose the relevant information about the course
CP304. Similarly, deletion of course CP302 from the database may remove all
information about the student named Vipul.
The above problems arise because the relation StudentCourse has information about students as
well as Course. One solution is to deal with the problems is to decompose the relation into two or
more smaller relations.

Student(StudentNo,studentname,Address)
Course(CourseNo,CourseName,Instructor)

StudentCourse (StudentNo,CourseNo)

Such decomposition is called Normalization and is essential if we wish to overcome undesirable

anomalies.

Normal Forms
A number of Normal forms have been defined for classifying relations. Each Normal form has
associated with it a number of constraints on the kind of FDs that could be associated with the
relation.

The Normal Forms are used to ensure that various types of anomalies and inconsistencies are not
introduced into the database or we can say that a relation is said to be in a normal form if it satisfies
a certain prescribed set of conditions.

There are several stages of Normalization process. These are called the First Normal
Form(1NF),Second Normal Form(2NF),Third Normal Form(3NF),Boyce-Codd Normal
Form(BCNF),Forth Normal form etc.

First Normal Form:

It was defined to disallow multivalued attributes, composite attributes, and their combinations. It
states that the domain of an attribute must include only atomic (simple, indivisible) values and that
the value of any attribute in a tuple must be a single value from the domain of that attribute. Hence,
1NF disallows having a set of values, a tuple of values, or a combination of both as an attribute
value for a single tuple.

A Relation that is not in 1NF

1NF version of the same relation with redundancy.

Student Subject Information

Ashish Code Instructor
CS1 Prof A
CS2 Prof B
CS3 Prof C
Mukesh Code Instructor
CS1 Prof A
CS4 Prof D

A Relation that is not in 1NF

Student Code Lecturer

Ashish CS1 Prof A
Ashish CS2 Prof B
Ashish CS3 Prof C
Mukesh CS1 Prof D
Mukesh CS4 Prof E

1NF version of the same relation with redundancy.

Prime Attributes and Non-Prime Attributes:

An attribute of relation schema R is called a prime attribute of R if it is a member of some

candidate key of R. An attribute is called nonprime if it is not a prime attribute—that is, if it is not
a member of any candidate key.

R(ABCDEFH) AH is only candidate key of R then the attributes A and H are prime attributes and
B,C,D,E,F are non-prime attributes.

Full Functional Dependency and Partial Dependency:

A functional dependency X → Y is a full functional dependency if removal of any attribute A from
X means that the dependency does not hold anymore; that is, for any attribute A ε X, (X −
{A}) does not functionally determine Y.

A functional dependency X → Y is a partial dependency if some attribute A ε X can be removed

from X and the dependency still holds; that is, for some A ε X, (X − {A}) → Y.

EMP_PROJ

EmpId Pnumber Hours Ename Pname Plocation

{EmpId, Pnumber} → Hours is a full dependency (neither EmpId → Hours nor Pnumber → Hours
holds).
However, the dependency {EmpId, Pnumber} → Ename is partial because EmpId→ Ename holds.

Transitive Dependency

A functional dependency X → Y in a relation schema R is a transitive dependency if there exists a

set of attributes Z in R that is neither a candidate key nor a subset of any key of R and both X → Z
and Z → Y hold.

R(A,B,C,D,E) and given set of FDs F={ AB →C,B→D,C→E) and AB is the candidate key.Since

AB → C and C→ E therefore AB→ E

E is transitively dependent on the key

Second Normal Form:

A relation schema R is in 2NF if it is in 1NF and if every nonprime attribute A in R is fully
functionally dependent on the key of Relation R ore we can say that a 2NF does not permit partial
dependency between a non prime attribute and the key of the relation.Consider the relation
EMP_PROJ:

EMP_PROJ

EmpId Pnumber Hours Ename Pname Plocation

FDs are

1) EmpId ,Pnumber → Hours

2) EmpId → Ename
3) Pnumber → Pname,Plocation

The test for 2NF involves testing for functional dependencies whose left-hand side attributes are
part of the primary key. If the primary key contains a single attribute, the test need not be applied at
all.

The EMP_PROJ relation in Figure is in 1NF but is not in 2NF because:

1) Non Prime attribute Ename is Partially functional dependent on the key.

2) Non Prime attributes Pname, Plocation are Partially functional dependent on the key.
3) But the Non Prime attribute hours is fully functional dependent on the key.

If a relation schema is not in 2NF, it can be second normalized or 2NF normalized into a number of
2NF relations in which nonprime attributes are associated only with the part of the primary key on
which they are fully functionally dependent. Therefore, we decompose the EMP_PROJ into the
three relation schemas EP1, EP2, and EP3 shown in Figure, each of which is in 2NF.

EP1

EmpId Pnumber Hours

EP2

EmpId EName

EP3

Pnumber Pname PLocation

Third Normal Form:

A Relation schema R is in 3NF if it satisfies 2NF and no nonprime attribute is transitively

dependent on the key.

A Relation schema in Third normal form does not allow partial or transitive dependencies. The
relation schema EMP_DEPT in Figure is in 2NF, since no partial dependencies on a key exist.
Consider the relation EMP_DEPT:

EMP_DEPT

Ename EmpId Bdate Address Dnumber Dname Dmgr_no

FDs are

1) EmpId→ Ename,Bdate,Address,Dnumber
2) Dnumber → Dname,Dmgr_no

However, EMP_DEPT is not in 3NF because of the transitive dependency of Dmgr_no (and also
Dname) on EmpId via Dnumber.

Since EmpId → Dnumber & Dnumber → Dname

Therefore EmpId → dname (Transitive dependency)

Since EmpId → Dnumber & Dnumber → Dmgr_no

Therefore EmpId → Dmgr_no (Transitive dependency)

We can normalize EMP_DEPT by decomposing it into the two 3NF relation schemas ED1 and
ED2 shown in Figure by removing the attributes that violate 3NF and placing them with the
attributes through which they are transitively dependent into another relation.

ED1

Ename EmpId Bdate Address Dnumber

ED2

Dnumber Dname Dmgr_no

Boyce-Codd Normal Form:

A Relation is in BCNF when every determinant is a candidate key or we can say that if an attribute
of a composite key is dependent on an attribute of the other composite key, a Normalization called
BCNF is needed.

When a table contains only one candidate key, the 3NF and the BCNF are equivalent. BCNF can be
violated when the table contains more than one candidate key.

Consider the relation Teach:

FDs are given as:

FD1: {Student, Course} → Instructor

FD2: Instructor → Course

As we see in the relation, no single attribute is a Candidate key.

Candidate Key1: Student, Instructor

Candidate Key2: Student, Course

The relation TEACH is in 3NF since there are no partial dependencies or transitive dependencies.
We see that relation is not in BCNF because although Instructor is a determinant, it is not a
Candidate key.

We can convert the TEACH relation into BCNF by dividing it into two relations. The attribute that
is a determinant but not a Candidate key must also be placed in a separate relation and must be the
key of that relation.

TEACH1

Instructor Course

TEACH2

Instructor Student

Fourth Normal Form:

When a relation is in BCNF, there are no longer any anomalies that result from functional
dependencies. However, there may still be anomalies that result from Multivalued Dependency.
For example:
StudentId Subject Activity
100 Music Swimming
100 Accounting Swimming
100 Music Tennis
100 Accounting Tennis
150 Math Jogging

The candidate key is

(StudentId,Subject,Activity)

Multivalued dependencies are:

StudentId →→ Subject

StudentId →→Activity

In general, Multivalued dependency exists when a relation has at least three attributes, two of
them are multivalued and their values depend on only the third attribute. In other words, in a
Relation R(A,B,C) a multivalued dependency exists if A determines multiple values of B (
A→→B) and A determines multiple values of C (A →→C) and B and C are independent of
each other.

A relation is in 4NF if it is BCNF and has no Multivalued dependencies. So, we can say that
4NF is needed when a relation has undesirable Multivalued dependencies. We have to
eliminate these anomalies by creating two relations, each one storing data for only one of the
two Multivalued attributes.

StudentId Subject
100 Music
100 Accounting
150 Math

StudentId Activity
100 Swimming
100 Tennis
150 Jogging

Now these both relations are in Fourth Normal form as each relation has only one multivalued
attribute.
ANOTHER EXAMPLE OF FORTH NORMAL FORM (4NF)

LOSSLESS AND LOSSY DECOMPOSITION:

Fifth Normal Form:

The Fifth normal form (5NF) is generally not implemented in real life database design. But we
must learn the concept about it. 5NF is also known as Project join normal form (PJ/NF). A
relation will be in 5NF if

• It is in 4NF
• It does not have join dependency.

Stuvia 2485210 h13 624 - v5.0 Enu Hcip Storage v5.0 Exam Dumps
No ratings yet
Stuvia 2485210 h13 624 - v5.0 Enu Hcip Storage v5.0 Exam Dumps
8 pages
Accenture Accelerating Big Data Platform Adoption VF
No ratings yet
Accenture Accelerating Big Data Platform Adoption VF
11 pages
Chapter 9. Database Design
100% (1)
Chapter 9. Database Design
52 pages
Phases of Database Design
80% (10)
Phases of Database Design
4 pages
ISB CheatSheet
100% (2)
ISB CheatSheet
12 pages
Major Project Proposal ON: "Hospital Management System"
No ratings yet
Major Project Proposal ON: "Hospital Management System"
6 pages
The Main Stages of Database Design
No ratings yet
The Main Stages of Database Design
4 pages
ER Diagram (Entity-Relationship Model) : Database Design
No ratings yet
ER Diagram (Entity-Relationship Model) : Database Design
35 pages
13012204771
No ratings yet
13012204771
8 pages
DBLC
No ratings yet
DBLC
6 pages
Database Design
No ratings yet
Database Design
7 pages
Database Design
No ratings yet
Database Design
11 pages
4.Database Design
No ratings yet
4.Database Design
24 pages
Database Design Life Cycle, Database Design Group 29
No ratings yet
Database Design Life Cycle, Database Design Group 29
14 pages
DB Desing&other Learning
No ratings yet
DB Desing&other Learning
24 pages
Conceptual Database Design
No ratings yet
Conceptual Database Design
50 pages
Dbms
No ratings yet
Dbms
99 pages
Database Development Tutorial
100% (1)
Database Development Tutorial
14 pages
18BCS42C U4
No ratings yet
18BCS42C U4
17 pages
cb3401-unit-2
No ratings yet
cb3401-unit-2
24 pages
Importance of Database Design in DBMS
No ratings yet
Importance of Database Design in DBMS
5 pages
Normalization (3)
No ratings yet
Normalization (3)
175 pages
11700220010_PCC-CS-601_CA2
No ratings yet
11700220010_PCC-CS-601_CA2
8 pages
Chapter 6 Designing Databases
No ratings yet
Chapter 6 Designing Databases
36 pages
Unit 6 - Normalization
No ratings yet
Unit 6 - Normalization
10 pages
Database Design 1
100% (1)
Database Design 1
4 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
6 pages
20CS404-DBMS (5 Unit Notes)-82-140
No ratings yet
20CS404-DBMS (5 Unit Notes)-82-140
59 pages
DATABASE SYSTEM DEVELOPMENT LIFECYLE Summary
No ratings yet
DATABASE SYSTEM DEVELOPMENT LIFECYLE Summary
9 pages
Page 25 Onward
No ratings yet
Page 25 Onward
6 pages
Exit_EX_Tutorials
No ratings yet
Exit_EX_Tutorials
61 pages
Unit1 And2 Dbms
No ratings yet
Unit1 And2 Dbms
35 pages
Database Design Lecture Notes
No ratings yet
Database Design Lecture Notes
9 pages
Database basics 2
No ratings yet
Database basics 2
20 pages
Review Questions
No ratings yet
Review Questions
29 pages
Arsalan
No ratings yet
Arsalan
12 pages
Database Design: Chapter Three
No ratings yet
Database Design: Chapter Three
51 pages
Database Design
No ratings yet
Database Design
4 pages
Module -III
No ratings yet
Module -III
38 pages
Connecting With Computer Science Chapter 6 Review
No ratings yet
Connecting With Computer Science Chapter 6 Review
5 pages
Dbms Unit II
No ratings yet
Dbms Unit II
16 pages
Designing A Database Ok
No ratings yet
Designing A Database Ok
13 pages
5.relational DB Design
No ratings yet
5.relational DB Design
30 pages
SSAD chapter_4 Note
No ratings yet
SSAD chapter_4 Note
6 pages
DBMS UNIT 2
No ratings yet
DBMS UNIT 2
276 pages
Relational Odel DBMS
No ratings yet
Relational Odel DBMS
14 pages
Unit 10 complete assignment
No ratings yet
Unit 10 complete assignment
36 pages
UNIT THREE
No ratings yet
UNIT THREE
15 pages
DBMS
No ratings yet
DBMS
7 pages
4 DBMS Module-IV
No ratings yet
4 DBMS Module-IV
12 pages
CoSc 2041 chapter 5 and 6-1
No ratings yet
CoSc 2041 chapter 5 and 6-1
15 pages
Data Modeling and Data Models
No ratings yet
Data Modeling and Data Models
4 pages
Data Base Design Process
No ratings yet
Data Base Design Process
2 pages
Chapter Four
No ratings yet
Chapter Four
12 pages
MCS-014 Block 3
No ratings yet
MCS-014 Block 3
70 pages
Lecture 2 - Introduction to Database Design - Conceptual Design
No ratings yet
Lecture 2 - Introduction to Database Design - Conceptual Design
56 pages
DMBS Week 1 and 2 Material
No ratings yet
DMBS Week 1 and 2 Material
104 pages
Database Design and Normalization
No ratings yet
Database Design and Normalization
27 pages
Intro - To-Database - Chapter No 4
No ratings yet
Intro - To-Database - Chapter No 4
45 pages
Designing Databases
No ratings yet
Designing Databases
27 pages
MIS_IRM_Ch03.tgh
No ratings yet
MIS_IRM_Ch03.tgh
22 pages
Chapter 7 - Database Design
No ratings yet
Chapter 7 - Database Design
52 pages
Database Management System
From Everand
Database Management System
Manish Soni
No ratings yet
Oracle Quick Guides: Part 2 - Oracle Database Design
From Everand
Oracle Quick Guides: Part 2 - Oracle Database Design
Malcolm Coxall
No ratings yet
Abulhasanat DB
No ratings yet
Abulhasanat DB
3 pages
Getting Started with Support from SAP (Support Accreditation)
No ratings yet
Getting Started with Support from SAP (Support Accreditation)
20 pages
Joint Application Development (JAD)
100% (1)
Joint Application Development (JAD)
12 pages
Software Engineering Bhrigu Soni (1)
No ratings yet
Software Engineering Bhrigu Soni (1)
44 pages
Ms Access Advantages
No ratings yet
Ms Access Advantages
6 pages
What is SQL Server_ Introduction, History, Types, Versions
No ratings yet
What is SQL Server_ Introduction, History, Types, Versions
11 pages
Internet Presentation
No ratings yet
Internet Presentation
13 pages
Network Security Project
No ratings yet
Network Security Project
3 pages
Social Media Management Basics
100% (1)
Social Media Management Basics
59 pages
Talend Open Studio For Data Integration: User Guide
No ratings yet
Talend Open Studio For Data Integration: User Guide
452 pages
PCRF Information: UPCC Service Analysis
No ratings yet
PCRF Information: UPCC Service Analysis
7 pages
Isms Certification Readiness Recheck Questionnaire
No ratings yet
Isms Certification Readiness Recheck Questionnaire
19 pages
Voice and Communication Services
No ratings yet
Voice and Communication Services
23 pages
ProPeers - Connect Ask and Grow
No ratings yet
ProPeers - Connect Ask and Grow
1 page
StudyHandbook MIK 2018 - 12042018 Rev
No ratings yet
StudyHandbook MIK 2018 - 12042018 Rev
5 pages
TOGAF® 9 Training Course - Level 1 Foundation 3.1.0 EN
75% (4)
TOGAF® 9 Training Course - Level 1 Foundation 3.1.0 EN
246 pages
Resume Analyser
No ratings yet
Resume Analyser
57 pages
SAP ABAP BEST Practices
100% (1)
SAP ABAP BEST Practices
6 pages
Configuring AudioCodes Mediant 1000 VoIP Media Gateway With Avaya Voice Portal Using SIP Trunks
No ratings yet
Configuring AudioCodes Mediant 1000 VoIP Media Gateway With Avaya Voice Portal Using SIP Trunks
43 pages
Attributes and Usage of Jsp:Usebean Action Tag
No ratings yet
Attributes and Usage of Jsp:Usebean Action Tag
7 pages
Laporan Magang
No ratings yet
Laporan Magang
49 pages
Istio Mesh For Microservices r1
100% (3)
Istio Mesh For Microservices r1
65 pages
Solved BA Case Study - Automated Loan Approval
No ratings yet
Solved BA Case Study - Automated Loan Approval
12 pages
4G Advanced M2M GW: IDG851-LT001
No ratings yet
4G Advanced M2M GW: IDG851-LT001
2 pages
How To Create A Self-Signed Digital Certificate in Microsoft Office 2016
No ratings yet
How To Create A Self-Signed Digital Certificate in Microsoft Office 2016
10 pages
UCS551 Chapter 1 - Introduction To Data Analytics
No ratings yet
UCS551 Chapter 1 - Introduction To Data Analytics
23 pages

CO3-Notes-Database Design and Normalization

Uploaded by

CO3-Notes-Database Design and Normalization

Uploaded by

CO-3

Refinement process is called Normalization. Defined as a step-by-step process of decomposing a

Guideline:3 Reducing the NULL values in tuples:

Reasons for nulls:

Guideline:4 Disallowing the possibility of generating spurious tuple:

• Authentication and authorization: Implement robust authentication and authorization

A → C is satisfied but C→ A is not satisfied.

1) EmpId → Ename (since value of an employee Id uniquely determines the employee

Functional Dependencies may also be based on composite attributes for example:

Armstrong’s Axioms or Inference Axioms

Suppose we have F, a set of functional dependencies. To determine whether a FD X→ Y is

3) Transitivity Rule: If X →Y and Y →Z then X →Z.

4) Union Rule: If X →Y and X →Z then X →YZ.

5) Decomposition Rule: X →YZ then X →Y and X →Z.

6) Pseudo Transitivity Rule: If X →Y and YZ →W then XZ→W.

Why Relations are Normalized?

StudentNo StudentName Address CourseNo CourseName Instructor

There are following undesirable features or anomalies:

1) Repetition of Information: A lot of information is being repeated. StudentNo, name,

2) Insertion Anomalies: It is the inability to represent certain information. Since primary

3) Updation Anomalies: If we want to change the value of one or more attributes of a

4) Deletion Anomalies: It is a loss of useful information means useful information may be

Such decomposition is called Normalization and is essential if we wish to overcome undesirable

First Normal Form:

A Relation that is not in 1NF

Student Subject Information

A Relation that is not in 1NF

Student Code Lecturer

1NF version of the same relation with redundancy.

Prime Attributes and Non-Prime Attributes:

An attribute of relation schema R is called a prime attribute of R if it is a member of some

Full Functional Dependency and Partial Dependency:

A functional dependency X → Y is a partial dependency if some attribute A ε X can be removed

EmpId Pnumber Hours Ename Pname Plocation

A functional dependency X → Y in a relation schema R is a transitive dependency if there exists a

AB → C and C→ E therefore AB→ E

E is transitively dependent on the key

Second Normal Form:

EmpId Pnumber Hours Ename Pname Plocation

1) EmpId ,Pnumber → Hours

The EMP_PROJ relation in Figure is in 1NF but is not in 2NF because:

1) Non Prime attribute Ename is Partially functional dependent on the key.

EmpId Pnumber Hours

Pnumber Pname PLocation

Third Normal Form:

A Relation schema R is in 3NF if it satisfies 2NF and no nonprime attribute is transitively

Ename EmpId Bdate Address Dnumber Dname Dmgr_no

Since EmpId → Dnumber & Dnumber → Dname

Since EmpId → Dnumber & Dnumber → Dmgr_no

Ename EmpId Bdate Address Dnumber

Dnumber Dname Dmgr_no

Boyce-Codd Normal Form:

Consider the relation Teach:

FD1: {Student, Course} → Instructor

As we see in the relation, no single attribute is a Candidate key.

Candidate Key1: Student, Instructor

Fourth Normal Form:

The candidate key is

Multivalued dependencies are:

LOSSLESS AND LOSSY DECOMPOSITION:

You might also like