0% found this document useful (0 votes)

27 views

DBMS - Unit - 3 - Chapter - 2 - Relationl Database Design

Relationl database design

Uploaded by

valanukonda2005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views

DBMS - Unit - 3 - Chapter - 2 - Relationl Database Design

Relationl database design

Uploaded by

valanukonda2005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 45

Unit-III Chapter-II

Relational Database Design

Introduction

Decomposition
Atomic Domains Functional- Using
and First Dependency Multivalued Database-Design
Normal Form Theory Dependencies Process

Decomposition Algorithms for More Normal

Using Functional Decomposition Forms
Dependencies
Introduction
• In general, the goal of relational database design is to generate a set of relation schemas that allows us to store
information without unnecessary redundancy, yet also allows us to retrieve information easily. This is accomplished
by designing schemas that are in an appropriate normal form
• Database normalization is the process of organizing the attributes of the database to reduce or eliminate data
redundancy (having the same data but at different places) .
• Problems because of data redundancy
Data redundancy unnecessarily increases the size of the database as the same data is repeated in many
places. Inconsistency problems also arise during insert, delete and update operations.
• The functional dependency is the relationship between attributes(characteristics) of a table related to each other. It
typically exists between the primary key and non-key attribute within a table.
• X → Y
• The left side of FD is known as a determinant, the right side of the production is known as a dependent.
• Here Emp_Id attribute can uniquely identify the Emp_Name attribute of employee table because if we know the
Emp_Id, we can tell that employee name associated with it.
• Functional dependency can be written as:
• Emp_Id → Emp_Name
• We can say that Emp_Name is functionally dependent on Emp_Id.
Types of Functional Dependencies
• Trivial functional dependency
• In Trivial functional dependency, a dependent is always a subset of the determinant.
• X → Y is called a trivial functional dependency if Y is the subset of X.
• Here, { Employee_Id , Name } → { Name } is a Trivial functional dependency, since the
dependent Name is the subset of determinant { Employee_Id, Name }.
• { Employee_Id } → { Employee_Id }, { Name } → { Name } and { Age } → { Age } are also Trivial.

• Non-Trivial functional dependency

• It is the opposite of Trivial functional dependency. Formally speaking, in Non-Trivial functional
dependency, dependent if not a subset of the determinant.
• Here, { Employee_Id } → { Name } is a non-trivial functional dependency because Name(dependent)
is not a subset of Employee_Id(determinant).
• Similarly, { Employee_Id, Name } → { Age } is also a non-trivial functional dependency.
• Multivalued functional dependency
• In Multivalued functional dependency, attributes in the dependent set are not dependent on each
other.
• For example, X → { Y, Z }, if there exists is no functional dependency between Y and Z, then it is
called as Multivalued functional dependency.
• Here, { Employee_Id } → { Name, Age } is a Multivalued functional dependency, since the dependent
attributes Name, Age are not functionally dependent(i.e. Name → Age or Age → Name doesn’t exist !)
• Transitive functional dependency
• Consider two functional dependencies A → B and B → C then according to the transitivity axiom A →
C must also exist. This is called a transitive functional dependency.
• In other words, dependent is indirectly dependent on determinant in Transitive functional dependency.
• Here, { Employee_Id → Department } and { Department → Street Number } holds true. Hence,
according to the axiom of transitivity, { Employee_Id → Street Number } is a valid functional
dependency.
Advantages of Functional Dependency

• It is used to maintain the quality of data in the database.

• It expresses the facts about the database design.

• It helps in clearly defining the meanings and constraints of databases.

• It helps to identify bad designs.

• Functional Dependency removes data redundancy where the same values should not be repeated at multiple
locations in the same database table.

• The process of Normalization starts with identifying the candidate keys in the relation. Without functional
dependency, it's impossible to find candidate keys and normalize the database.
Inference rules
• The inference rule is a type of assertion. It can apply to a set of FD(functional dependency) to derive other
FD.
• Using the inference rule, we can derive additional functional dependency from the initial set.
• Reflexive Rule (IR1)
• In the reflexive rule, if Y is a subset of X, then X determines Y.
• If X ⊇ Y then X → Y
• Example:
• X = {a, b, c, d, e}
• Y = {a, b, c}
• Augmentation Rule (IR2)
• The augmentation is also called as a partial dependency. In augmentation, if X determines Y, then XZ
determines YZ for any Z
• If X → Y then XZ → YZ
• Example:
• For R(ABCD), if A → B then AC → BC
• Transitive Rule (IR3)
• In the transitive rule, if X determines Y and Y determine Z, then X must also determine Z.
• If X → Y and Y → Z then X → Z

• Union Rule (IR4)

• Union rule says, if X determines Y and X determines Z, then X must also determine Y and Z.
• If X → Y and X → Z then X → YZ
• Proof:
• 1. X → Y (given)
2. X → Z (given)
3. X → XY (using IR2 on 1 by augmentation with X. Where XX = X)
4. XY → YZ (using IR2 on 2 by augmentation with Y)
5. X → YZ (using IR3 on 3 and 4)
• Decomposition Rule (IR5)
• Decomposition rule is also known as project rule. It is the reverse of union rule.
• This Rule says, if X determines Y and Z, then X determines Y and X determines Z separately.
• If X → YZ then X → Y and X → Z
• Proof:
• 1. X → YZ (given)
2. YZ → Y and YZ →Z (consider dependent side of 1 and use IR1 Rule)
3. X → Y and X →Z (using IR3 on 1 and 2)

• Pseudo transitive Rule (IR6)

• In Pseudo transitive Rule, if X determines Y and YZ determines W, then XZ determines W.
• If X → Y and YZ → W then XZ → W
• Proof:
• 1. X → Y (given)
2. YZ →W(given)
3. XZ →YZ(using IR2 on 1 by augmenting with Z)
4. XZ → W (using IR3 on 3 and 2)
Normalization
• A large database defined as a single relation may result in data duplication. This repetition of data may result
in:
• Making relations very large.

• It isn't easy to maintain and update data as it would involve searching many records in relation.

• Wastage and poor utilization of disk space and resources.

• The likelihood of errors and inconsistencies increases.

• So to handle these problems, we should analyze and decompose the relations with redundant data into
smaller, simpler, and well-structured relations that are satisfy desirable properties. Normalization is a
process of decomposing the relations into relations with fewer attributes.
• What is Normalization?
• Normalization is the process of organizing the data in the database.
• Normalization is used to minimize the redundancy from a relation or set of relations. It is also used to
eliminate undesirable characteristics like Insertion, Update, and Deletion Anomalies.
• Normalization divides the larger table into smaller and links them using relationships.
• The normal form is used to reduce redundancy from the database table.

• Why do we need Normalization?

• The main reason for normalizing the relations is removing these anomalies. Failure to eliminate
anomalies leads to data redundancy and can cause data integrity and other problems as the database
grows. Normalization consists of a series of guidelines that helps to guide you in creating a good
database structure.
Anomalies
• Data modification anomalies can be categorized into three types:

• Insertion Anomaly: Insertion Anomaly refers to when one cannot insert a new tuple into a relationship
due to lack of data.

• Deletion Anomaly: The delete anomaly refers to the situation where the deletion of data results in the
unintended loss of some other important data.

• Updation Anomaly: The update anomaly is when an update of a single data value requires multiple
rows of data to be updated.
Types of Normal Forms:
• Normalization works through a series of stages called Normal forms. The normal forms apply to individual
relations. The relation is said to be in particular normal form if it satisfies constraints.
First Normal Form (1NF)
• A relation will be 1NF if it contains an atomic value.
• It states that an attribute of a table cannot hold multiple values. It must hold only single-valued attribute.
• First normal form disallows the multi-valued attribute, composite attribute, and their combinations.
• Example: Relation STUDENT is not in 1NF because of multi-valued attribute STUD_PHONE.
Second Normal Form (2NF)
• In the 2NF, relational must be in 1NF.
• In the second normal form, all non-key attributes are fully functional dependent on the primary key
• In a table, if attribute B is functionally dependent on A, but is not functionally dependent on a proper subset of
A, then B is considered fully functional dependent on A. Hence, in a 2NF table, all non-key attributes cannot
be dependent on a subset of the primary key. Note that if the primary key is not a composite key, all non-key
attributes are always fully functional dependent on the primary key. A table that is in 1st normal form and
contains only a single key as the primary key is automatically in 2nd normal form.
• This table has a composite primary key [Customer ID, Store ID]. The non-key attribute is [Purchase
Location]. In this case, [Purchase Location] only depends on [Store ID], which is only part of the primary key.
Therefore, this table does not satisfy second normal form.
Third Normal Form(3NF)
• A relation is in third normal form, if there is no transitive dependency for non-prime attributes as well as it is in
second normal form.
• Transitive dependency – If A->B and B->C are two FDs then A->C is called transitive dependency.
• A relation is in 3NF if at least one of the following condition holds in every non-trivial function dependency X –>
Y
• X is a super key.
• Y is a prime attribute (each element of Y is part of some candidate key).

• FD set: { we will decompose the relation

STUD_NO -> STUD_NAME, STUDENT (STUD_NO, STUD_NAME, STUD_STATE,
STUD_NO -> STUD_STATE, STUD_COUNTRY_STUD_AGE) as:
STUD_STATE -> STUD_COUNTRY,
STUD_NO -> STUD_AGE
STUDENT (STUD_NO, STUD_NAME, STUD_STATE, STUD_AGE)
}
Candidate Key: {STUD_NO} STATE_COUNTRY (STUD_STATE, STUD_COUNTRY)
Boyce Codd normal form (BCNF)
• BCNF is the advance version of 3NF. It is stricter than 3NF.
• A table is in BCNF if every functional dependency X → Y, X is the super key of the table.
• For BCNF, the table should be in 3NF, and for every FD, LHS is super key.

• Example: Let's assume there is a company where employees work in more than one department.
• In the above table Functional dependencies are as follows:
• EMP_ID → EMP_COUNTRY
• EMP_DEPT → {DEPT_TYPE, EMP_DEPT_NO}
• Candidate key: {EMP-ID, EMP-DEPT}
• The table is not in BCNF because neither EMP_DEPT nor EMP_ID alone are keys.
• To convert the given table into BCNF, we decompose it into three tables:
EMP_ID EMP_COUNTR EMP_DEPT DEPT_TYPE EMP_DEPT_NO
Y
264 India Designing D394 283
264 India Testing D394 300
364 UK Stores D283 232
364 UK Developing D283 549

EMP_ID EMP_COUNTRY EMP_DEPT DEPT_TYPE EMP_DEPT_NO

264 India Designing D394 283

264 India Testing D394 300

Stores D283 232

EMP_ID EMP_DEPT Developing D283 549

D394 283
D394 300
D283 232
D283 549
Multivalued Dependency
• Multivalued dependency occurs when two attributes in a table are independent of each other but, both depend on a
third attribute.
• A multivalued dependency consists of at least two attributes that are dependent on a third attribute that's why it
always requires at least three attributes.
• Example: Suppose there is a bike manufacturer company which produces two colors(white and black) of each
model every year.
• Here columns COLOR and MANUF_YEAR are dependent on
BIKE_MODEL and independent of each other.
• In this case, these two columns can be called as multivalued
dependent on BIKE_MODEL.
The representation of these dependencies is shown below:
• BIKE_MODEL → → MANUF_YEAR
• BIKE_MODEL → → COLOR
• This can be read as "BIKE_MODEL multidetermined MANUF_YEAR"
and "BIKE_MODEL multidetermined COLOR".
Fourth Normal Form (4NF)
• Any relation is said to be in the fourth normal form when it satisfies the following conditions:
• It must be in BCNF
• It should have no multivalued dependency.
• FD{Student-ID->->Course
Student-ID->->Hobby}

• Now this relation is thus in 4NF. A relation can contain a functional dependency along with a multi-
valued dependency also. So when such a case arises the columns which are functionally dependent are
moved to a separate table and the columns which are multi-valued dependent are moved to a separate
table. This converts the relation into 4NF.
Join Dependency
• If a table can be recreated by joining multiple tables and each of this table have a subset of the attributes of
the table, then the table is in Join Dependency. It is a generalization of Multivalued Dependency

{(EmpName, EmpSkills ),
( EmpName, EmpJob),
(EmpSkills, EmpJob)}

Our Join Dependency −

That would mean that a join relation
of the above three relations is
equal to our original relation <Employee>.
Fifth Normal Form
• A relation R is in Fifth Normal Form (5NF) if and only if the following conditions are satisfied simultaneously:
1. R is already in 4NF.
2. It cannot be further non-loss decomposed.
• 5NF is also known as Project-join normal form (PJ/NF).
• Decomposition of a relation is done when a relation in relational model is not in appropriate normal form. Relation R is decomposed
into two or more relations if decomposition is lossless join as well as dependency preserving.
• Lossless Join Decomposition
• If we decompose a relation R into relations R1 and R2,
• Decomposition is lossy if R1 ⋈ R2 ⊃ R
• Decomposition is lossless if R1 ⋈ R2 = R
• To check for lossless join decomposition using FD set, following conditions must hold:
• Att(R1) U Att(R2) = Att(R) ,
• Att(R1) ∩ Att(R2) ≠ Φ
• Att(R1) ∩ Att(R2) = Att(R1) or Att(R2)
For Example, A relation R (A, B, C, D) with FD set{A->BC} is decomposed into R1(ABC) and R2(AD) which is a lossless join
decomposition as:
• First condition holds true as Att(R1) U Att(R2) = (ABC) U (AD) = (ABCD) = Att(R).
• Second condition holds true as Att(R1) ∩ Att(R2) = (ABC) ∩ (AD) ≠ Φ
• Third condition holds true as Att(R1) ∩ Att(R2) = A is a key of R1(ABC) because A->BC is given.
Pros & Cons of NF
• Advantages of Normalization
• Normalization helps to minimize data redundancy.
• Greater overall database organization.
• Data consistency within the database.
• Much more flexible database design.
• Enforces the concept of relational integrity.

• Disadvantages of Normalization
• You cannot start building the database before knowing what the user needs.
• The performance degrades when normalizing the relations to higher normal forms, i.e., 4NF, 5NF.
• It is very time-consuming and difficult to normalize relations of a higher degree.
• Careless decomposition may lead to a bad database design, leading to serious problems.
Decomposition Using Functional Dependencies
Keys and functional dependencies
• A database models a set of entities and relationships in the real world. There are usually a variety of
constraints (rules) on the data in the real world.
• For example, some of the constraints that are expected to hold in a university database are:
• 1. Students and instructors are uniquely identified by their ID.
• 2. Each student and instructor has only one name.
• 3. Each instructor and student is (primarily) associated with only one department.
• 4. Each department has only one value for its budget, and only one associated building.
An instance of a relation that satisfies all such real-world constraints is called a legal instance of the relation; a
legal instance of a database is one where all the relation instances are legal instances.
Some of the most commonly used types of real-world constraints can be represented formally as keys
(superkeys, candidate keys and primary keys), or as functional dependencies
FD have to be generated..
• We shall use functional dependencies in two ways:
• 1. To test instances of relations to see whether they satisfy a given set F of functional dependencies.
• 2. To specify constraints on the set of legal relations
• Third NF(refer previous slides)

• BCNF(refer previous slides)

• Dependency preserving concept: If we decompose a relation R into relations R1 and R2, All
dependencies of R either must be a part of R1 or R2 or must be derivable from combination of
FD’s of R1 and R2.
For Example, A relation R (A, B, C, D) with FD set{A->BC} is decomposed into R1(ABC) and
R2(AD) which is dependency preserving because FD A->BC is a part of R1(ABC).

• Higher NF(4th,5th NF) (refer previous slides)

• Closure of a Set of Functional Dependencies

• Closure of Attribute Sets

• Canonical Cover

• Lossless Decomposition

• Dependency preservation
Functional Dependency Theory
• Closure of a Set of Functional Dependencies
• Suppose we are given a relation schema r (A, B, C, G, H, I) and the set of functional dependencies:
• A→B
• A→C
• CG→H
• CG→I
• B→H
• The functional dependency:
• A→H is logically implied.
• Let F be a set of functional dependencies. The closure of F, denoted by F+, is the set of all functional
dependencies logically implied by F. Given F, we can compute F+ directly from the formal definition of
functional dependency. If F were large, this process would be lengthy and difficult. Such a computation of F+
requires arguments of the type just used to show that A→H is in the closure of our example set of
dependencies.
• Axioms, or rules of inference, provide a simpler technique for reasoning about functional dependencies
• we use Greek letters (α,β,γ,. . . ) for sets of attributes, and uppercase Roman letters from the beginning of the
alphabet for individual attributes. We use αβ to denote α ∪ β .
• By applying these rules repeatedly, we can find all of F+, given F. This collection of rules is called
Armstrong’s axioms in honor of the person who first proposed it.

• Armstrong’s axioms are sound, because they do not generate any incorrect functional dependencies.
• They are complete, because, for a given set F of functional dependencies, they allow us to generate all F+.
Let us apply our rules to the example of schema R = (A,
B, C, G, H, I) and the set F of functional dependencies
{A→ B, A→ C, CG → H, CG → I , B → H}.
We list several members of F+ here:

• CG → HI. Since CG → H and CG → I , the union

rule implies that CG →HI lly A → BC
• AG → I. Since A→C and CG → I , the pseudo
transitivity rule implies that AG → I holds.
• A → H. Since A → B and B → H hold, we apply the
transitivity rule

The set F+ of functional dependencies

is{A→ B, A→ C, CG → H, CG → I , B → H ,
CG →HI,AG →I,A →H,A → BC}.
• Closure of Attribute Sets
• We say that an attribute B is functionally determined by if α → B. To test whether a set α is a superkey, we must devise an
algorithm for computing the set of attributes functionally determined by α . One way of doing this is to compute F+, take all
functional dependencies with α as the left-hand side, and take the union of the right-hand sides of all such dependencies. However,
doing so can be expensive, since F+ can be large.

• FD{A→ B, A→ C, CG → H, CG → I , B → H}
• we shall use it to compute (AG)+ with the functional dependencies as
above and result = AG.
• A → B causes us to include B in result. To see this fact, we observe that
• A→ B is in F, A⊆ result (which is AG), so result := result ∪B.
• A→C causes result to become ABCG.
• CG→H causes result to become ABCGH.
• CG→I causes result to become ABCGHI.
• Canonical Cover
• Suppose that we have a set of functional dependencies F on a relation schema. Whenever a user performs an
update on the relation, the database system must ensure that the update does not violate any functional
dependencies, that is, all the functional dependencies in F are satisfied in the new database state.
• The system must roll back the update if it violates any functional dependencies in the set F.
• A canonical cover or irreducible a set of functional dependencies FD is a simplified set of FD that has a
similar closure as the original set FD.
• Extraneous attributes::An attribute of an FD is said to be extraneous if we can remove it without
changing the closure of the set of FD.
Q. Suppose a relational schema R(w x y z), and set of functional dependency as following F : { x w, wz xy, y wxz }
Find the canonical cover Fc (Minimal set of functional dependency).
• Lossless Decomposition::refer 5NF slide
• Dependency preserving::
Algorithms for Decomposition
3NF and BCNF:(need to explain in detail what is 3NF,BCNF,respective algorithms, difference between
3NF and BCNF)
For explanation regarding 3NF and BCNF refer previous slides.
Dependency-preserving, lossless decomposition into 3NF:

Sometimes, the result is not only in 3NF, but also in

BCNF. This suggests an alternative method of generating
a BCNF design. First use the 3NF algorithm. Then, for
any schema in the 3NF design that is not in BCNF,
decompose using the BCNF algorithm. If the result is not
dependency-preserving, revert to the 3NF design.
Testing of a relation schema R to see if it satisfies BCNF can be
simplified in some
cases:
• To check if a nontrivial dependency α → β causes a
violation of BCNF, compute α + (the attribute closure of α), and
verify that it includes all attributes of R; that is, it is a superkey
of R.
• To check if a relation schema R is in BCNF, it
suffices to check only the dependencies in the given set F for
violation of BCNF, rather than check all dependencies in F+.

We can show that if none of the dependencies in F causes a

violation of BCNF, then none of the dependencies in F+ will
cause a violation of BCNF,either.
Comparision between 3NF and BCNF
S.NO. 3NF BCNF
In 3NF there should be no transitive dependency that is no non
In BCNF for any relation A->B, A should be a super
1. prime attribute should be transitively dependent on the
key of relation.
candidate key.

2. It is less stronger than BCNF. It is comparatively more stronger than 3NF.

In 3NF the functional dependencies are already in 1NF and In BCNF the functional dependencies are already in
3.
2NF. 1NF, 2NF and 3NF.

4. The redundancy is high in 3NF. The redundancy is comparatively low in BCNF.

In BCNF there may or may not be preservation of all

5. In 3NF there is preservation of all functional dependencies.
functional dependencies.
6. It is comparatively easier to achieve. It is difficult to achieve.

7. Lossless decomposition can be achieved by 3NF. Lossless decomposition is hard to achieve in BCNF
Decomposition Using Multivalued Dependencies
• Multivalued dependency,4NF and 4NF decomposition Algorithm
• Multivalued dependency,4NF(concept refer in previous slides)
• 4NF decomposition Algorithm
More Normal Forms
• Join Dependency(refer previous slides)

• 5NF (refer previous slides)

• Multivalued dependencies help us understand and eliminate some forms of repetition of information that
cannot be understood in terms of functional dependencies. There are types of constraints called join
dependencies that generalize multivalued dependencies, and lead to another normal form called project-join
normal form (PJNF) (PJNF is called fifth normal form in some books).There is a class of even more
general constraints that leads to a normal form called domain-key normal form (DKNF).

• A practical problem with the use of these generalized constraints is that they are not only hard to reason with,
but there is also no set of sound and complete inference rules for reasoning about the constraints. Hence PJNF
and DKNF are used quite rarely.
Database-Design Process
• we assumed that a relation schema r(R) is given, and proceeded to normalize it. There are several ways in
which we could have come up with the schema r(R):
• So far we have looked at detailed issues about normal forms and normalization. In this section, we study how
normalization fits into the overall database-design process.
• There are several ways in which we could have come up with the schema r (R):
• 1. r (R) could have been generated in converting an E-R diagram to a set of relation schemas.
• 2. r(R) could have been a single relation schema containing all attributes that are of interest. The
normalization process then breaks up r (R) into smaller schemas.
• 3. r (R) could have been the result of an ad-hoc design of relations that we then test to verify that it satisfies a
desired normal form.
• E-R Model and Normalization
• When we define an E-R diagram carefully, identifying all entities correctly, the relation schemas generated
from the E-R diagram should not need much further normalization
• However, there can be functional dependencies between attributes of an entity. For instance, suppose an
instructor entity set had attributes dept_name and dept_address, and there is a functional dependency
dept_name → dept_address. We would then need to normalize the relation generated from instructor. Most
examples of such dependencies arise out of poor E-R diagram design. In the above example, if we had
designed the E-R diagram correctly, we would have created a department entity set with attribute
dept_address and a relationship set between instructor and department.
• Functional dependencies can help us detect poor E-R design. If the generated relation schemas are not in
desired normal form, the problem can be fixed in the E-R diagram. That is, normalization can be done
formally as part of data modeling. Alternatively, normalization can be left to the designer’s intuition during E-
R modeling, and can be done formally on the relation schemas generated from the E-R model.

• A careful reader will have noted that in order for us to illustrate a need for multivalued dependencies and
fourth normal form, we had to begin with schemas that were not derived from our E-R design. Indeed, the
process of creating an E-R design tends to generate 4NF designs. If a multivalued dependency holds and is
not implied by the corresponding functional dependency, it usually arises from one of the following sources:
• A many-to-many relationship set.
• A multivalued attribute of an entity set.

• For a many-to-many relationship set each related entity set has its own schema and there is an additional
schema for the relationship set. For a multivalued attribute, a separate schema is created consisting of that
attribute and the primary key of the entity set (as in the case of the phone number attribute of the entity set
instructor).
• Naming of Attributes and Relationships
• A desirable feature of a database design is the unique-role assumption, which means that each attribute name
has a unique meaning in the database. This prevents us from using the same attribute to mean different things
in different schemas.

• For example, we might otherwise consider using the attribute number for phone_number in the instructor
schema and for room_number in the classroom schema. The join of a relation on schema instructor with
one on classroom is meaningless.

• While users and application developers can work carefully to ensure use of the right number in each
circumstance, having a different attribute name for phone number and for room number serves to reduce user
errors.

• While it is a good idea to keep names for incompatible attributes distinct, if attributes of different relations
have the same meaning, it may be a good idea to use the same attribute name. For this reason we used the
same attribute name “name” for both the instructor and the student entity sets.
• In large database schemas, relationship sets (and schemas derived therefrom) are often named via a
concatenation of the names of related entity sets, perhaps with an intervening hyphen or underscore. We have
used a few such names, for example inst_sec and student_sec. We used the names teaches and takes instead of
using the longer concatenated names. This was acceptable since it is not hard for you to remember the
associated entity sets for a few relationship sets. We cannot always create relationship-set names by simple
concatenation; for example, a manager or works-for relationship between employees would not make much
sense if it were called employee-employee! Similarly, if there are multiple relationship sets possible between
a pair of entity sets, the relationship-set names must include extra parts to identify the relationship set.

• Different organizations have different conventions for naming entity sets. For example, we may call an entity
set of students student or students. We have chosen to use the singular form in our database designs. Using
either singular or plural is acceptable, as long as the convention is used consistently across all entity sets.

• As schemas grow larger, with increasing numbers of relationship sets, using consistent naming of attributes,
relationships, and entities makes life much easier for the database designer and application programmers.
De-normalization for Performance
• Occasionally database designers choose a schema that has redundant information; that is, it is not normalized.
• They use the redundancy to improve performance for specific applications.
• The penalty paid for not using a normalized schema is the extra work (in terms of coding time and execution
time) to keep redundant data consistent.
• For instance, suppose all course prerequisites have to be displayed along with a course information, every time a
course is accessed.
• In our normalized schema, this requires a join of course with prereq.
• One alternative to computing the join on the fly is to store a relation containing all the attributes of course and
prereq. This makes displaying the “full” course information faster. However, the information for a course is repeated
for every course prerequisite, and all copies must be updated by the application, whenever a course prerequisite is
added or dropped. The process of taking a normalized schema and making it non-normalized is called
denormalization, and designers use it to tune performance of systems to support time-critical operations.
• A better alternative, supported by many database systems today, is to use the normalized schema, and additionally
store the join of course and prereq as a materialized view.
Other Design Issues::Refer Text

Full Download HCNA Networking Study Guide 1st Edition Huawei Technologies Co PDF
100% (3)
Full Download HCNA Networking Study Guide 1st Edition Huawei Technologies Co PDF
62 pages
Software Quality Assurance Complete Notes
0% (1)
Software Quality Assurance Complete Notes
42 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
88 pages
Functional Dependency
No ratings yet
Functional Dependency
17 pages
Functional Dependency
No ratings yet
Functional Dependency
11 pages
DBMS UNIT 3
No ratings yet
DBMS UNIT 3
20 pages
UNIT-3 Functional Dependency
No ratings yet
UNIT-3 Functional Dependency
30 pages
Normalization
No ratings yet
Normalization
25 pages
204- SQLUnit 3
No ratings yet
204- SQLUnit 3
11 pages
Unit-Iii Normalization Functional Dependency: For Example
No ratings yet
Unit-Iii Normalization Functional Dependency: For Example
18 pages
Mod 4 DBMS
No ratings yet
Mod 4 DBMS
48 pages
DBMS Unit 3.0 Functional Dependencies
No ratings yet
DBMS Unit 3.0 Functional Dependencies
44 pages
Functional Dependency & Normalization: BY: Richa Jain
No ratings yet
Functional Dependency & Normalization: BY: Richa Jain
83 pages
Unit 3
No ratings yet
Unit 3
42 pages
Normalization Unit 3
No ratings yet
Normalization Unit 3
30 pages
Unit-3 DBMS
No ratings yet
Unit-3 DBMS
45 pages
Apznzazggksrduvjspc64a1fzvx3ej 0xqnunp Na-8r4obhf0zrf2m4avpmsg4kgy6egwld-Bh2f7wemdepirbubxvdjkdpsimstg4twt...02be5zz2ncvgu0xbgxqkfllpws0-3ziv3shydhrzuef c0wbgohgny0wtfkpqok3jysklapivs Df6px2n8a- Fqqxjuvjmdsue8rainxevyosvojyyffbzhm=
No ratings yet
Apznzazggksrduvjspc64a1fzvx3ej 0xqnunp Na-8r4obhf0zrf2m4avpmsg4kgy6egwld-Bh2f7wemdepirbubxvdjkdpsimstg4twt...02be5zz2ncvgu0xbgxqkfllpws0-3ziv3shydhrzuef c0wbgohgny0wtfkpqok3jysklapivs Df6px2n8a- Fqqxjuvjmdsue8rainxevyosvojyyffbzhm=
145 pages
DB Normalization
No ratings yet
DB Normalization
17 pages
dbms 3rd unit..
No ratings yet
dbms 3rd unit..
51 pages
Databases Lecture 5
No ratings yet
Databases Lecture 5
34 pages
Unit3[1]
No ratings yet
Unit3[1]
33 pages
SQL modules
No ratings yet
SQL modules
52 pages
Functional Dependency and Normalization: Chapter Four
No ratings yet
Functional Dependency and Normalization: Chapter Four
16 pages
DBMS Unit-3 Notes
No ratings yet
DBMS Unit-3 Notes
23 pages
DBMS Lecture of Unit 3 H
No ratings yet
DBMS Lecture of Unit 3 H
8 pages
Unit 3
No ratings yet
Unit 3
19 pages
What Is Functional Dependency?: Re Exivity: If Y Is A Subset of X, Then X Y Holds by Re Exivity Rule
No ratings yet
What Is Functional Dependency?: Re Exivity: If Y Is A Subset of X, Then X Y Holds by Re Exivity Rule
17 pages
Unit - 3
No ratings yet
Unit - 3
40 pages
Functional Dep Biruk Tsegaye 75721
No ratings yet
Functional Dep Biruk Tsegaye 75721
12 pages
Unit IV Database Normalization
No ratings yet
Unit IV Database Normalization
36 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
41 pages
UNIT-3 DBMS Notes
No ratings yet
UNIT-3 DBMS Notes
54 pages
2.3 1NF,2NF,3NF,4NF,5NF
No ratings yet
2.3 1NF,2NF,3NF,4NF,5NF
100 pages
Advance Database Systems - Lec 4
No ratings yet
Advance Database Systems - Lec 4
68 pages
UNIT 3 Notes
No ratings yet
UNIT 3 Notes
17 pages
NORMALISATION
No ratings yet
NORMALISATION
15 pages
Presentation 3
No ratings yet
Presentation 3
23 pages
UNIT-6: Schema Refinement (Normalization)
No ratings yet
UNIT-6: Schema Refinement (Normalization)
19 pages
Unit 3 (KCS501)
No ratings yet
Unit 3 (KCS501)
13 pages
Unit 3
No ratings yet
Unit 3
11 pages
Chapter 4
No ratings yet
Chapter 4
12 pages
Database-unit-4-Normilization-1-1
No ratings yet
Database-unit-4-Normilization-1-1
38 pages
DBMS Unit-III (1)
No ratings yet
DBMS Unit-III (1)
42 pages
Normalization
No ratings yet
Normalization
51 pages
Normalization
No ratings yet
Normalization
94 pages
Module3 Dbms
No ratings yet
Module3 Dbms
192 pages
Unit_3
No ratings yet
Unit_3
28 pages
Database Normalization
No ratings yet
Database Normalization
28 pages
DBMS Module 3
No ratings yet
DBMS Module 3
24 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
90 pages
DBMS 5 FDB Functional Dependency
No ratings yet
DBMS 5 FDB Functional Dependency
30 pages
Unit-3 (Database Design and Normalization)
No ratings yet
Unit-3 (Database Design and Normalization)
18 pages
Functional Dependency Notes
No ratings yet
Functional Dependency Notes
52 pages
unit 3 ADBMS
No ratings yet
unit 3 ADBMS
12 pages
normalisation
No ratings yet
normalisation
56 pages
Functional Dependancy and Normalization
No ratings yet
Functional Dependancy and Normalization
33 pages
Chapter 4
No ratings yet
Chapter 4
25 pages
Module No 5 Relational Database Design
No ratings yet
Module No 5 Relational Database Design
160 pages
Fd,Normalization
No ratings yet
Fd,Normalization
67 pages
Int 306 Normalization
No ratings yet
Int 306 Normalization
66 pages
Introduction To Normalization
No ratings yet
Introduction To Normalization
12 pages
Basic Concepts in Data Structures
From Everand
Basic Concepts in Data Structures
K.Meenendranath Reddy
No ratings yet
JEE Main Important Chapter - Limits, Continuity and Differentiability
No ratings yet
JEE Main Important Chapter - Limits, Continuity and Differentiability
30 pages
Relational Database Design Algorithms 5NF
No ratings yet
Relational Database Design Algorithms 5NF
40 pages
CM252 DBMS - Labcycle 01
No ratings yet
CM252 DBMS - Labcycle 01
4 pages
PLSQL Complete
No ratings yet
PLSQL Complete
80 pages
Circuits 2 Lab Manual
No ratings yet
Circuits 2 Lab Manual
106 pages
Nuevo documento de texto
No ratings yet
Nuevo documento de texto
1 page
DEMO171 Subscription Management - Supremo - Extended Warranty
No ratings yet
DEMO171 Subscription Management - Supremo - Extended Warranty
46 pages
(Computer Science, Technology and Applications) Frederik L. Sørensen (Editor) - Enterprise Architecture and Service-Oriented Architecture (2020)
No ratings yet
(Computer Science, Technology and Applications) Frederik L. Sørensen (Editor) - Enterprise Architecture and Service-Oriented Architecture (2020)
130 pages
Operating Instructions Flexi Soft in Flexi Soft Designer Configuration Software en Im0031659
No ratings yet
Operating Instructions Flexi Soft in Flexi Soft Designer Configuration Software en Im0031659
544 pages
Modern Database Management Slides - ch05
No ratings yet
Modern Database Management Slides - ch05
43 pages
48 - Manual - Leuze Compact Plus
No ratings yet
48 - Manual - Leuze Compact Plus
4 pages
MATLAB Global Optimization Toolbox User s Guide The Mathworks All Chapters Instant Download
100% (4)
MATLAB Global Optimization Toolbox User s Guide The Mathworks All Chapters Instant Download
40 pages
Huawei CloudEngine S5732-H Series Multi-GE Switches Brochure
No ratings yet
Huawei CloudEngine S5732-H Series Multi-GE Switches Brochure
18 pages
All AQs
100% (1)
All AQs
14 pages
Final Comp1 - AS
No ratings yet
Final Comp1 - AS
4 pages
01 - Security Essentials
No ratings yet
01 - Security Essentials
34 pages
Cse322 Ethics MCQ
No ratings yet
Cse322 Ethics MCQ
80 pages
Description of Options: MIC-2 MKII I/O Module User's Manual
No ratings yet
Description of Options: MIC-2 MKII I/O Module User's Manual
24 pages
Intel® NUC Kits NUC10i3FNH
No ratings yet
Intel® NUC Kits NUC10i3FNH
33 pages
Unit 1 ITC
No ratings yet
Unit 1 ITC
25 pages
GE LOGIQ Configurator - LOGIQ
No ratings yet
GE LOGIQ Configurator - LOGIQ
3 pages
Is 400
No ratings yet
Is 400
38 pages
Unit Progress Test 10 - Version A
No ratings yet
Unit Progress Test 10 - Version A
12 pages
answer Quiz- Congestoin control- truong quang tuong
No ratings yet
answer Quiz- Congestoin control- truong quang tuong
4 pages
Pool Controller Dzapasi Project Report
No ratings yet
Pool Controller Dzapasi Project Report
64 pages
Neovarsity Academy Brochure
No ratings yet
Neovarsity Academy Brochure
12 pages
Python Report Lokesh
No ratings yet
Python Report Lokesh
57 pages
Man G31T-M2 D
No ratings yet
Man G31T-M2 D
23 pages
CSS Presentation Slides
No ratings yet
CSS Presentation Slides
467 pages
Canon Powershot Sx100is SM
No ratings yet
Canon Powershot Sx100is SM
163 pages
Shreya Resume
No ratings yet
Shreya Resume
1 page
MAX2020 Agenda
No ratings yet
MAX2020 Agenda
76 pages

DBMS - Unit - 3 - Chapter - 2 - Relationl Database Design

Uploaded by

DBMS - Unit - 3 - Chapter - 2 - Relationl Database Design

Uploaded by

Unit-III Chapter-II

Relational Database Design

Decomposition Algorithms for More Normal

• Non-Trivial functional dependency

• It is used to maintain the quality of data in the database.

• It expresses the facts about the database design.

• It helps in clearly defining the meanings and constraints of databases.

• It helps to identify bad designs.

• Union Rule (IR4)

• Pseudo transitive Rule (IR6)

• Wastage and poor utilization of disk space and resources.

• The likelihood of errors and inconsistencies increases.

• Why do we need Normalization?

• FD set: { we will decompose the relation

EMP_ID EMP_COUNTRY EMP_DEPT DEPT_TYPE EMP_DEPT_NO

264 India Designing D394 283

264 India Testing D394 300

EMP_ID EMP_DEPT Developing D283 549

Our Join Dependency −

• BCNF(refer previous slides)

• Higher NF(4th,5th NF) (refer previous slides)

• Closure of Attribute Sets

• CG → HI. Since CG → H and CG → I , the union

The set F+ of functional dependencies

Sometimes, the result is not only in 3NF, but also in

We can show that if none of the dependencies in F causes a

2. It is less stronger than BCNF. It is comparatively more stronger than 3NF.

4. The redundancy is high in 3NF. The redundancy is comparatively low in BCNF.

In BCNF there may or may not be preservation of all

• 5NF (refer previous slides)

You might also like