Fundamentals of Database Systems Lecture Note UOG

UNIT FIVE
Physical Database Design and Performance

We have established that there are three levels, or sub-phases, of database design:
• Conceptual: producing a data model that accounts for the relevant entities and relationships within the target application domain;
• Logical: ensuring, via normalization procedures and the definition of integrity rules, that the stored database will be non-redundant and properly connected;
• Physical: specifying how database records are stored, accessed, retrieved, and related, to ensure adequate performance.

It is considered desirable to keep these three levels quite separate; one of Codd's requirements for an RDBMS is that it should maintain logical-physical data independence. The generality of the relational model means that RDBMSs are potentially less efficient than systems based on one of the older data models, where access paths were specified once and for all at the design stage. However, the relational data model does not preclude the use of traditional techniques for accessing data; it is still essential to exploit them to achieve adequate performance with a database of any size.

We can consider the topic of physical database design from three aspects:
• What techniques for storing and finding data exist;
• Which of these are implemented within a particular DBMS;
• Which might be selected by the designer for a given application, knowing the properties of the data.

Thus, physical database design addresses the following questions:

1. How to map the logical database design to a physical database design.
2. How to design base relations for the target DBMS.
3. How to design enterprise constraints for the target DBMS.
4. How to select appropriate file organizations based on an analysis of transactions.
5. When to use secondary indexes to improve performance.
6. How to estimate the size of the database.
7. How to design user views.
8. How to design security mechanisms to satisfy user requirements.
Physical database design is the process of producing a description of the implementation of the database on secondary storage. Physical design describes the base relations, file organizations, and indexes used to achieve efficient access to the data, together with any associated integrity constraints and security measures.

Sources of information for the physical design process include the global logical data model and the documentation that describes the model. Logical database design is concerned with the what; physical database design is concerned with the how. Physical database design describes the storage structures and access methods used to achieve efficient access to the data.

Steps in physical database design


1. Translate the logical data model for the target DBMS
1.1. Design base relations
1.2. Design representation of derived data
1.3. Design enterprise constraints
2. Design the physical representation
2.1. Analyze transactions
2.2. Choose file organizations
2.3. Choose indexes
2.4. Estimate disk space and system requirements
3. Design user views
4. Design security mechanisms
5. Consider controlled redundancy
6. Monitor and tune the operational system

1. Translate the logical data model for the target DBMS

This phase translates the global logical data model to produce a relational database schema in the target DBMS. It includes creating the data dictionary based on the logical model and the information gathered. After the creation of the data dictionary, the next activity is to understand the functionality of the target DBMS, so that all necessary requirements are fulfilled for the database to be developed.

Knowledge of the target DBMS includes:
• How to create base relations;
• Whether the system supports:
o Definition of primary keys
o Definition of foreign keys
o Definition of alternate keys
o Definition of domains
o Referential integrity constraints
o Definition of enterprise-level constraints

1.1. Design base relations

The objective is to decide how to represent the base relations identified in the global logical data model in the target DBMS. Designing a base relation involves identifying all the necessary requirements about the relation, from its name up to its referential integrity constraints.
For each relation, we need to define:
• The name of the relation;
• A list of simple attributes in brackets;
• The primary key (PK) and, where appropriate, alternate keys (AKs) and foreign keys (FKs);
• A list of any derived attributes and how they should be computed;
• Referential integrity constraints for any FKs identified.
For each attribute, we need to define:
• Its domain, consisting of a data type, length, and any constraints on the domain;
• An optional default value for the attribute;
• Whether the attribute can hold nulls.
The implementation of the physical model depends on the target DBMS, since some DBMSs provide more facilities than others for defining the database. The design of the base relations, along with the reasoning behind every choice, should be fully documented.
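As a minimal illustration, the sketch below shows how a base relation definition of this kind might be written as DDL. It uses Python's built-in sqlite3 module purely as an example target; the Branch and Staff relations, their attributes, and their constraints are all invented for the illustration, and the DDL facilities available differ from one DBMS to another.

import sqlite3

conn = sqlite3.connect(":memory:")        # throwaway database for the example
conn.execute("PRAGMA foreign_keys = ON")  # enforce referential integrity

conn.executescript("""
CREATE TABLE Branch (
    branchNo TEXT    PRIMARY KEY,                  -- PK
    street   TEXT    NOT NULL,
    city     TEXT    NOT NULL
);

CREATE TABLE Staff (
    staffNo  TEXT    PRIMARY KEY,                  -- PK
    name     TEXT    NOT NULL,                     -- nulls not allowed
    position TEXT    NOT NULL DEFAULT 'Assistant', -- optional default value
    salary   NUMERIC CHECK (salary > 0),           -- simple domain constraint
    branchNo TEXT    NOT NULL,                     -- FK
    FOREIGN KEY (branchNo) REFERENCES Branch (branchNo)
        ON UPDATE CASCADE ON DELETE RESTRICT       -- referential integrity action
);
""")

The later sketches in this unit continue this example, reusing the same connection and schema.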
1.2. Design representation of derived data
While analyzing user requirements, we may find that some attributes hold data that is derived from other, existing attributes. A decision should be made on how to represent any derived data present in the global logical data model in the target DBMS.

Examine the logical data model and the data dictionary, and produce a list of all derived attributes. Most of the time, derived attributes are not expressed in the logical model but are included in the data dictionary. Whether to store derived attributes in a base relation or to calculate them when required is a decision for the designer, made with the performance impact in mind. The option selected is based on:
• The additional cost of storing the derived data and keeping it consistent with the operational data from which it is derived;
• The cost of calculating it each time it is required.
The less expensive option is chosen, subject to performance constraints. The representation of derived attributes should be fully documented.
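For example, the number of staff working at each branch is a typical derived attribute: it can either be stored and kept consistent, or computed on demand. Continuing the hypothetical sqlite3 sketch above, the "calculate when required" option might simply be a view; the view and attribute names here are invented for the illustration.

# Option A: compute the derived value on demand through a view.
conn.executescript("""
CREATE VIEW BranchStaffCount AS
SELECT b.branchNo, COUNT(s.staffNo) AS staffCount   -- derived attribute
FROM Branch b LEFT JOIN Staff s ON s.branchNo = b.branchNo
GROUP BY b.branchNo;
""")

# Option B (not shown in full): store a staffCount column in Branch and keep it
# consistent with triggers or application code, trading dearer updates for
# cheaper retrieval.

for row in conn.execute("SELECT * FROM BranchStaffCount"):
    print(row)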
1.3. Design enterprise constraints

Data in the database is subject not only to the constraints of the data model used but also to enterprise-dependent business rules. How these constraints are defined depends on the DBMS selected and on the enterprise-level requirements. One needs to know the capabilities of the target DBMS when designing enterprise constraints, since some DBMSs provide more facilities than others.

All enterprise-level constraints, and the way they are defined in the target DBMS, should be fully documented.
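As an illustration, suppose the (invented) enterprise rule is that a branch may employ at most 20 members of staff. Continuing the sqlite3 sketch above, which has no CREATE ASSERTION, the rule can be approximated with a trigger; other DBMSs may offer assertions, richer CHECK constraints, or stored procedures instead.

# Hypothetical enterprise rule: at most 20 members of staff per branch.
conn.executescript("""
CREATE TRIGGER enforce_max_staff
BEFORE INSERT ON Staff
WHEN (SELECT COUNT(*) FROM Staff WHERE branchNo = NEW.branchNo) >= 20
BEGIN
    SELECT RAISE(ABORT, 'Branch already has the maximum number of staff');
END;
""")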

2. Design the physical representation

This phase determines the optimal file organizations for storing the base relations, and the indexes required to achieve acceptable performance; that is, the way in which relations and tuples will be held on secondary storage.
A number of factors may be used to measure efficiency:
• Transaction throughput: the number of transactions processed in a given time interval.
• Response time: the elapsed time for the completion of a single transaction.
• Disk storage: the amount of disk space required to store the database files.
However, no single factor is always the right one to optimize. Typically, one factor has to be traded off against another to achieve a reasonable balance.
2.1. Analyze transactions
The objective is to understand the functionality of the transactions that will run on the database and to analyze the most important ones.

Attempt to identify performance criteria, for example:
• Transactions that run frequently and will have a significant impact on performance;
• Transactions that are critical to the business;
• Times during the day/week when there will be high demand on the database (the peak load).
Use this information to identify the parts of the database that may cause performance problems.
To select appropriate file organizations and indexes, we also need to know the high-level functionality of the transactions, such as:
• The attributes that are updated in an update transaction;
• The criteria used to restrict the tuples that are retrieved in a query.
It is often not possible to analyze all expected transactions, so we investigate the most 'important' ones. To help identify which transactions to investigate, we can use:
• A transaction/relation cross-reference matrix, showing the relations that each transaction accesses (an illustrative matrix is shown below), and/or
• A transaction usage map, indicating which relations are potentially heavily used.
To focus on areas that may be problematic:
1. Map all transaction paths to relations.
2. Determine which relations are most frequently accessed by transactions.
3. Analyze the data usage of the selected transactions that involve these relations.
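For illustration only, a cross-reference matrix for a few invented transactions over the hypothetical Branch, Staff, and Viewing relations might look as follows, where I, R, and U indicate that the transaction inserts, reads, or updates tuples of that relation:

Transaction                            Branch   Staff   Viewing
(A) Enter a new member of staff                    I
(B) List the staff at a given branch      R        R
(C) Record a viewing of a property                           I
(D) Update a staff member's salary                 U

A matrix like this, combined with estimated transaction frequencies, highlights Staff as the most heavily used relation in this hypothetical workload.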

2.2. Choose file organizations

The objective is to determine an efficient file organization for each base relation. Common file organizations include heap, hash, Indexed Sequential Access Method (ISAM), B+-tree, and clusters.
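Not every DBMS exposes all of these choices. Continuing the sqlite3 sketch above as one concrete example: an ordinary SQLite table is stored as a B-tree keyed on a hidden rowid, while a WITHOUT ROWID table is a clustered B-tree keyed on its primary key. The Viewing relation here is invented for the illustration.

# A clustered organization: tuples are stored in primary-key order.
conn.executescript("""
CREATE TABLE Viewing (
    clientNo   TEXT,
    propertyNo TEXT,
    viewDate   TEXT,
    comment    TEXT,
    PRIMARY KEY (clientNo, propertyNo)
) WITHOUT ROWID;   -- B-tree keyed on (clientNo, propertyNo) rather than a rowid
""")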

6
Fundamentals of Database Systems Lecture Note UOG

2.3. Choose indexes

The objective is to determine whether adding indexes will improve the performance of the system.
One approach is to keep the tuples unordered and create as many secondary indexes as necessary. Another approach is to order the tuples in the relation by specifying a primary or clustering index. In this case, choose the attribute for ordering or clustering the tuples as either:
• The attribute that is used most often for join operations, as this makes the join operation more efficient, or
• The attribute that is used most often to access the tuples in a relation in order of that attribute.
If the ordering attribute chosen is a key of the relation, the index will be a primary index; otherwise, it will be a clustering index. Each relation can have at most one such index, either a primary index or a clustering index.
Secondary indexes provide a mechanism for specifying an additional key for a base relation that can be used to retrieve data more efficiently. However, the overhead involved in the maintenance and use of secondary indexes has to be balanced against the performance improvement gained when retrieving data. This overhead includes:
• Adding an index record to every secondary index whenever a tuple is inserted;
• Updating a secondary index when the corresponding tuple is updated;
• The increase in disk space needed to store the secondary index;
• Possible performance degradation during query optimization, since the optimizer must consider all secondary indexes.
Guidelines for Choosing Indexes
(1) Do not index small relations.
(2) Index the PK of a relation if it is not a key of the file organization.
(3) Add a secondary index to an FK if it is frequently accessed.
(4) Add a secondary index to any attribute that is heavily used as a secondary key.
(5) Add a secondary index on attributes that are involved in selection or join criteria, ORDER BY, GROUP BY, and other operations involving sorting (such as UNION or DISTINCT).
(6) Add a secondary index on attributes involved in built-in functions.
(7) Add a secondary index on attributes that could result in an index-only plan.
(8) Avoid indexing an attribute or relation that is frequently updated.
(9) Avoid indexing an attribute if the query will retrieve a significant
proportion of the tuples in the relation.
(10) Avoid indexing attributes that consist of long character strings.
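Continuing the sqlite3 sketch above, suppose one frequent, critical (and invented) transaction lists the names and salaries of staff at a given branch, ordered by salary. Guideline (3) suggests indexing the FK branchNo, while guidelines (5) and (7) suggest a composite index that can also support an index-only plan; the index names and the branch number 'B005' below are purely illustrative.

conn.executescript("""
-- Guideline (3): secondary index on the frequently accessed FK.
CREATE INDEX idx_staff_branch ON Staff (branchNo);

-- Guidelines (5) and (7): a composite index that also covers the query.
CREATE INDEX idx_staff_branch_salary ON Staff (branchNo, salary, name);
""")

# Inspect the optimizer's plan to confirm that an index is actually used.
plan = conn.execute("""
    EXPLAIN QUERY PLAN
    SELECT name, salary FROM Staff
    WHERE branchNo = 'B005'
    ORDER BY salary
""").fetchall()
print(plan)   # expect something like 'SEARCH Staff USING COVERING INDEX ...'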

2.4. Estimate disk space and system requirements

The objective is to estimate the amount of disk space that will be required by the database (a rough worked example follows the list). The purpose is to establish:
• If the system already exists: is there adequate storage?
• If a new system is being procured: what storage will be required?
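A first approximation multiplies the expected number of tuples in each relation by the average tuple size, then allows for indexes and growth. The figures below are invented purely to show the arithmetic; real estimates come from the data dictionary and from volume and growth analysis.

# Hypothetical figures for the Staff relation.
tuples          = 50_000    # expected number of tuples
avg_tuple_bytes = 120       # average stored size of one tuple, in bytes
index_overhead  = 0.30      # allowance for indexes, as a fraction of the data size
annual_growth   = 0.10      # expected growth per year
years           = 3

data_bytes  = tuples * avg_tuple_bytes
total_bytes = data_bytes * (1 + index_overhead) * (1 + annual_growth) ** years

print(f"Estimated size after {years} years: {total_bytes / 1024**2:.1f} MB")
# -> roughly 9.9 MB for these example figures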

3. Design user views

The objective is to design the user views that were identified during the requirements collection and analysis stage of the relational database application lifecycle. The views are defined in the DDL to provide the user views identified in the data model and are mapped onto objects in the physical data model.
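Continuing the sqlite3 sketch above, a hypothetical user view for front-office staff might expose only the non-sensitive attributes of Staff; the view name and the choice of columns are assumptions made for the example.

conn.executescript("""
CREATE VIEW StaffPublic AS
SELECT staffNo, name, position, branchNo   -- salary deliberately omitted
FROM Staff;
""")

for row in conn.execute("SELECT * FROM StaffPublic"):
    print(row)

Restricting the columns of a view in this way also doubles as a simple data security mechanism, which leads into the next step; DBMSs that implement SQL authorization additionally allow privileges to be granted on views and base relations.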
4. Design security mechanisms
The objective is to design the security measures for the database as specified by the users. These typically fall into two categories:
• System security: access to and use of the database at the system level, for example user names and passwords.
• Data security: access to and use of individual database objects (such as relations and views), and the actions a user may perform on them.
5. Consider the Introduction of Controlled Redundancy

The objective is to determine whether introducing redundancy in a controlled manner, by relaxing the normalization rules, will improve the performance of the system. The result of normalization is a logical database design that is structurally consistent and has minimal redundancy. However, a normalized design sometimes does not provide maximum processing efficiency, and it may be necessary to accept the loss of some of the benefits of full normalization in favor of performance.
Also consider that denormalization:
• Makes implementation more complex;
• Often sacrifices flexibility;
• May speed up retrievals but slows down updates.
Formally, denormalization refers to a refinement of the relational schema such that the degree of normalization of a modified relation is less than the degree of at least one of the original relations. The term is also used more loosely for situations where two relations are combined into one new relation that is still normalized but contains more nulls than the original relations.
Consider denormalization in the following situations, specifically to speed up frequent or critical transactions (a small sketch of Step 2 follows the list):
Step 1 - Combining 1:1 relationships
Step 2 - Duplicating non-key attributes in 1:* relationships to reduce joins
Step 3 - Duplicating foreign key attributes in 1:* relationships to reduce joins
Step 4 - Introducing repeating groups
Step 5 - Merging lookup tables with base relations
Step 6 - Creating extract tables
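As a small sketch of Step 2, continuing the sqlite3 example above: suppose a frequent report lists each member of staff together with the city of their branch. Duplicating the non-key attribute city into Staff removes the join from that report, at the cost of extra logic to keep the copies consistent; the column name branchCity is an assumption made for the illustration.

conn.executescript("""
ALTER TABLE Staff ADD COLUMN branchCity TEXT;   -- controlled redundancy

UPDATE Staff
SET branchCity = (SELECT city FROM Branch
                  WHERE Branch.branchNo = Staff.branchNo);
""")

# The frequent report no longer needs to join Branch:
for row in conn.execute("SELECT name, branchCity FROM Staff"):
    print(row)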
6. Monitoring and Tuning the operational system
• Meaning of denormalization
• When to denormalize to improve performance
• Importance of monitoring and tuning the operational system

The objective is to monitor the operational system and to improve its performance, in order to correct inappropriate design decisions or to reflect changing requirements.
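Monitoring facilities differ widely between DBMSs. Continuing the sqlite3 sketch above, one very small tuning pass might refresh the optimizer statistics and then re-check the plans of the important transactions, to spot indexes that are missing or no longer used; the list of queries is, of course, an assumption for the example.

# Refresh optimizer statistics, then re-examine the important transactions.
conn.execute("ANALYZE")

important_queries = [
    "SELECT name, salary FROM Staff WHERE branchNo = 'B005' ORDER BY salary",
]
for q in important_queries:
    for step in conn.execute("EXPLAIN QUERY PLAN " + q):
        print(step)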
