DB Chapter Three-Part - One
DB Chapter Three-Part - One
Database Modeling
1
Chapter Outline
• Overview of Database Design Process
• Data Modelling Using the Entity-Relationship (ER) Model
– Example Database Application (COMPANY)
– Entity Relationship Model Concepts
• Entities and Attributes
• Entity Types, Value Sets, and Key Attributes
• Relationships and Relationship Types
• Weak Entity Types
• Roles and Attributes in Relationship Types
• ER Diagrams – Notation
• ER Diagram for COMPANY Schema
• Enhanced Entity-Relationship (EER) Modelling
12
Overview of Database Design Process
4
Overview of Database Design Process
– During this step, the database designers interview prospective database users to
understand and document their data requirements.
– In parallel with specifying the data requirements, it is useful to specify the known
functional requirements of the application.
– These consist of the user defined operations (or transactions) that will be applied to the
database, including both retrievals and updates.
5
Overview of Database Design Process
2. Conceptual schema
– Is a concise description of the data requirements of the users and includes detailed
descriptions of the entity types, relationships, and constraints.
– Expressed using the concepts provided by the high-level data model.
6
Overview of Database Design Process
4. Physical design
several projects.
We keep track of the number of hours per week that an
9
Data Modelling Using the Entity-Relationship (ER) Model
Entities and Attributes
Entities are specific objects or things in the mini-world that are represented
in the database.
For example the EMPLOYEE John Smith, the Research DEPARTMENT,
the ProductX PROJECT
Attributes are properties used to describe an entity.
For example an EMPLOYEE entity may have the attributes Name, SSN,
Address, Sex, BirthDate
A specific entity will have a value for each of its attributes.
For example a specific employee entity may have Name='John Smith',
SSN='123456789', Address ='731, Fondren, Houston, TX', Sex='M',
BirthDate='09-JAN-55‘
Each attribute has a value set (or data type) associated with it – e.g. integer,
string, subrange, enumerated type, …
10
Types of Attributes (1)
Simple
Each entity has a single atomic value for the attribute. For example, SSN or Sex.
Composite
The attribute may be composed of several components. For example:
Address(Apt#, House#, Street, City, State, ZipCode, Country), or
Name(FirstName, MiddleName, LastName).
Composition may form a hierarchy where some components are themselves
composite.
Single value
Most attributes have a single value for a particular entity; such attributes are called
single-valued. For example, age is a single-valued attribute of a person.
Multi-valued
An entity may have multiple values for that attribute. For example, Color of a CAR
or PreviousDegrees of a STUDENT.
Denoted as {Color} or {PreviousDegrees}.
11
Types of Attributes (2)
In general, composite and multi-valued attributes may
be nested arbitrarily to any number of levels, although
this is rare.
For example, PreviousDegrees of a STUDENT is a
composite multi-valued attribute denoted by
{PreviousDegrees (College, Year, Degree, Field)}
Multiple PreviousDegrees values can exist
112
Types of Attributes (3)
• Stored vs Derived
– Two (or more) attribute values are related—for example, the Age and
Birth_date attributes of a person. The value of Age can be determined
from the current (today’s) date and the value of that person’s Birth_date.
113
Example of A Composite Attribute
14
Entity Types and Key Attributes (1)
Entities with the same basic attributes are
grouped or typed into an entity type.
For example, the entity type EMPLOYEE and
PROJECT.
An attribute of an entity type for which each entity
must have a unique value is called a key
attribute of the entity type.
For example, SSN of EMPLOYEE.
15
Entity Types and Key Attributes (2)
number.
Each key is underlined
16
Entity-Relationship Diagram
Cardinality
• An Entity-Relationship (ER) Mapping cardinality or the
diagram is a visual cardinality ratio indicates
representation of the data the number of entities of an
model that illustrates the entity set associated with
entities involved in a entities of another entity
set by means of a
database and the relationship
relationships between them.
• Components of ERD 1. 1 to 1
– Entities 2. 1 to many
3. Many to many
– Attributes
– Relationships
– Cardinality
– Primary key
– Foreign key
19
Entity Set
EMPLOYEE
DEPENDENT
22
Refining The Initial Design by Introducing Relationships
The initial design is typically not complete
Some aspects in the requirements will be represented
as relationships
ER model has three main concepts:
Entities (and their entity types and entity sets)
Attributes (simple, composite, multivalued)
sets)
We introduce relationship concepts next
23
Relationships and Relationship Types (1)
A relationship relates two or more distinct entities with a
specific meaning.
For example, EMPLOYEE John Smith works on the ProductX
PROJECT, or EMPLOYEE Franklin Wong manages the Research
DEPARTMENT.
Relationships of the same type are grouped or typed into a
relationship type.
For example, the WORKS_ON relationship type in which
EMPLOYEEs and PROJECTs participate, or the MANAGES
relationship type in which EMPLOYEEs and DEPARTMENTs
participate.
The degree of a relationship type is the number of participating
entity types.
Both MANAGES and WORKS_ON are binary relationships.
24
Relationship Type Vs. Relationship Set (1)
Relationship Type:
Is the schema description of a relationship
Identifies the relationship name and the participating
entity types
Also identifies certain relationship constraints
Relationship Set:
The current set of relationship instances represented
in the database
The current state of a relationship type
25
Relationship Type Vs. Relationship Set (2)
type as follows:
Diamond-shaped box is used to display a relationship type
Connected to the participating entity types via straight lines
26
Refining The COMPANY Database Schema By Introducing Relationships
By examining the requirements, six relationship types are
identified
All are binary relationships( degree 2)
Listed below with their participating entity types:
WORKS_FOR (between EMPLOYEE, DEPARTMENT)
MANAGES (also between EMPLOYEE, DEPARTMENT)
CONTROLS (between DEPARTMENT, PROJECT)
WORKS_ON (between EMPLOYEE, PROJECT)
SUPERVISION (between EMPLOYEE (as subordinate),
EMPLOYEE (as supervisor))
DEPENDENTS_OF (between EMPLOYEE, DEPENDENT)
27
Discussion on Relationship Types
In the refined design, some attributes from the initial entity
types are refined into relationships:
Manager of DEPARTMENT -> MANAGES
Works_on of EMPLOYEE -> WORKS_ON
Department of EMPLOYEE -> WORKS_FOR
etc
In general, more than one relationship type can exist between
the same participating entity types
MANAGES and WORKS_FOR are distinct relationship types
between EMPLOYEE and DEPARTMENT
Different meanings and different relationship instances.
28
Recursive Relationship Type
An relationship type whose with the same participating entity type in
distinct roles
Example: the SUPERVISION relationship
EMPLOYEE participates twice in two distinct roles:
supervisor (or boss) role
supervisee (or subordinate) role
Each relationship instance relates two distinct EMPLOYEE entities:
One employee in supervisor role
One employee in supervisee role
29
Weak Entity Types
An entity that does not have a key attribute
A weak entity must participate in an identifying relationship type with an
owner or identifying entity type
Weak entities are identified by the combination of:
A partial key of the weak entity type
The particular entity they are related to in the identifying entity type
Example:
A DEPENDENT entity is identified by the dependent’s first name, and the
specific EMPLOYEE with whom the dependent is related
Name of DEPENDENT is the partial key
Many-to-many (M:N)
dependent)
31
Recursive Relationship Type is: SUPERVISION (participation role names are
shown)
32
Attributes of Relationship types
A relationship type can have attributes:
For example, HoursPerWeek of WORKS_ON
Its value for each relationship instance describes the
number of hours per week that an EMPLOYEE works on a
PROJECT.
A value of HoursPerWeek depends on a particular
33
Notation for Constraints on Relationships
Cardinality ratio (of a binary relationship): 1:1,
1:N, N:1, or M:N
Shown by placing appropriate numbers on the
relationship edges.
Participation constraint (on each participating
entity type): total (called existence dependency)
or partial.
Total shown by double line, partial by single line.
NOTE: These are easy to specify for Binary
Relationship Types.
34
Alternative (Min, Max) Notation For Relationship Structural Constraints:
Specified on each participation of an entity type E in a relationship type R
Specifies that each entity e in E participates in at least min and at most
max relationship instances in R
Default(no constraint): min=0, max=n (signifying no limit)
Must have minmax, min0, max 1
Derived from the knowledge of mini-world constraints
Examples:
A department has exactly one manager and an employee can
manage at most one department.
Specify (0,1) for participation of EMPLOYEE in MANAGES
Specify (1,1) for participation of DEPARTMENT in MANAGES
An employee can work for exactly one department but a department
can have any number of employees.
Specify (1,1) for participation of EMPLOYEE in WORKS_FOR
Specify (0,n) for participation of DEPARTMENT in WORKS_FOR
35
Alternative Diagrammatic Notation
ER diagrams is one popular example for displaying
database schemas
Many other notations exist in the literature and in
various database design and modeling tools
Appendix A illustrates some of the alternative notations
that have been used
UML class diagrams is representative of another way of
displaying ER concepts that is used in several
commercial design tools
36
Summary Of Notation For ER Diagrams
37
Mapping ER-Models to Relational Tables
1. Identify Entities: Each entity in the ER model becomes a table in the 8. Resolve Many-to-Many Relationships:
relational model. The table name is often derived from the entity name.
Many-to-many relationships in the ER model
2. Identify Attributes: For each attribute of an entity, create a column in
the corresponding table. The attribute name becomes the column name,
are resolved by introducing an associative
and the data type is chosen based on the nature of the attribute. (junction) table. This table includes foreign
3. Identify Primary Keys: Determine the primary key for each table. This keys referencing the primary keys of the two
could be a single attribute or a combination of attributes that uniquely entities involved in the many-to-many
identifies each row in the table. relationship.
4. Identify Relationships: For each relationship in the ER model, 9. Normalizing Tables: Normalize tables to eliminate
determine how it will be represented in the relational model. This redundancy and dependency issues. Ensure that each table
involves identifying foreign keys. adheres to a specific normal form (e.g., 1NF, 2NF, 3NF).
5. Representing Binary Relationships: For binary relationships, where 10. Review and Refine: Review the relational model to ensure
two entities are involved, the primary key of one entity becomes a that it accurately represents the information captured in the
foreign key in the other entity's table. ER model. Refine the model as needed to improve clarity
6. Representing Ternary and N-ary Relationships: For ternary and n-ary and efficiency.
relationships, additional tables may be needed to represent the 11. Integrity Constraints: Implement integrity constraints, such
association between entities. These tables include foreign keys that as primary key constraints, foreign key constraints, and any
reference the primary keys of the entities involved in the relationship. other business rules specified in the ER model.
7. Cardinality and Multiplicity: Represent the cardinality and 12. Indexes: Identify columns that need indexing for efficient
multiplicity constraints by using foreign keys appropriately. For retrieval of data. Indexes can be created on primary keys
example, if an entity has a "many" relationship with another entity, a and foreign keys to optimize query performance.
foreign key is placed in the "many" side's table.
138
Relationships of Higher Degree
Relationship types of degree 2 are called binary
Relationship types of degree 3 are called
Slide 3- 41
Discussion of n-ary relationships (n > 2)
Slide 3- 43
Displaying constraints on higher-degree relationships
The (min, max) constraints can be displayed on the edges –
however, they do not fully describe the constraints
Displaying a 1, M, or N indicates additional constraints
An M or N indicates no constraint
A 1 indicates that an entity can participate in at most one
relationship instance that has a particular combination of the other
participating entities
In general, both (min, max) and 1, M, or N are needed to
describe fully the constraints
Some of the Currently Available
Automated Database Design Tools
COMPANY TOOL FUNCTIONALITY
Embarcadero ER Studio Database Modeling in ER and IDEF1X
Technologies
DB Artisan Database administration, space and security
management
Oracle Developer 2000/Designer 2000 Database modeling, application development
Popkin System Architect 2001 Data modeling, object modeling, process modeling,
Software structured analysis/design
Platinum Enterprise Modeling Suite: Data, process, and business component modeling
(Computer Erwin, BPWin, Paradigm Plus
Associates)
Persistence Pwertier Mapping from O-O to relational model
Inc.
Rational (IBM) Rational Rose UML Modeling & application generation in C++/JAVA
Resolution Ltd. Xcase Conceptual modeling up to code maintenance
Sybase Enterprise Application Suite Data modeling, business logic modeling
Visio Visio Enterprise Data modeling, design/reengineering Visual
Slide 3- 45 Basic/C++