Chapter 10
Chapter 10
and Design
Third Edition
Chapter 10
Structuring System Requirements:
Conceptual Data Modeling
10.1
10.1
Conceptual Data Modeling
Process and logic modeling does not show the definition, structure and
relationships within the data but show how, where, and when data are used
or changed
Data model is most important part of information system requirements as:
The characteristics of data captured during the data modeling are
crucial in design of databases, program, computer screens, and printed
reports – data element is numeric, a name is limited to a specified set, an
item on a customer order can’t be moved to another customer order
Data rather than processes are most complex aspects (validating data,
coordinating movement of data) of many modern information systems like
MIS, DSS, ESS
Characteristics about data(length, format, relationships with other data)
are reasonably permanent whereas paths of data flow are dynamic
Most common format used for data modeling is entity-relationship
(E-R) diagramming
Conceptual Data Modeling
10.3
10.3
Process of Conceptual Data Modeling
10.4
10.4
Deliverables and Outcomes
10.5
10.5
Deliverables and Outcome
10.7
10.7
Gathering Information for Conceptual Data Modeling
• Two perspectives
– Top-down
• Data model is derived from an intimate understanding of the
nature of business
– Bottom-up
• Data model is derived by reviewing specific business
documents – computer displays, reports, business forms
10.8
10.8
Gathering Information for Conceptual Data
Modeling
Requirements Determination Questions for Data Modeling
What are the subjects/objects of business? What type of people,
places, things, events, etc., are used or interact in business whose
data must be maintained? How many instances of each object might
exist? – data entities and their descriptions
What unique characteristic (or characteristics) distinguishes each
object from other objects of same type? Does this distinguish
character change over time or is it permanent? – primary key
What characteristics describe each object? On what basis are
objects referenced, selected, sorted and categorized? – attributes
and secondary keys
How do you use this data? Are you the source of data, do you modify
it or just refer it? Who is not permitted to use it? – security controls
Gathering Information for Conceptual Data
Modeling
Over what period of time are you interested in this data? Do you
need historical trends, current snapshot values, or estimates? –
cardinality and time dimensions of data
Are all instances of each object the same? Are some objects
summaries or combinations of more detailed objects – supertypes,
subtypes, and aggregations
What events occur that imply associations between various objects?
– relationships and their cardinality and degree
Is each activity or event always handled the same way or are there
special circumstances? Can the associations between objects
change over time (employee change departments)? – integrity
rules, minimum and maximum cardinality
Introduction to Entity-Relationship (E-R) Modeling
• Attribute
– A named property or characteristic of an entity that is of interest to an
organization
– Examples of entities and their associated attributes
STUDENT: Student_ID, Student_Name, Student_Address, Major
AUTOMOBILE: Vehicle_ID, Color, Weight, Power, Wheels
– Attribute names are nouns with initial capital letter, followed by
lowercase letters
– Attributes are represented in E-R diagrams by an ellipse with name inside
and a line connecting it to the associated entity
Entity-Relationship (E-R) Modeling
Key Terms
• Candidate keys and Identifiers
– Each entity type must have an attribute or set of attributes that
distinguishes one instance from other instances of the same type
– Candidate key
• Attribute (or combination of attributes) that uniquely identifies
each instance of an entity type
• Candidate key for STUDENT entity type may be Student_ID
– Sometimes more than one attribute is needed to identify a unique
entity
– Some entities may have more than one candidate key
– EMPLOYEE may have one candidate key Employee_ID or a
combination of Employee_Name and Address
10.14
10.14
Entity-Relationship (E-R) Modeling
Key Terms
• Identifier
– A candidate key that has been selected as the unique identifying
characteristic for an entity type
– Selection rules for an identifier
1. Choose a candidate key that will not change its value like
Employee_Address
2. Choose a candidate key that will never be null
3. Avoid using intelligent keys whose structure indicates classifications,
locations, and so on like first 2 digits of a key for a STUDENT entity
might indicate college name
4. Consider substituting single value surrogate keys for large composite
keys
– The name of the identifier is underlined on an E-R diagram
10.15
10.15
Entity-Relationship (E-R) Modeling
Key Terms
• Multivalued Attribute
– An attribute that may take on more than one value for each entity
instance
– A STUDENT can attend more than one class – multivalued attribute
– Represented on E-R Diagram in two ways:
• double-lined ellipse
• weak entity – separate repeating data into another entity and then
using relationship link it to its associated regular entity
• Repeating group – several attributes that repeat together like
EMPLOYEE Dependents – name, age, relation (spouse, child)
• Relationships
– An association between the instances of one or more entity types that is
of interest to the organization
– Association indicates that an event has occurred or that there is a natural
10.16
10.16
link between entity types
– Relationships are always labeled with verb phrase
Degree of Relationship
• Degree
– Number of entity types that participate in a relationship
• Three cases
– Unary
• A relationship between two instances of one entity type (also
called recursive relationship)
– Binary
• A relationship between the instances of two entity types
– Ternary
• A simultaneous relationship among the instances of three entity
types
• Not the same as three binary relationships
• It is recommended that all ternary and higher relationships be
10.17
10.17 represented as associative entities
Figure 10-6
Example relationships of different degrees
10.18
10.18
Cardinality
• The number of instances of entity B that can be associated with each instance
of entity A
• Minimum Cardinality
– The minimum number of instances of entity B that may be associated
with each instance of entity A
– If minimum cardinality of an entity is 0, then that entity is optional
participant in relationship
– If minimum cardinality of an entity is 1, then that entity is mandatory
participant in relationship
• Maximum Cardinality
– The maximum number of instances of entity B that may be associated
with each instance of entity A
Associative Entity
– An entity type that associates the instances of one or more entity types
and contains attributes that are peculiar to the relationship between
10.19
10.19 those entity instances (also called gerund)
Naming and Defining Relationships
Often two or more entity types share common properties but also have
one or more distinct attributes or relationships
Subtype – a subgrouping of the entities in an entity types meaningful
to an organization. Example – STUDENT is an entity type that has two
subtypes GRADUATE STUDENT and UNDERGRADUATE STUDENT
Supertype – a generic entity type that has a relationship with one or
more subtypes
Total specialization rule – each entity instance of the supertype must be
a member of some subtype in the relationship (shown by double line)
Partial specialization rule – an entity instance of the supertype is allowed
not to belong to any subtype in the reationship (shown by single line)
Disjoint rule – if an entity instance of the supertype is a member of one
subtype, it cannot simultaneously be a member of any other subtype
Overlap rule – an entity instance can simultaneously be a member of two
or more subtypes.
Disjoint is shown by a “d” and overlap is shown by a “o” in the circle
Business Rules
The specifications that preserve the integrity of logical data model
Four basic types of business rules:
– Entity integrity – each instance of an entity type must have unique identifier that is
not null
– Referential integrity constraints – rules concerning the relationships between entity
types
– Domains – constraints on valid values for attributes
– Triggering operations – other business rules to protect validity of attribute values
Domains
• The set of all data types and ranges of values that an attribute can assume – data
type, length, format, range, allowable values, meaning
• Several advantages
1. Verify that the values for an attribute are valid
2. Ensure that various data manipulation operations are logical
3. Help conserve effort in describing attribute characteristics
10.22
10.22
Triggering Operations
10.24
10.24