Topic 2 Data Models
Topic 2 Data Models
Database design
focuses on how the database structure
will be used to store and manage end-user data
Data modeling
is the first step to design a database
DATA MODELING AND DATA MODELS
5
DATA MODEL BASIC BUILDING BLOCKS
6
Business Rules
8
Sources of Business Rules
Company managers
Policy makers
Department managers
9
Group Project: Fact-Finding Techniques
10
Examining
documents
(document review)
Questionnaire Interviewing
Commonly Used
Fact-Finding
Techniques
Observation the
Research organization in
operations
Reasons for Identifying and Documenting
Business Rules
Example 1
A painter must paint many paintings.
A painting must be painted by one and only one painter.
Example 2
An employee may learn many skills.
A skill may be learnt by many employees.
Example 3
An employee may manage one store.
A store must be managed by one employee.
12
Translating Business Rules into Data Model
Components
13
Activity: Translating Business Rules into Data
Model Components
Semantic data
- data is
organized in
such a way that
it can be
interpreted
meaningfully
without human
intervention
Hierarchical and Network Models
17
Hierarchical Model
18
Network Model
19
Standard Database Concepts from Network
Model Still Used by Modern Data Models
Relational Model
Based on a relation
• Relation or table: Matrix composed of intersecting tuple (rows) and attribute (columns)
Advantages Disadvantages
Relational Model
23
A Relational Diagram (MS Access)
26
The Object-Oriented Data Model (OODM) or
Semantic Data Model
Object: Contains data and their relationships with operations that are performed on it
• Basic building block for autonomous structures
• Abstraction of real-world entity
Class: Collection of similar objects with shared structure and behavior organized in a
class hierarchy
• Class hierarchy: Resembles an upside-down tree in which each class has only one parent
Advantages Disadvantages
30
Big Data
Aims to:
Find new and better ways to manage large amounts of web
and sensor-generated data and derive business insight from it
Provide high performance and scalability at a reasonable cost
HADOOP
HADOOP DISTRIBUTED FILE
SYSTEM(HDFS)
MapReduce NoSQL
33
Frequently Used Big Data New Technologies
• Hadoop is an open source distributed processing framework that manages data processing and storage for big data
applications running in clustered systems.
• It is at the center of a growing ecosystem of big data technologies that are primarily used to support advanced analytics
initiatives, including predictive analytics, data mining and machine learning applications.
Hadoop • Hadoop can handle various forms of structured and unstructured data, giving users more flexibility for collecting,
processing and analyzing data than relational databases and data warehouses provide.
34
NoSQL (Not only SQL / Non SQL) Databases
Semantic data
- data is
organized in
such a way that
it can be
interpreted
meaningfully
without human
intervention
Data Models: A Summary
40
Degrees of Data Abstraction
42
The External Model
43
The Conceptual Model
48
Summary
Relational model
Current database implementation standard
ER model is a tool for data modeling
◼ Complements relational model
Object-oriented data model: object is basic modeling
structure
Relational model adopted object-oriented extensions:
extended relational data model (ERDM)
OO data models depicted using UML
Data-modeling requirements are a function of different
data views and abstraction levels
Three abstraction levels: external, conceptual, internal &
physical
50