0% found this document useful (0 votes)
22 views63 pages

MOIS Intro To MIS & Databases Ch1 & Ch2

The document outlines the course structure for MOIS 406/3201, focusing on Management Information Systems and Database Management, taught by Professor Galal H. Galal-Edeen. It includes course objectives, topics to be covered, assessment schemes, and expectations for student participation. The course emphasizes the importance of data management and relational databases in modern organizations.

Uploaded by

mirnamonir38
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views63 pages

MOIS Intro To MIS & Databases Ch1 & Ch2

The document outlines the course structure for MOIS 406/3201, focusing on Management Information Systems and Database Management, taught by Professor Galal H. Galal-Edeen. It includes course objectives, topics to be covered, assessment schemes, and expectations for student participation. The course emphasizes the importance of data management and relational databases in modern organizations.

Uploaded by

mirnamonir38
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 63

9/25/24

We l c o m e t o
MOIS 406/3201
Management Information
Systems and Database
Management (3 cr.)
Department of Management
School of Business

Your Professor

Galal H. Galal-Edeen,
BSc (SAMS), BA(Arch, Greenwich), MSc(BSAD, City), MSc(Sp. Des., UCL), PhD (Brunel), MBCS CITP, CT
Professor of Information Systems Engineering,
Department of Management
AUC School of Business
Certified Trainer & Certified Usability Analyst
University Chair on Innovation (CU)

[email protected]
2

1
9/25/24

This session
• Aim & objectives of this course.
• Introductions.
• Your aims from this course.
• Planned topics.
• Communication.

MOS 406-3201 3

Catalog Description
Description
• The course aims at defining a framework of management information
systems with emphasis on the organization. It relates to a number of
important organizational aspects such as the human and
technological infrastructure and the needs and requirements of an
organizational information system. The course also covers the
relational database model, with special emphasis on the design and
querying of relational databases and exploration of the relationship of
database to the rest of the system.
Prerequisites
MOIS 2101 “Introduction to Information Systems/Technology (3 cr.)”

MOS 406-3201 4

2
9/25/24

Checking in
How are you feeling today?

Image Gallery Check-in


What image in the Image Gallery speaks
to you at this moment? Select one.
• Why do you choose this image?
• How does it reflect something you feel or
something you are thinking about at this
point in time?

3
9/25/24

Let’s get introduced!


• You first:
• Name, program & level. What did you like about IT before?
• Tell us something interesting or unusual about yourself!
• What do you really hope to learn in this course? How can it
contribute to your life/ career goals?

4
9/25/24

Your Professor
Galal H. Galal-Edeen BSc, BA(Arch), MSc, MSc( Sp. Design), PhD, MBCS, HRDC/ CT, CUA

•University Chair on Innovation (CU)


•Professor of Information Systems Engineering
•Served as visiting professor @ UCL, AUC & Nile Univ.
•Strategy & Technology Transfer
•Architecture
•Certified Trainer & HR Dev. Consultant
•Certified Usability Analyst

Learning Objectives (agreement?)


Upon passing this course, you should be able to:
1. Understand the importance of systematic and well-planned data management
for the success of any modern organization.
2. Understand new developments and trends in databases and how they are a
main important component in any MIS.
3. Understand the fundamentals of relational database systems including data
modelling, mapping of data models into database structures, database
architectures, and database manipulation techniques.
4. Model enterprise data successfully, map models onto database relations and
write queries to manipulate such relations successfully.
5. Understand the theories and techniques in developing database applications
and be able to demonstrate the ability to build databases using enterprise
DBMS products.

MOS 406-3201 10

10

5
9/25/24

Topics
1. What are databases (DBs)? Why are they important? Who are its users?
(E&N Ch.1).
2. What are the fundamental DB concepts? What are its components? How
are they built, and how are they placed within the organization? (E&N
Ch.2).
3. Modeling Data (using the Entity-Relationship model) (E&N Ch.3).
4. The Relational Data Model & it’s main characteristics (E&N Ch.5).
5. Defining, manipulating, and querying data (using SQL) (E&N Ch.6).
6. The Enhanced Entity Relationship (EER) Model (E&N Ch.4).
7. Mapping ER and EER model constructs to Relations (E&N Ch.9).
8. Simplifying and data models (Normalization) (E&N Ch.14).

MOS 406-3201 11

11

Assessment Scheme
Grade Distribution
• Project Phase-1. 5%
• Project Phase-2. 10%
• Project Phase-3. 10%
• Class Participation 5%
• Assignments/ quizzes. 20%
• Mid-term Exam-1 20%
• Final Exam 30%

Group Project Cla ss Pa rticipation Assignments, Portfolio & Quiz Mid-term Fina l Exam

(Please upload your photo to Canvass!)


MOS 406-3201 12

12

6
9/25/24

Example project areas (v2)


1) Pharmacy CRM
2) Vehicle spare parts and/ or servicing.
3) Electric accessories shop.
4) A doctor’s clinic appointments and follow up system.
5) Tourist excursions bookings.
6) Personal media streaming and blogs (with subscriptions for targeting
announcements).
7) Ma’s kitchen with online food reservation and delivery.
8) A family’s database.
9) A database to support the circular economy.
10) Education course support.
11) A restaurant/ café review and customer comments system (see Foursquare for
example).

13

Projects Group Formation


• Group size is a maximum of 5 students.
Note change
• Each project area will be available to only one group.
• Please email group names and project area to TA Omar Attia:
[email protected] (Sections 1 & 3), or TA Mahmoud
Esmat: [email protected] (Section 2)
• By end of day Wednesday 11th Sept.
• TAs will announce office hours soon, so please be alert and make use
of those hours!

14

7
9/25/24

Attendance
Reminder of AUC policy:
• As stated in the university catalog, “students are expected to attend
class; there is no system of permitted absences.” Students may not
normally receive credit for a course if more than three weeks of
classes (or their equivalent in summer school) are missed. To clarify, if
you miss more than six (6) classes you could get an “F”.

MOS 406-3201 15

15

Some ground rules


• Late submission policy: I never say no to a student who wishes to
catch up on missing work, but of course do not expect to be graded at
the same scale as students who submitted on time!.
• How to lose participation grades (fast):
• Frequently arrive late to class.
• Be often disruptive by talking audibly to classmates.
• Be often obviously distracted by your mobile phone/ tablet, etc.
• Don’t participate by raising your hand from time to time to answer questions.
• Coming to class 10 minutes or more late will result in “late” mark on
the Attendance list!

16

8
9/25/24

Interacting with your Professor and TAs


• Make use of Office Hours.
• Emails should have an expressive Subject, and the course code and
title (MIS-Databases).
• If an email goes unanswered for more than 48 hours, re-send, but
please try not to overuse emails & be to the point!
• If it’s a specific request, please provide your full name, section no.
and AUC ID number.

17

Main Textbooks

Get a personal copy that you can access regularly!

Fundamentals of Database Systems, Global


Edition Paperback – 2017
by Ramez Elmasri (Author), Shamkant Navathe (Author)
Publisher : Pearson; 7th edition (26 Aug. 2016)
Language : English
Paperback : 1272 pages
ISBN-10 : 1292097612
ISBN-13 : 978-1292097619
Dimensions : 18.9 x 3.9 x 23.4 cm

18

18

9
9/25/24

Some rules of engagement!


• Do some (relevant) reading ahead of the class.
• Attend lectures & pay attention: ask if confused or you have
questions.
• Draft lecture slides will normally be posted online before the relevant
session.
• Upload a clear personal photo (preferably: a passport-style photo) to
you Blackboard record please!
• Be an active listener: make notes!
• Do all the exercises and seek assistance when you need it.
• Apply yourself in group projects.
19

19

20

20

10
9/25/24

Session 2

Data & its Role in Organizations

Weeks 1 & 2

21

21

Modern organizations can’t survive


without DATA
• Trading Business • Essential for day-to-day
• Government operations and control
• Health care • Responding to emergencies and
unexpected developments.
• Manufacturing
• And for formulating competitive
• Banking strategy.
• Hospitality
• Education
• Military

22

11
9/25/24

Data in modern applications:


“Big Data Keeps Getting Bigger”
Every minute of the day in 2019:
• YouTube (4,500,000 videos
watched)
• Twitter (511,200 tweets sent)
• Google (4,497,420 searches
conducted)
• Instagram (55,140 photos
posted)
• And many more!!!

https://www.visualcapitalist.com/big-data-keeps-getting-bigger/

https://www.dataversity.net/the-problem-with-big-data-its-getting-bigger/

23

Important Storage Capacity Terminology

Figure 4.6 Important Storage-Capacity Terminology


Term Definition Abbreviation
Byte Number of bits to represent one character
Kilobyte 1,024 bytes KB
Megabyte 1,024 KB 5 1,048,576 bytes MB
Gigabyte 1,024 MB 5 1,073,741,824 bytes GB
Terabyte 1,024 GB 5 1,099,511,627,776 bytes TB
Petabyte 1,024 TB 5 1,125,899,906,842,624 bytes PB
Exabyte 1,024 PB 5 1,152,921,504,606,846,976 bytes EB
Zettabyte 1,024 EB = 1,180,591,620,717,411,303,424 bytes ZB

Source: Kroenke & Boyle, 2023 Kilo = 210 Mega 220 Giga 230 Tera 240

24

12
9/25/24

Questions

• What is Data?
• What is a database?
• What is Metadata?
• What is a UoD (Universe of
Discourse), and how does that
relate to a database?
• What is the difference between a
Database System and DBMS
software?

25

Information Systems: Data Concepts and Definitions

— Data Item
Elementary description of things,
events, activities and transactions that
are recorded, classified and stored but
are not organized to convey any
specific meaning.
A data item has a TYPE.
• Can you think of examples?
Data need to be contextualized to have meaning!
Example: today’s temperature is 16 degrees Celsius
Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc. 26

26

13
9/25/24

Information Systems: Data Concepts and Definitions

— Information
Data organized (processed) so that
they have meaning and value to
the recipient. Information is Data
that has meaning in a particular
context: purpose.
So: it depends on the recipient
and the context!

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc. 27

27

Information Systems: Concepts and Definitions

— Knowledge
Data and/or information organized
and processed to convey
understanding, experience,
accumulated learning and expertise
as they apply to a current problem or
activity.
Knowledge: Explicit functional
associations between items of
information and/ or data (Debenham
‘89).
If it rains => the roads become wet
Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.
28

28

14
9/25/24

Wisdom

Can be defined as the collective and individual experience of


applying knowledge to the solution of problems.

Wisdom involves where , when and where to apply knowledge.

Source: Laudon & Laudon “Management Information Systems - Managing the Digital Firm”, 7e, 2022

29

Context

Wisdom
Understanding Principles
knowledge
Understanding Patterns
information

Understanding relations

data
Understanding
30

30

15
9/25/24

DIKW pyramid

31

31

Figure 6.1 The Data Hierarchy

32

16
9/25/24

Figure 6.2 Traditional File Processing


Question: what do you think are
the problems with this
approach?

33

Problems with the Traditional File


Environment

• Files maintained separately by different departments


• Data redundancy
• Data inconsistency
• Program-data dependence
• Lack of flexibility
• Poor security
• Lack of data sharing and availability

34

17
9/25/24

Organisations as systems
• System: a collection of inter-related parts that has a common
purpose. Change to one element leads to or results from change in
others.
• Do you see where the importance of data lies?

Environmental data
Control

Inputs Process Outputs

35

35

The organisation and its environment

Environmental Resources
Changes, Risks and Constraints
The Firm
Governments
IS
Competitors
Customers

Financial Institutions
Culture

Potential staff

Adapted from: Laudon & Laudon p 80 36

36

18
9/25/24

Figure 6.3 Human Resources


Database with Multiple Views

37

Figure 6.4 Relational Database


Tables

How do you get to this? (design)

38
Source: Laudon & Laudon, 16ed., 2020

38

19
9/25/24

Activity (10 minutes work)


• Think of the data needs of a system that you are familiar with (you’ll
need to think of the stakeholders/ users as well!)

39

1- Financial
• The financial sector is a section of the economy made up of firms and institutions
that provide financial services to commercial and retail customers.
• This sector comprises a broad range of industries including banks, investment
companies, insurance companies, and real estate firms.
• A large portion of this sector generates revenue from mortgages* and loans
• The health of the economy depends on the strength of its financial sector. The
stronger it is, the healthier the economy.
• A weak financial sector typically means the economy is weakening.

*A mortgage is a legal agreement by which a bank, building


society, etc. lends money at interest in exchange for taking
title of the debtor's property, with the condition that the
conveyance of title becomes void upon the payment of the
debt.

40

20
9/25/24

2- Transportation
• The transport industry is one of the largest types of
business industries.
• This industry deals with the movement of humans, things
and animals by various modes of transportation.
• The main modes of transportation are air, land (Road and
Rail) and water.
• The transportation industry makes the largest part of any
country’s economy.
• A strong Transportation is essential for a country’s growth.
• Other than these transportations, water transportation is
one of the elite ways to commute and transport goods from
one place to another.

41

3- Manufacturing
• Manufacturing is the process of converting raw material
and components into the final product to sell in the
marketplace for public consumption.
• The manufacturing industry has many categories, for
example, food industry, petroleum industry, paper
industry, wood industry, leather industry, textile mill
industry, apparel industry, transportation equipment
industry, electrical equipment industry etc.
• Goods can be prepared in the mill, factories or plants for
example fabric and goods can also be prepared by hands to
sell in the market.
• Manufacturing industries provide employment to educated
as well as labor class population of any country.

42

21
9/25/24

4- Retail
• The retail field consists of supermarkets,
department stores, chain stores,
specialty stores, variety stores, franchise
stores, mail-order houses, online
merchants, and door-to-door sellers.
• Retail stores buy their goods from
wholesalers, stock the goods, and resell
them to individual consumers in small
quantities.

43

5- Pharmaceuticals
• The pharmaceutical company is a licensed firm to research, discover,
develop, market and sell the drugs
• It is clearly research and development (R&D) based industry.
• Constant research is an important aspect of the pharmaceutical
industry.
• These companies research and produce drugs to cure diseases of
humans and animals.
• There are some drugs that also treat symptoms.
• Pharmaceutical companies are highly regularized companies.
• The United States’ pharmaceutical industry comprises 45% of the
Global Pharmaceutical Market.
• Johnson & Johnson, Pfizer and Merck &Co. are the world’s leading
pharmaceutical companies.
• The United States pharmaceutical industry together with Canada and
Mexico make the world’s largest pharmaceutical market.
• Medical device industry is another aspect of the pharmaceutical
industry which deals with surgical equipment’s, medical devices and
implants.

44

22
9/25/24

6- Health care
• Health care industry provides remedial, diagnostic, curative,
preventive, rehabilitative, therapeutic services to patients and to
people in their old age.
• The main goal of this industry is to revive and maintain the health of
peoples.
• It is one of the fastest growing industries.
• This type of business industry has great potential for investors and
entrepreneurs.
• Every country’s government spends a handsome amount of money
on this sector.
• There are both private and public companies to provide services.
• There are also many voluntary organizations which provide services
to needy people free of charge.

45

7- Education
• The Education industry or the education system is a group
of schools, colleges, universities, institutes to provide
learning, knowledge, skills, and degrees in various fields.
• Educational institutes can broadly be divided into two
categories: public and private.
• Enrollment of students is increasing every year in both
traditional as well as in online courses.
• Online courses are most preferred by working people or
people above the age of 30 years, whereas regular students
prefer to enroll in traditional courses.
• Every country keeps a large portion of the annual budget for
this sector to update the education system to provide better
skills to students.

46

23
9/25/24

8- Hospitality
• The hospitality industry is a vast type of
business industries.
• It provides uncountable services to
customers, so this industry is highly
customer- oriented industry.
• This industry is exceptional, because the
hospitality industry solely survives on “the
want” of people, not need like health
care industry.
• There are three main broad categories of
this type of industry such as
accommodations, food and beverage,
and travel and tourism.

47

9- Social Media
• News Media industry is an important industry.
• The target of this industry is to provide news to the people.
• It makes people aware of the happenings in the world.
• There are various media platform for example newspaper,
Television, Radio, online blogs, websites, podcasts.
• Nowadays, social media has taken the NEWS Media industry to
another level.
• People come to know about the happenings of the world
instantly.
• Some famous social media platforms are Facebook, Twitter,
Instagram, YouTube etc.
• Getting information has become handy for the people of 21st
century.
• People can also read News online by installing the apps of various
NEWS channels.

48

24
9/25/24

10- Entertainment
• The entertainment industry is a huge business industry.
• It includes innumerable sub-sectors of entertainment.
• This industry runs on both entertainer and viewers.
• Entertainment has been an integral part of human society since ages,
but recently it has become commercialized.
• This industry has grown by leaps and bounds in the past century.
• These types of business industries such as entertainment make a
great part of the Gross Domestic Product (GDP) of any country.
• For example, football teams such as Belgium, France, and Brazil.
• These teams generate great revenue for their respective countries.
• The entertainment industry provides employment to a great number
of people.
• There are many entertainers which are among the richest people in
the world.

49

Entertainment-2

• These days the methods of entertaining people have changed.


• People don’t have to put a lot of efforts to get into the limelight as
they used to have.
• They can entertain people by sharing their stuff on YouTube and
other similar platforms.
• Digital platform has helped to grow entertainment industry
exponentially.
• Also with the availability of online video streaming companies like
Netflix and Amazon prime has changed the way users interact with
movies.

50

25
9/25/24

Session 4

Information Systems Levels and


Types, Data Warehouses

Week 2

51

51

Information – A Key Organizational Resource

• Fuels business and can be the critical factor in determining the


success or failure of a business.
• Needs to be properly managed.
• Vital for decision making & strategy formulation.
• The explosion of data, caused by huge increases in connectivity
(relate IoT?)
• Managing computer-generated information differs from handling
manually produced data.

Source: Kendall & J. Kendall, 10th ed. Ch 1


52

52

26
9/25/24

The Networked Enterprise!

Corp.
Network
Vendors

Suppliers
Internet

Adapted from: L& L p. 227


53
MOS 305-3201
Customers

53

Data, Information, knowledge management


• Levels: Data/ Information/ knowledge/ wisdom.
• Information management (IM) is the collection and management of
information from one or more sources and the distribution of that
information to one or more audiences. This sometimes involves those who
have a stake in, or a right to that information. Management means the
organization of and control over the structure, processing and delivery of
information (in the right form).
• See more at: http://www.aiim.org/What-is-Information-
Management#sthash.AgnA5rkQ.dpuf
• See Abrell et al. 2016. “The role of users and customers in digital
innovation..”

© 2023 Prof. Dr. G. H. Galal-Edeen All rights reserved 54

54

27
9/25/24

Source: L& L p. 41

Types of information systems: use to


guide requirements elicitation
Executive Support Strategic-Level Systems
Systems (ESS) 5-year sales trend forecasting 5-year operating plan
5-year marketing plan 5-year personnel planning etc.

Management Info
Management-Level Systems
Systems (MIS)/ EIS
Sales management/ Inventory management
Decision Support Systems (DSS) Sales analyses by region/ Relocation analysis/ Production scheduling
Self
Knowledge-Level Systems
Knowledge Work
CAD workstations Graphics Workstations
Systems (KWS)/ Word Processing/ Spreadsheets/ Project Management tools
Office Systems Document Management tools/ Personal Information Management

Transaction Operational-Level Systems


Processing Order processing & tracking Machine control Employee record keeping
Systems (TPS) Material movement Cash management Accounts payable/ receivable
ng
&M arketi ing
factur Finance nting 55
© 2023 Dr G H Galal-Edeen Sales M a n u Accou HR

55

Class Discussion (important!)


• What types & examples of information and reports that each of the
previous levels of management might need? Think of an example: a
large chain hypermarket like Carrefour.

56

56

28
9/25/24

From TPS to MIS


TPS ERP = Enterprise Resource Planning

Order Data
Order Sales
Processing Warehouse
file data
System
Reports

Unit
Material Product
Production Resource cost Product MIS
file Planning
data change
data

Data analytics
General
Accts Ledger Expense
file System
data
Insights
Adapted from: L& L p. 46 57
© 2022 Prof. G H Galal-Edeen MOS 406-3201

57

Information System - IS Re
vis
ion
: S
e lf-
An Information System (IS) is a system that collects, stu
dy
processes, stores, analyzes and disseminates
information for a specific purpose.

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

58

29
9/25/24

Computer-based Information
Re
System - CBIS vis
ion
: S
e lf-
stu
dy

An information system that uses computer technology to perform


some or all of its intended tasks.

A related term: Information Resource Management

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

59

Computer-Based Information Systems (continued)


Re
vis
ion
: S
e lf-
stu
dy

Figure 1.4: The Components of a Computer-Based


Information System
Fundamentals of Information Systems, 3rd ed. 60

60

30
9/25/24

Basic Components of
Information Systems Re
vis
ion
: S
e lf-
stu
dy

1. Hardware is a 2. Software is a 3. Database is a


device such as a program or collection of
processor, collection of related files or
monitor, programs that tables
keyboard or enable containing data.
printer hardware to
process data. A
software is a
system software
or an
application
program.

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

61

Re
vis
ion
: S
elf
-st
ud
y
Basic
Components 4. NETWORK IS A 5. PROCEDURES ARE THE 6. PEOPLE ARE THOSE

of Information CONNECTING SYSTEM


(WIRELINED OR
SET OF INSTRUCTIONS
ABOUT HOW TO
INDIVIDUALS WHO USE
THE HARDWARE AND

Systems - 2
WIRELESS) THAT PERMITS COMBINE THE ABOVE SOFTWARE, INTERFACE
DIFFERENT COMPUTERS COMPONENTS IN ORDER WITH IT, OR USES ITS
TO SHARE RESOURCES. TO PROCESS OUTPUT.
INFORMATION AND
GENERATE THE DESIRED
OUTPUT.

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

62

31
9/25/24

Teaser questions
• What do you think a Datawarehouse is?
• What do you think the difference between a Datawarehouse, and a
Database is?
• How can a Datawarehouse be of use to the Management of an
organization?
• What is the difference between a Datawarehouse and a Data Lake?

63

Introduction, Definitions, and


Terminology (1)
• Data Warehouse (DW) was proposed as a new type of database
management system which would keep no transactional data but only
summarized historical information for decision making purposes.
• What are transactional data? Give examples.
• Non-real time with periodic insertions.
• W. H Inmon characterized a data warehouse as:
• “A subject-oriented, integrated, nonvolatile, time-variant collection of
data in support of management’s decisions.”

Slide 29- 64

64

32
9/25/24

Purpose of Data Warehousing


• Traditional databases are not optimized for data access - they have to
balance the requirement of data access with the need to ensure
integrity of data.
• DWs provide access for complex analysis of data, knowledge
discovery and decision support both through ad-hoc and canned
(pre-designed) queries.
• Most of the times the data warehouse users need only read access
but, need the access to be fast over a large volume of data.
• Most of the data required for data warehouse analysis comes from
multiple sources that may include databases from different data
models and sometimes files acquired from independent systems and
platforms.

Slide 29- 65

65

Introduction, Definitions, and


Terminology (2)
• Data warehouses are databases that store and maintain analytical data separately
from transaction-oriented databases for the purpose of decision support
• Traditional databases support online transaction processing - OLTP .
• Data Warehouses are for analytical applications- largely OLAP.
• Applications that data warehouse supports are:
• OLAP (Online Analytical Processing) is a term used to describe the analysis of
complex data from the data warehouse.
• DSS (Decision Support Systems) also known as EIS (Executive Information Systems)
supports organization’s leading decision makers for making complex and important
decisions.
• Data Mining is used for knowledge discovery, the process of searching data for
unanticipated new knowledge (See Chapter 28).

Slide 29- 66

66

33
9/25/24

Data Warehousing
— A Data warehouse is a repository of historical, non-
volatile data organized by subject to support decision
makers in the organization and include:
◦ Mainly Structured, but also some Semi-structured data (not
all data have the same structure/ attributes, no pre-defined
schema).
◦ While certain advancements allow for incorporating some
unstructured data (such as images, text) in data warehouses,
they generally aren't designed for storing large amounts of it.
◦ Multiple/ heterogenous sources: PoS transactions, Marketing,
Automation, Customer Relationship Management, Web
scraping (extracting data from web sites using automating
tools), etc.
◦ Multidimensional data structure/ model which allows data to
be represented in a multi-dimensional matrix (or data cube).
See the STAR schema a few slides ahead...
◦ Supports Online analytical processing (OLAP) which involves
the analysis of accumulated data by end users;
◦ Allows ad hoc and canned (pre-defined) queries.
Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

67

Visualizing Data Warehouses


• Example of Two- Dimensional vs. Multi- Dimensional (3D typically
called “Data Cube”)

Figure 29.2 A two-dimensional Figure 29.3 A three-dimensional


matrix model. data cube model.

Slide 29- 68 Elmasri & Navathe, 7 ed

68

34
9/25/24

Functionality of a Data Warehouse


• Functionality that can be expected:
• Pivot: Cross tabulation (also referred to as rotation) is performed.
• Roll-up (also Drill-up): Data is summarized with increasing generalization (for
example, weekly to quarterly to annually).
• Drill-down: Increasing levels of detail are revealed (the complement of roll-
up).
• Slice and dice: Projection operations are performed on the dimensions.
• Sorting: Data is sorted by ordinal value.
• Selection: Data is filtered by value or range.
• Derived (computed) attributes: Attributes are computed by operations on
stored and derived values.

Slide 29- 69

69

The Pivot operation in a Data Warehouse


• Figure 29.4 Pivoted version of the data
cube from Figure 29.3.

This presents data


in terms of Region
vs. Fiscal Quarter:
product by product.
“Pivoting” is also
called as “Rotation”

Slide 29- 70 Elmasri & Navathe, 7 ed

70

35
9/25/24

The Star schema for modeling DWs

Source: Google Images

71

Benefits of Data
Warehousing
• End users have a consolidated view of organizational
data.
• A repository for mainly structured, filtered data that
has already been processed for a specific purpose.
• Some unstructured data can be stored but the focus is
on structured data.
• End users can access data quickly and easily, usually
via Web browsers because the data types are in one
place.
• End users can conduct extensive analyses with data in
ways that may not have been possible before.
• Supports formulation and management of
organizational knowledge.

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

72

36
9/25/24

Data lakes
• A data lake is a repository of raw, unstructured or
structured data that for the most part has not yet
been analyzed (Laudon and Laudon, 16th ed.).
• The data is stored in their native (original) format
until they are needed.
• No use has yet been determined for this large pool
of raw data.
• Allow for massive storage.
• Large and has many users.
• Unstructured data in natural or “raw” format.
• The data can be accessed in many ways.
• Often stored in some kind of cloud.
• Check also: Data Ocean, Data Pool, Data Pond

73

Data Marts

• A Data mart is a small data warehouse, designed for the


end-user needs in a strategic business unit (SBU) or a
department/ specific population of users. It has a defined,
narrow scope.
• Are generally targeted to a subset of the organization, such
as a department, and are more tightly focused. Example:
clothing sales only, car servicing data only.

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

74

37
9/25/24

Data Mining
• Data mining involves searching for valuable business information
(knowledge discovery) in a large database, data warehouse, or data
mart.
• Used to predict trends and behaviors (important for strategy
formulation, innovation and new product development).
• Identify previously unknown patterns.
• Examples: house prices, crime rates, weather, seasonal variations
in demand, etc.
• Applications: Tableau, PowerBI

Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

75

76

76

38
9/25/24

The Data Life Cycle

Question: what is metadata? Can you give an example?


Introduction to IS, 1st ed. by Rainer, Turban and Potter, John Wiley & Sons, Inc.

77

A Glimpse of Data
Visualization!
Why?

78

39
9/25/24

Column/ Bar Chart

79

Radar or Spider Chart

80

40
9/25/24

Using Graphs
Graphical representations give overview of data
Number of errors made
Number of errors made
4.5

Number of errors made


Number of errors made

10 4
3.5
8
3
6 2.5
2
4 1.5
2 1
0.5
0 0
0 5 10 15 20 1 3 5 7 9 11 13 15 17
User
User

Think about which Internet use

graphs are most suitable < once a day


for the purpose once a day

once a week

2 or 3 times a week

once a month

81

81

Data Visualization Graphics


• Provide representations at the interface that reduce
memory and visual loads and facilitate computational
offloading.. Big Data!
• You can vary the shape, colour and data written on points.
• 2D charts are easier to read than 3D.

Information
visualizations
have been
designed to
allow people
to make sense
of and rapid
decisions
about masses
of data

www.id-book.com 82

82

41
9/25/24

Idea 1: Presentation in a Cube

83

83

Idea 2: Dashboard

84

84

42
9/25/24

Idea 3: Customizable Frames

85

85

Idea 4: Visualization maps (financial)

86

86

43
9/25/24

Idea 5: Selection & scoping (sliding &


adjustable window)

87

87

Video Cases
• Case 3: Maruti Suzuki Business Intelligence and Enterprise Databases
• http://www.youtube.com/watch?v=aZWAzbRm-ms L=5:30

88

44
9/25/24

Databases and Database Users


Based on Chapter 1 of the text by Elmasri and Navathe (7th ed)

89

Basic Definitions
• Database:
• A collection of related data.
• Data:
• Known facts that can be recorded and have an implicit meaning.
• Mini-world (domain of interest?):
• Some part of the real world about which data is stored in a database. For example, student
grades and transcripts at a university.
• Database Management System (DBMS):
• A software package/ system to facilitate the creation and maintenance of a computerized
database.
• Database System:
• The DBMS software together with the data itself. Sometimes, the applications are also
included.

Slide 1- 90 Source: Elmasri & Navathe (2017) 90

90

45
9/25/24

Simplified database system environment

Slide 1- 91

91

DBMS Software Functionality


• Define a database: in terms of data types, structures and constraints

• Construct or Load the initial database contents on a secondary


storage medium

• Manipulating the database: querying, generating reports, insertions,


deletions and modifications to its content

• Concurrent Processing and Sharing by a set of users and programs –


yet, keeping all data valid and consistent

92

46
9/25/24

Application Activities Against a Database


• Applications interact with a database by generating:
- Queries: that access different parts of data and formulate the result of
a request
- Transactions: that may read some data, record data, or “update”
certain values or generate new data and store that in the database
• Applications must not allow unauthorized users to access data.
• Applications must keep up with changing user requirements against
the database.

Slide 1- 93

93

DBMS Software Functionality


Other features:
• Providing Presentation and Visualization of data
• Protection or Security measures to allocate access/ change privileges
and prevent unauthorized access.
• Maintaining data and database structures.

94

47
9/25/24

Example of a Database Definition


Fragment
(with a Conceptual Data Model)
• Mini-world for the example: Part of a UNIVERSITY environment.
• Some mini-world entities:
• STUDENTs
• COURSEs
• SECTIONs (of COURSEs)
• (academic) DEPARTMENTs
• INSTRUCTORs

Slide 1- 95

95

Example of a Database
(with a Conceptual Data Model)
• Some mini-world relationships:
• STUDENTs take SECTIONs
• COURSEs have prerequisite COURSEs
• SECTIONs are of specific COURSEs
• INSTRUCTORs teach SECTIONs
• COURSEs are offered by DEPARTMENTs
• STUDENTs major in DEPARTMENTs

• Note: The above entities and relationships are typically expressed in a conceptual
data model, such as the ENTITY-RELATIONSHIP data model (see Chapters 3, 4)

Slide 1- 96

96

48
9/25/24

Example of a simple database

Slide 1- 97

97

Main Characteristics of the Database


Approach (1)
• Self-describing nature of a database system:
• Data definitions need not be in the application program.
• A DBMS catalog stores the description of a particular database (e.g., data structures,
types, and constraints)
Meta-data: data about the data!
• The description is called meta-data*.
• This allows the DBMS software to work with different database applications.
• Insulation between programs and data:
• Called program-data independence (logical and physical).
• Allows changing data structures and storage organization without having to change
the DBMS access programs (physical independence).
• The structure of the overall schema can change without having to change the
program logical independence).
-----------------------------------------------------------------------------
* Some newer systems such as a few NOSQL systems need no meta-data: they store
the data definition within its structure making it self-describing.
Slide 1- 98

98

49
9/25/24

App programmer: Where has my data


gone??? (Below is a physically distributed
Database or Data Warehouse environment)

Distribution supports:
• Load balancing
• Scalability
• High availability

Disk A Disk B Disk C

99

Example of a simplified database catalog

All the relations


(tables) in the
database

All the columns in the


database

Slide 1- 100

100

50
9/25/24

Main Characteristics of the Database


Approach (2)

• Insulation between Program and Data: Data Abstraction:


• Different levels of abstraction (why?)
• A data model is used to hide storage details and present the users with a
conceptual view of the database.
• Programs refer to the data model constructs rather than data storage details
• Support of multiple views of the data:
• Each user may see a different view of the database, which describes only
the data of interest to that user è Logical independence.

Slide 1- 101

101

Figure 2.2 The three-schema


architecture.
each
t be se of
s o
: mu purp
T
TAN the
R d
M PO stan
I der
un

102

51
9/25/24

Example of a short-answer question:


• Explain the following: “A data model is used to hide storage details
and present the users with a conceptual view of the database.”

103

Main Characteristics of the Database


Approach (3)

• Sharing of data and multi-user transaction processing:


• Allowing a set of concurrent users to retrieve from and to update the
database.
• Concurrency control within the DBMS guarantees that each transaction is
correctly executed or aborted
• Recovery subsystem ensures each completed transaction has its effect
permanently recorded in the database, even in the event of failure.
• OLTP (Online Transaction Processing) is a major part of database applications.
This allows hundreds of concurrent transactions to execute per second.

Slide 1- 104

104

52
9/25/24

Advantages of Using the Database


Approach e
b E
ust ON
m h
N T: n eac Controlling redundancy in data storage and in
users
programs
TA lai development and maintenance efforts.
O R xp
P e
IM le to users programs
ab Sharing of data among multiple users.
users

Restricting unauthorized access to data.

Providing persistent storage for program & Question: why is


data Objects
controlling(note: not
“eliminating”!!)
Providing Storage Structures for efficient Query
Processing concurrency important?

105

Advantages of Using the Database


Approache
b E
ust ON
m h
NT: n eac Providing backup and recovery services.
A i
O RT xpla
P e
IM le to Providing multiple interfaces to different classes of users.
ab
Representing complex relationships among data.

Enforcing integrity constraints on the database.

Drawing Inferences and Actions using rules

106

53
9/25/24

Why use a DMBS Software (work in


small groups)??
1 Controlling redundancy in data storage and in
development and maintenance efforts. Providing backup and recovery services.
6
2 Sharing of data among multiple users.
Providing multiple interfaces to different classes
of users. 7
3 Restricting unauthorized access to data.
Representing complex relationships among data. 8

4 Providing persistent storage for program &


data Objects
Enforcing integrity constraints on the database. 9
Drawing Inferences and Actions using rules 10
5 Providing Storage Structures for efficient
Query Processing

107

Advantages of Using the Database


Approach
• Controlling redundancy in data storage and in development and
maintenance efforts.
• Sharing of data among multiple users.
• Restricting unauthorized access to data. Only the DBA staff uses
privileged commands and facilities.
• Providing persistent storage for program Objects
• E.g., Object-oriented DBMSs make program objects persistent– see Chapter 12.
• Providing Storage Structures (e.g., indexes) for efficient Retrieval and
Query Processing – see Chapter 17.

Slide 1- 108

108

54
9/25/24

Advantages of Using the Database


Approach (continued)
• Providing optimization of queries for efficient processing.
• Providing backup and recovery services.
• Providing multiple interfaces to different classes of users.
• Representing complex relationships among data.
• Enforcing integrity constraints on the database.
• Drawing inferences and actions from the stored data using deductive
and active rules and triggers.

Slide 1- 109

109

Problems with the Traditional File


Environment
• Files maintained separately by different departments
• Data redundancy
• Data inconsistency
• Program-data dependence
• Lack of flexibility
• Poor security
• Lack of data sharing and availability

110

110

55
9/25/24

QUIZ 1
• Likely on Week 4, tbc

111

Database Users
Based on Chapter 1 of the text by Elmasri and Navathe (7th ed)

112

56
9/25/24

Database Users

Users may be divided into


those who actually use and control the content
(called “Actors on the Scene”)
and
those who enable DBMS software to be designed and
implemented (called “Workers Behind the Scene”). Se
lf-
stu
dy

Slide 1-113

113

Database Users
Actors behind the scene
• DBMS designers and implementors who
design and implement the DBMS modules and
interfaces as a software package.

• Operators are responsible for the actual running


ow
and maintenance of the hardware and software
t kn
s
mu les
environment for the database system.
T:
TAN in ro
POR nces
IM fere
dif

114

57
9/25/24

Database Users
Actors on the scene
• Database administrators: responsible for
authorizing access to the database, for
coordinating and monitoring its use, acquiring
software, and hardware resources, controlling its
use and monitoring efficiency of operations. w
k no
ust s
• End-users: they use the data for queries, reports
and some of them actually update the database m
content. N T: role
A
O RT es in
P c
IM feren
dif

115

Skip?

Categories of End-users
• Naïve or Parametric: they make up a large
section of the end-user population. They use
previously well-defined functions in the form of
“canned transactions” against the database.
Examples are bank-tellers, reservation or order
entry clerks who do this activity for an entire
shift of operations.
• Sophisticated: these include business analysts,
data scientists, engineers, others thoroughly Se
lf-
stu
familiar with the system capabilities. Many use dy
tools in the form of software packages that work
closely with the stored database.

116

58
9/25/24

Skip?

Categories of End-users

• Stand-alone: mostly maintain personal


databases using ready-to-use packaged
applications. An example is a tax program user
or a cashier that creates his or her own internal
database. ber rs”??
mem orke
r e w
you edge
Do owl
“kn Se
l f-s
tu
dy

117

Extending Database Capabilities

• New functionality is being added to DBMSs in


the following areas:
• Image Storage and Management
• Audio and Video data management
• Spatial data management
• Time Series and Historical Data Management
• Data Mining & knowledge discovery. Can you think of
examples??
.

118

118

59
9/25/24

Types of Databases and Database


Applications (just a sample!)
• Numeric and Textual Databases

• Multimedia Databases

• Geographic Information Systems (GIS)

• Data Warehouses

• Real-time and Active Databases

119

119

Structured Data

120

120

60
9/25/24

Big Data
What exactly is big data?

• Large volume of unstructured data from different


sources.
• Freedom of the native format.
• Faster accumulation rates.
• Data Lake storage.
• Requires data science expertise to use effectively.
• Specialized tools.
121

121

The Vs of Big data


1. Volume
2. Velocity
3. Variability
4. Variety
5. Veracity
6. Value

122

122

61
9/25/24

Predictive Data Analytics

Se
lf-
stu
dy

Moving from Data to Insight to Decisions (image credit: Kelleher at al. 2e)

123

123

Using machine learning to induce a


prediction model from a training dataset.

y
t ud
l f-s
Se

(Image credit: Kelleher at al., 2020, 2e) 124

124

62
9/25/24

©Prof. Dr. G. H. Galal-Edeen 125

125

63

You might also like