0% found this document useful (0 votes)

25 views6 pages

Chapter 2

A data warehouse is a centralized system for storing and managing large volumes of structured and unstructured data from various sources, designed to support business decision-making through historical data analysis. It categorizes data into structured and unstructured types, and further into quantitative and categorical variables, with analysis types including univariate, bivariate, and multivariate data. Additionally, it discusses measurement scales such as nominal, ordinal, interval, and ratio, each with specific properties for data classification.

Uploaded by

iqra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views6 pages

Chapter 2

Uploaded by

iqra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Data Warehouse:

A data warehouse is a centralized system used for storing and managing large volumes of data
from various sources. It is designed to help businesses analyze historical data and make informed
decisions. Data from different operational systems is collected, cleaned, and stored in a structured
way, enabling efficient querying and reporting.
A data warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data
in support of management's decision making process.
Subject-Oriented: A data warehouse can be used to analyze a particular subject area. For example,
"sales" can be a particular subject.
Integrated: A data warehouse integrates data from multiple data sources. For example, source A
and source B may have different ways of identifying a product, but in a data warehouse, there will
be only a single way of identifying a product.
Time-Variant: Historical data is kept in a data warehouse. For example, one can retrieve data
from 3 months, 6 months, 12 months, or even older data from a data warehouse.For example, a
transaction system may hold the most recent address of a customer, where a data warehouse can
hold all addresses associated with a customer.
Non-volatile: Once data is in the data warehouse, it will not change. So, historical data in a data
warehouse should never be altered.

Why data is important ?

 Data helps in make better decisions.
 Data helps in solve problems by finding the reason for underperformance.
 Data helps one to evaluate the performance.
 Data helps one improve processes.

REPRESENTATION OF RAW DATA

Categories of Data
 Data can be catogeries into two main parts –
 Structured Data: This type of data is organized data into specific format, making it easy to
search , analyze and process. Structured data is found in a relational databases that includes
information like numbers, data and categories.
 UnStructured Data: Unstructured data does not conform to a specific structure or format. It
may include some text documents , images, videos, and other data that is not easily organized
or analyzed without additional processing.

Quantitative Variable

Numerical Data: Numerical data can further be classified into two categories:
 Discrete Data: Discrete data contains the data which have discrete numerical values for
example Number of Children, age,etc.
 Continuous Data: Continuous data contains the data which have continuous numerical
values for example Weight, Voltage, height, etc.
Numeric values include real-value variables or integer variables such as age, speed, or length. A
feature with numeric values has two important properties: its values have an order relation (2 < 5
and 5 < 7) and a distance relation (d(2.3, 4.2) = 1.9).

2. Categorical Data: In categorical data we see the data which have a defined category, for
example:
 Marital Status
 Political Party
 Eye colour
 Country of Citizenship

Categorical (often called symbolic) variables have neither of these two relations. The two values
of a categorical variable can be either equal or not equal: they only support an equality relation
(Blue = Blue or Red Black).
A categorical variable with two values can be converted, in principle, to a numeric binary variable
with two values: 0 or 1.A categorical variable with N values can be converted into N binary
numeric variables, namely, one binary variable for each categorical value. These coded categorical
variables are known as “dummy variables” in statistics. For example, if the variable eye color has
four values, namely, black, blue, green, and brown, they can be coded with four binary digits.

Univariate data:
Univariate data refers to a type of data in which each observation or data point corresponds to a
single variable. In other words, it involves the measurement or observation of a single
characteristic or attribute for each individual or item in the dataset.

Analyzing univariate data is the simplest form of analysis in statistics.

Heights (in cm) 164 167.3 170 174.2 178 180 186
Suppose that the heights of seven students in a class is recorded (above table). There is only one
variable, which is height, and it is not dealing with any cause or relationship.
Bivariate data
Bivariate data involves two different variables, and the analysis of this type of data focuses on
understanding the relationship or association between these two variables. Example of bivariate
data can be temperature and ice cream sales in summer season.
Temperature Ice Cream Sales

20 2000

25 2500

35 5000

Suppose the temperature and ice cream sales are the two variables of a bivariate data. Here, the
relationship is visible from the table that temperature and sales are directly proportional to each
other and thus related because as the temperature increases, the sales also increase.
Multivariate data
Multivariate data refers to datasets where each observation or sample point consists of multiple
variables or features. These variables can represent different aspects, characteristics, or
measurements related to the observed phenomenon. When dealing with three or more variables,
the data is specifically categorized as multivariate.
Example of this type of data is suppose an advertiser wants to compare the popularity of four
advertisements on a website.
Advertisement Gender Click rate

Ad1 Male 80

Ad3 Female 55

Ad2 Female 123

Ad1 Male 66

Ad3 Male 35

The click rates could be measured for both men and women and relationships between variables
can then be examined. It is similar to bivariate but contains more than one dependent variable.

Univariate Bivariate Multivariate

It only summarize single It only summarize two It only summarize more than
variable at a time. variables 2 variables.
It is similar to bivariate but it
It does not contain any It does contain only one
contains more than 2
dependent variable. dependent variable.
variables.

Time-Dependent Data
Time-dependent data, also known as temporal data, is data that changes over time or has a specific
time reference. For example, a customer's address, a product's price, or a stock's value are all
temporal data.

A special class of discrete variables is periodic variables. A periodic variable is a feature for
which the distance relation exists but there is no fixed order relation. Examples are days of the
week, days of the month, or year. Monday and Tuesday, as the values of a feature, are closer than
Monday and Thursday, but Monday can come before or after Friday.
Scale of Measurement
A scale is a device or an object used to measure or quantify any event or another object.The
variables or numbers are defined and categorised using different scales of measurements.
Each level of measurement scale has specific properties that determine the various use of
statistical analysis.

Nominal Scale
A nominal scale is the 1st level of measurement scale in which the numbers serve as “tags” or
“labels” to classify or identify the objects. A nominal scale usually deals with the non-numeric
variables.A nominal scale is an orderless scale, which uses different symbols, characters, and
numbers to represent the different states (values) of the variable being measured. These values can
be coded alphabetically as A, B, and C or numerically as 1, 2, or 3.

Example:

Some of the situations where nominal measurement scale can be used are given below:

 Study to find the country of birth of people in a town

 In collecting data on the eye color of people
 Classifying people into categories like male/female, working-class population/unemployed,
vaccinated/unvaccinated people, etc.
 Gender
 Marital Status
 College Major

Some of the properties of the nominal scale of measurement are given below:
 It can categorize variables but does not put them in any order(No Ranking).
 It does not show any numerical value.
 It is used for qualitative data.
Ordinal Scale
The ordinal scale is the 2nd level of measurement that reports the ordering and ranking of data
without establishing the degree of variation between them. Ordinal represents the “order.” Ordinal
data is known as qualitative data or categorical data. It can be grouped, named and also ranked.

 AGE (with values young, middle-aged, and old)

 INCOME (with values low, middle-class, upper-middle-class, and rich).

Interval Scale
The interval scale is the 3rd level of the measurement scale. It is defined as a quantitative
measurement scale in which the difference between the two variables is meaningful.
The zero point in the interval scale is placed arbitrarily (not a true meaningful zero point), and
thus it does not indicate the complete absence of whatever is being measured.
Example
The classic example of an interval scale is Celsius temperature because the difference between
each value is the same. For example, the difference between 60 and 50 degrees is a measurable
10 degrees, as is the difference between 80 and 70 degrees.
The best example of the interval scale is the temperature scale, where 0 F does not mean a total
absence of temperature.

Ratio Scale

The ratio scale is the most comprehensive scale among others. It includes the properties of all the
above three scales of measurement. The unique feature of the ratio scale of measurement is that it
considers the absolute value of zero, which was not the case in the interval scale. When we measure
the height of the people, 0 inches or 0 cm means that the person does not exist.

Examples: Quantities such as height, length, and salary, Expense uses this type of scale.

Continue….

Nestle Milk Pak Supply Chain Report
No ratings yet
Nestle Milk Pak Supply Chain Report
18 pages
Distribution Management Module 1
No ratings yet
Distribution Management Module 1
40 pages
Logistics Cost Management
100% (4)
Logistics Cost Management
21 pages
1521642669starting Online Business Course PDF
No ratings yet
1521642669starting Online Business Course PDF
13 pages
Basic Concepts and Foundations of Quantitative Research
No ratings yet
Basic Concepts and Foundations of Quantitative Research
18 pages
Use of Simulation in Manufacturing and Logistics Systems Planning
No ratings yet
Use of Simulation in Manufacturing and Logistics Systems Planning
24 pages
Sacramento's Zoning Code Parking Update
No ratings yet
Sacramento's Zoning Code Parking Update
106 pages
Star Fleet Command Manual - Volume XVI
100% (4)
Star Fleet Command Manual - Volume XVI
292 pages
October 2012
No ratings yet
October 2012
64 pages
Unit II Notes
No ratings yet
Unit II Notes
38 pages
Types of Data in Statistics
100% (1)
Types of Data in Statistics
13 pages
Santiago Queirolo Wine Industry
No ratings yet
Santiago Queirolo Wine Industry
32 pages
Gold Plus SIP-final 2
No ratings yet
Gold Plus SIP-final 2
57 pages
Week 1-4 Statistics Notes
No ratings yet
Week 1-4 Statistics Notes
91 pages
Chp.6 Warehousing Logistics Bms
No ratings yet
Chp.6 Warehousing Logistics Bms
31 pages
Math 540 Week 11 Final Exam
No ratings yet
Math 540 Week 11 Final Exam
9 pages
Cargo Storage and Warehousing
100% (3)
Cargo Storage and Warehousing
27 pages
Government of Maharashtra
No ratings yet
Government of Maharashtra
39 pages
Operations Project
No ratings yet
Operations Project
69 pages
Handling Unit For WM
No ratings yet
Handling Unit For WM
21 pages
19 KPIs
No ratings yet
19 KPIs
54 pages
4cm1 01 Rms 20240125
No ratings yet
4cm1 01 Rms 20240125
22 pages
Chapter 20 Manufacturing of Consumer Electronic Appliances in Indonesia PDF
No ratings yet
Chapter 20 Manufacturing of Consumer Electronic Appliances in Indonesia PDF
24 pages
Manhattan WMS Training
No ratings yet
Manhattan WMS Training
21 pages
Barilla Spa PDF
No ratings yet
Barilla Spa PDF
8 pages
Prospectus: FOR Short Term Educational Courses
No ratings yet
Prospectus: FOR Short Term Educational Courses
15 pages
Lecture 1
No ratings yet
Lecture 1
51 pages
Statistics 4
No ratings yet
Statistics 4
112 pages
Regression
No ratings yet
Regression
82 pages
Dummy Scales
No ratings yet
Dummy Scales
3 pages
Executive Summary
No ratings yet
Executive Summary
20 pages
Accounting Packages and Systems-Assignment-Spring Term
No ratings yet
Accounting Packages and Systems-Assignment-Spring Term
18 pages
BST Sample Paper
No ratings yet
BST Sample Paper
12 pages
Ahmed Talaat Mohamed Mahmoud: Personal Data
No ratings yet
Ahmed Talaat Mohamed Mahmoud: Personal Data
3 pages
Untitled 3
No ratings yet
Untitled 3
5 pages
Logistic Manager Duties and Responsibilities
No ratings yet
Logistic Manager Duties and Responsibilities
1 page
Data Is A Valuable Asset
No ratings yet
Data Is A Valuable Asset
5 pages
Lectures and Notes MATH 212 (Part 1)
No ratings yet
Lectures and Notes MATH 212 (Part 1)
8 pages
Week 2
No ratings yet
Week 2
30 pages
Lesson 1 - Basic Statistical Concepts
No ratings yet
Lesson 1 - Basic Statistical Concepts
4 pages
Dav Theory
No ratings yet
Dav Theory
111 pages
OPMT 2275 Winter 2121 Scenarios
No ratings yet
OPMT 2275 Winter 2121 Scenarios
4 pages
MS 14L2 Levels of Measurement
100% (1)
MS 14L2 Levels of Measurement
32 pages
Unit 2.1 Different Measurement Scales
No ratings yet
Unit 2.1 Different Measurement Scales
46 pages
Probability and Statistics 1 Session 1 - 3: Instructor: Prof. Deepika Jain E-Mail Id: Deepika - Jain@iimrohtak - Ac.in
No ratings yet
Probability and Statistics 1 Session 1 - 3: Instructor: Prof. Deepika Jain E-Mail Id: Deepika - Jain@iimrohtak - Ac.in
180 pages
Class 2
No ratings yet
Class 2
5 pages
1 Introduction To Statistics - 241108 - 104334
No ratings yet
1 Introduction To Statistics - 241108 - 104334
11 pages
Types of Data
No ratings yet
Types of Data
34 pages
9 Correlation
No ratings yet
9 Correlation
123 pages
STATISTICS NOTES and Others
No ratings yet
STATISTICS NOTES and Others
69 pages
RES1N Prefinal Module 4
No ratings yet
RES1N Prefinal Module 4
3 pages
WINSEM2024-25 MCSE615L TH VL2024250502897 2025-01-07 Reference-Material-I
No ratings yet
WINSEM2024-25 MCSE615L TH VL2024250502897 2025-01-07 Reference-Material-I
50 pages
Statistics For-Computing 1
No ratings yet
Statistics For-Computing 1
36 pages
Nominal, Ordinal, Scale Variable
No ratings yet
Nominal, Ordinal, Scale Variable
14 pages
Warehouse Resume 1
No ratings yet
Warehouse Resume 1
2 pages
Internship Report
No ratings yet
Internship Report
13 pages
Types of Data & The Scales of Measurement - UNSW Online
No ratings yet
Types of Data & The Scales of Measurement - UNSW Online
16 pages
Fundamentals of Data Science and Analytics On Descriptive Analysis
No ratings yet
Fundamentals of Data Science and Analytics On Descriptive Analysis
53 pages
Chapter 1 Classification and Graphical Presentation (Becon 2025)
No ratings yet
Chapter 1 Classification and Graphical Presentation (Becon 2025)
67 pages
Statistics
No ratings yet
Statistics
88 pages
Statistics
No ratings yet
Statistics
4 pages
One Way+tables 2
No ratings yet
One Way+tables 2
5 pages
1 Measurement Scales
No ratings yet
1 Measurement Scales
4 pages
Week 10 Data Processing and Management
No ratings yet
Week 10 Data Processing and Management
17 pages
UNIT-I - Data Categorization-by-Dr - SKY
No ratings yet
UNIT-I - Data Categorization-by-Dr - SKY
22 pages
Business Analytics (Tanya Pandey) Mba M3a
No ratings yet
Business Analytics (Tanya Pandey) Mba M3a
64 pages
Describing Data 1
No ratings yet
Describing Data 1
29 pages
Lecture1 Olive's File
No ratings yet
Lecture1 Olive's File
54 pages
Unit-2 Ids
No ratings yet
Unit-2 Ids
64 pages
Notes of Statisitcs
No ratings yet
Notes of Statisitcs
30 pages
Unit One Graphing and Descriptive Statis-1
No ratings yet
Unit One Graphing and Descriptive Statis-1
12 pages
Statistics Intro
No ratings yet
Statistics Intro
7 pages
BioStats CIA1
No ratings yet
BioStats CIA1
10 pages
Assignment 8614
No ratings yet
Assignment 8614
19 pages
EBA2123 1.data and Statistics
No ratings yet
EBA2123 1.data and Statistics
36 pages
(Buiness Statistics) Chapter 1 2
No ratings yet
(Buiness Statistics) Chapter 1 2
33 pages
8614
No ratings yet
8614
12 pages
Business Statistics Note
No ratings yet
Business Statistics Note
15 pages
Slide BMG106 BMG106 Slide 01
No ratings yet
Slide BMG106 BMG106 Slide 01
25 pages
Introduction To Quantitative Methods
No ratings yet
Introduction To Quantitative Methods
5 pages
Module 01 Introduction To Business Statistics
No ratings yet
Module 01 Introduction To Business Statistics
16 pages
Notes (Chapter 1 - 3)
No ratings yet
Notes (Chapter 1 - 3)
15 pages
SBE - 11e ch01
No ratings yet
SBE - 11e ch01
36 pages
Scales of Data
No ratings yet
Scales of Data
6 pages
Basic Ideas of Data Management
No ratings yet
Basic Ideas of Data Management
32 pages
Types of Data & The Scales of Measurement: Data at The Highest Level: Qualitative and Quantitative
No ratings yet
Types of Data & The Scales of Measurement: Data at The Highest Level: Qualitative and Quantitative
7 pages
What Is Statistics
No ratings yet
What Is Statistics
6 pages
CHP1 Mat161
No ratings yet
CHP1 Mat161
4 pages
Report Stat
No ratings yet
Report Stat
21 pages
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet

Chapter 2

Uploaded by

Chapter 2

Uploaded by

Data Warehouse:

Why data is important ?

REPRESENTATION OF RAW DATA

Analyzing univariate data is the simplest form of analysis in statistics.

Ad2 Female 123

Univariate Bivariate Multivariate

 Study to find the country of birth of people in a town

 AGE (with values young, middle-aged, and old)

You might also like