2035 CH1 Notes
2035 CH1 Notes
Organizing Data
• raw data does not tell us anything
• the data may first need to be organize in a
table
Summarizing Data
• data organized in a table may not tell us
anything
• best to display the data in a graph
(chapter 2)
• data can then be further summarized with
specific numerical values such as the
average or a number representing how
spread out the data are (chapter 3)
Analysis of the Data and Interpreting Your
Results
• Analyze the data using the appropriate
statistical method(s)
• How confident are you in the numerical
values you have calculated?
• What do they mean?
• Chapters 8, 9, 10, 11, 12, 13, 14 and 16
discuss these questions in more detail
What is a population?
Examples
You are usually interested in studying a
certain characteristic of the population
Examples
Two Types of Variables
Age Gender
Weight Do you smoke
Height In what year at
Western are you
What cell phone How far is your
brand do you have parents’ home
from Western
How many text Do you currently
messages do you have a job where
send in a week you work 10 or
(approximately) more hours/week
If we examine every unit of the population
(for the variable of interest), we say we are
conducting a census of the population
Examples:
μ = mean or average value of the population
σ 2 = measure of the spread of a population
Statistics
A descriptive measure of a sample is called
a statistic
• Statistics are usually denoted by
Roman letters
Examples:
x = mean or average value of the sample
s 2 = measure of the spread of a sample
In Statistical Inference
As mentioned earlier, we are often interested
in using a sample to draw conclusions (make
inferences) about the population from which
it was drawn
1. Qualitative Variables
There are two levels of measurements:
a. Nominal Level
o Interval level
o Ratio level
a. Interval Level
1 2 3 4 5 6 7
−1 0 1 2 3 4 5
Some background:
• India is the 2nd largest country in the world
with a population > 1 billion
• 75% of the population live in rural areas
• Yet the rural market accounts for only 1/3
of the total national sales
• Since the 1990’s, India’s rural market has
become more open for trade in consumer
goods
• This was an untapped market at the time,
offering potential for large companies to
enter the Indian market
Some Data – Rural India
Income Level
Literacy Rates
1990 1994
Toothpaste 8,825 metric 17,023 metric tons
tons
Laundry Soap 272,540 mts 422,741 mts
Bathroom 158,919 mts 231,084 mts
Soap
Shampoo 497,000 litres 2,116,000 litres
Velocity
This refers to the speed with which
the data are available and can be
processed
Veracity
This has to do with data quality, correctness,
and accuracy
Volume
This has to do with the ever-increasing size
of data and databases
Value
This is sometimes considered a fifth
characteristic; data that does not generate
value makes no contribution to an
organization
Business Analytics (section 1.4)
FIGURE 1.6
Business Analytics Add Value to Data
Categories of Business Analytics
1. Descriptive Analytics
• takes traditional data and describes what
has or is happening in a business
o Used to discover hidden
relationships and patterns
o Simplest and most commonly used
category
o Data visualization is key
o Also called reporting analytics
Data Mining
This is the collecting, exploring, and
analyzing of large volumes of data to
uncover hidden patterns to enhance
decision-making
• Used by companies to turn raw data into
useful information
Example
Here is some recent data of the top five
manufacturing firms to receive Canadian
Government funding
Bar Chart
Bubble Chart