BUSINESSmaths MMS
BUSINESSmaths MMS
BUSINESSmaths MMS
Sinimole K.R.
Definition
Competitive Edge
e.g., for you in the job market!
Activities of Statistics
1. Designing the study:
First step
Plan for data-gathering
Sampling
4. Estimating an unknown:
Best “guess” based on data
Wrong - by how much?
Confidence interval - “we’re 95% sure that
the unknown is between …”
Activities of Statistics
(continued)
5. Hypothesis testing:
Data decide between two possibilities
Does “it” really work? [or is “it” just randomly
better?]
Is financial statement correct? [or is error
material?]
Data Mining
Search for patterns in large data sets
Businesses data: marketing, finance, production ..
Collected for some purpose, often useful for others
From government or private companies
Makes use of
Statistics – all the basic activities, and
Prediction, classification, clustering
&
Inferential Statistics
Descriptive helps us to Inferential help us to
Single Numerical
Summaries
of a batch
questionnaires
interviews
focus group interviews
observation
case-studies
diaries
critical incidents
portfolios.
What is 'interpreting
data'?
Data means information. So interpreting data just
means
working out what information is telling you.
Information is sometimes shown in tables, charts
and
graphs to make the information easier to read.
It is important to read all the different parts of the
table,
chart or graph.
PRESENTATION OF DATA
Classification of Data
Geographical
area-wise, e.g. cities, districts, etc.
Chronological
on the basis of time.
Qualitative
according to some attributes.
Quantitative
in terms of Magnitudes
classification
Classification is the process of arranging
things in groups or classes according to their
resemblance and affinities.
It serves the following purposes
1. eliminating unnecessary details
2. brings out clearly the point of similarity
3. allows the comparisons & drawing of
inferences
Cont..
The characteristic of a population may be
broadly divided into two categories variables
(quantitative) & attributes (qualitative)
Cont..
Attributes – not capable of being described
numerically eg. Sex, nationality, color of eye,
etc,
When classified on the basis of attribute,
groups are differentiated either by the
absence or presence of the attribute or by its
differing qualiites.
Cont..
Variables- numerically described
Eg. Height, weight, turnover, age, export etc..
Quantitative data
The data which assumes numerical values are called
quantitative data.
Qualitative data
The data representing class to which a unit
belongs with respect to a qualitative character
(reli 뉧
For example,
Consider the following for a sample of ten
students chosen from a class of hundred students.
1.Their ages in completed years are
17,19,18,19,21,20,18,18,19,20.
2. the sample included boys and girls in the following
order: B G B B B G B G B.
The age values in (1) are quantitative in nature.
Whereas sex codes B,G in case (2) represent qualitative
data.
Variables
May be
Continuous or
Discrete
Discrete and Continuous
Data.
Discrete data
If the data values are integer in nature, such data
values are called discrete.
Examples
1,2,3,4,5,6.
Nominal &Ordinal
Continuous Data.
If the data values are real in nature (any
numerical with in a certain range), such data
values are called continuous.
Examples
12.5, 13.6, 14.7 ,16.8, 17.1.
Variables like age, height, weight, rainfall records
always lead to continuous data
Interval or ratio
Data types
Qualitative / Quantitative/
categorical Numerical
As you can see, using groups of five makes it easier to see the total!
•Frequency tables show the totals of the tally marks.
Grouping data
When there are a large number of possible
outcomes, we will usually need to group the
data. To do this, we first need to decide on our
classes.
Cont…
Decide on the type and number of classes for
dividing the data.
by quantitative or qualitative measure
The range must be divided by equal classes
The width of the interval from the beginning of
one class to the beginning of the next class
must be the same for every class.
Width of a class interval
width of the class intervals =
next unit value after largest value in data - smallest value in data
total number of class intervals
Step 2 &3
Sort the data points into classes and count
the number of points in each class
Illustrate the data in a chart.
Class intervals
Class limits
The lowest an the highest value that can be included. 20-
30
Class intervals
The difference between upper and lower limit.
Class frequency
The number of observations corresponding to the
particular class
Class mid point
Value lying the half-way between lower & upper limit.
Exclusive & Inclusive
method
Upper limit of one class is the lower limit of
the next class
20-30 30-40 etc.
Upper limit of one class is included in that
class itself
20-29 30-39 etc.
General rules
The number of classes should preferably be
between 5 & 15
K= 1 + 3.322 log N
Where k= the approximate number of classes
N= total number of observation
Whenever possible all classes should be of the
same size.
If not possible form open end classes
Example
Imagine that you are carrying out a survey to
determine the number of CDs bought by your
classmates in the past year. The possible
answers are likely to range from 0 to 50
(about one a week!), so you might draw a tally
chart similar to the one below:
Number of CDs Frequency
0- 4 8
5-9 5
10 - 14 7
15 - 19 4
20 - 24 1
25 - 29
30 - 34 1
35 - 39
40 - 44
45 - 49 2
more than 49
Tables
A table is a systematic arrangement of
statistical data in columns and rows.
Parts of a table
Table number Body of the table
Title of the table Head note
Caption Foot note
Stub
Table……….
Caption ….
Column headings. It explains what the column
represents
Stub ….
Row headings
Head notes ….
Brief explanatory statement
Placed below the title enclosed in brackets.
Bar charts
Bar charts are one way of showing the
information from a frequency table
Types of bar Diagrams
Simple bar diagrams
Subdivided bar diagrams
Multiple bar diagrams
Percentage bar diagrams
Deviation bars
Broken bars
Simple bar diagrams
Only one variable.
2003-04
2002-03
years
2001-02
2000-01
1999-00
2000
1000
0
1999-00 2000-01 2001-02 2002-03 2003-04
Series3 283.4 473.8 402.8 536.6 686.8
Series2 365.3 484.7 668.6 992.8 1067.4
Series1 233.8 301.8 303.2 365.3 416.4
years
Multiple bar diagrams
Corporate secotr profits gross profits
profit beofre tax
profit after tax
1800 retained profits
1663
1600
1376
1400 1219
1200
982
1000 846
800
589
600
400
200
1
0
2002-03 2003-04
Percentage bar diagrams
61%
29%
14% 11% 17%
1 2 3
-74%
sales netprofits
Pie charts
Pie charts are circles divided into segments,
where each segment represents a fraction of
the total amount.
Pie charts
Pie charts are circles divided into segments,
where each segment represents a fraction of
the total amount.
Pie charts…
This pie chart shows the 20 bikes sold at the
bike shop.
The segment for Mountain bikes is one half of
the chart. This is
because 10 Mountain bikes were sold, which
is exactly half the
number of bikes sold in total (20 bikes).
Histograms
A histogram is a series of rectangles, each
proportional in width to the range of values
within a class and proportional in height to the
number of items falling in the class.
If the classes are of equal width, then the
vertical bars in the histogram are also of equal
width.
Histograms
are similar to bar charts apart from the
consideration of areas. In a bar chart, all of
the bars are the same width and the only
thing that matters is the height of the bar. In a
histogram, the area is the important thing.
Example of a Histogram
Histogram
Class Freq.
Count 15 but < 25 3
5 25 but < 35 5
35 but < 45 2
Frequency 4
3
Relative
2 Bars
Frequency
Touch
1
Percent
0
0 15 25 35 45 55
Lower Boundary
Example
75%
Class Cum. %
50% 15 but < 25 0%
25 but < 35 30%
35 but < 45 80%
25% 45 but < 55 100%
0%
0 15 25 35 45 55
Lower Boundary
Errors in Presenting
Data
1. Using ‘Chart Junk’
2. No Relative Basis
in Comparing Data
Batches
3. Compressing the
Vertical Axis
4. No Zero Point on
the Vertical Axis
‘Chart Junk’
Bad Presentation Good Presentation
Minimum Wage Minimum Wage
1960: $1.00 $
4
1970: $1.60
2
1980: $3.10
0
1990: $3.80 1960 1970 1980 1990
No Relative Basis
Bad Presentation Good Presentation
A’s by Class A’s by Class
Freq. %
300 30%
200 20%
100 10%
0 0%
FR SO JR SR FR SO JR SR
Compressing Vertical
Axis
Bad Presentation Good Presentation
Quarterly Sales
$ $ Quarterly Sales
200 50
100 25
0 0
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4
No Zero Point on
Vertical Axis
Bad Presentation Good Presentation
Monthly Sales Monthly Sales
$ $
45 60
42 40
39 20
36 0
J M M J S N J M M J S N
THANK YOU