B.S 1

Business statistics help analyze data to support decision making. Descriptive statistics summarize key aspects of sample data through measures like the mean, median, range, and standard deviation. Inferential statistics make predictions about populations based on samples using hypothesis testing and parameter estimation. Descriptive analysis of flight departure delay data included a frequency distribution showing delays in intervals and measures of central tendency, variability, skewness, and kurtosis.

Uploaded by

Ketan Nanda

Business Statistics

Why statistics?
• Decision making is often based on
analysis of data.
• Statistics helps you to make sense of the
data by using tools that summarize,
present and analyze the data.
• The decision maker can also gauge how much
confidence to place in a decision.
Examples
• How many newspapers should the vendor stock
to maximize revenue?
– Depends on the probability distribution of demand and
expected profit
• Are two or more market segments significantly
different?
– Hypothesis testing
• What proportion of people are happy with the
Sixth Pay Commission report?
– Parameter estimation
Sample vs. Population
• Population is the entire group/collection of
individuals/objects/things that we want
information about.
• Sample is part of the population that we actually
examine to gather information.
• Example
– We wish to find the average dividend percentage of
all companies traded at NSE.
• All stocks traded at NSE comprise the population
• The 10% of the stocks selected for gathering information form the
sample
Subdivision within Statistics

Descriptive Statistics
• Collect
• Organize
• Summarize
• Display
• Analyze

Inferential Statistics
• Predict and forecast values of population parameters
• Test hypotheses about values of population parameters
• Make decisions
Descriptive statistics
- data and frequency distribution
• The following are the departure delays, in minutes, of 42 flights
selected at random from a particular airport.
10 12 45
13 8 40
13 0 0
20 45 0
95 38 67
4 47 55
0 56 5
45 50 27
50 15 26
34 12 25
48 40 25
50 42 48
53 44 23
56 46 22
Frequency Distribution
• Table with two columns listing:
– Each and every group or class or interval of values
– Associated frequency of each group
• Number of observations assigned to each group
• Sum of frequencies is number of observations
• Class midpoint is the middle value of a group or class or interval
• Relative frequency is the percentage/proportion of total observations in each class
– Sum of relative frequencies = 1
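Frequency distribution

Delay in minutes    Frequency    Relative frequency
0–15                   12           0.286
15–30                   8           0.190
30–45                   6           0.143
45–60                  14           0.333
60 or more              2           0.048
Total                  42           1.000

Each class includes its lower limit and excludes its upper limit (a delay of exactly 15 minutes is counted in 15–30).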
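The table can be rebuilt from the raw delays. The following is a minimal sketch, assuming each class includes its lower limit and excludes its upper limit, which is the convention that reproduces the counts shown; the function name `frequency_distribution` is illustrative and not from the slides.

```python
# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]

# Lower limits of the classes; the last class ("60 or more") is open-ended.
edges = [0, 15, 30, 45, 60]
labels = ["0-15", "15-30", "30-45", "45-60", "60 or more"]


def frequency_distribution(values, edges, labels):
    """Return (label, frequency, relative frequency) for each class."""
    counts = [0] * len(labels)
    for v in values:
        # The class is the one whose lower limit is the largest edge <= v.
        idx = max(i for i, e in enumerate(edges) if v >= e)
        counts[idx] += 1
    n = len(values)
    return [(lab, c, c / n) for lab, c in zip(labels, counts)]


table = frequency_distribution(delays, edges, labels)
for label, freq, rel in table:
    print(f"{label:<12}{freq:>4}{rel:>8.3f}")
```

The frequencies sum to the number of observations (42) and the relative frequencies sum to 1, as the definitions require.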
Frequency distribution - histogram
[Figure: histogram of frequency (vertical axis, 0 to 16) against delay in minutes (horizontal axis) for the classes 0–15, 15–30, 30–45, 45–60 and 60 or more]
Two variable frequency distribution - cross tabulation

Delay in minutes    0–15    15–30    30–45    45–60    60 or more    Total
Govt.                  5        2        5        9             0       21
Private                7        6        1        5             2       21
Total                 12        8        6       14             2       42

A joint frequency distribution of two variables (e.g. ownership of airline, delay in minutes)
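The row and column totals of a cross tabulation are the one-variable frequency distributions of each variable. A small sketch of that relationship, using only the joint counts from the table above (the variable names are illustrative):

```python
# Joint frequencies from the cross tabulation: ownership x delay class.
classes = ["0-15", "15-30", "30-45", "45-60", "60 or more"]
joint = {
    "Govt.": [5, 2, 5, 9, 0],
    "Private": [7, 6, 1, 5, 2],
}

# Row totals: number of flights per ownership type.
row_totals = {owner: sum(counts) for owner, counts in joint.items()}

# Column totals: the frequency distribution of delay alone.
col_totals = [sum(col) for col in zip(*joint.values())]

grand_total = sum(row_totals.values())

print(row_totals)
print(dict(zip(classes, col_totals)))
print(grand_total)
```

The column totals match the single-variable frequency distribution of delays, and both sets of totals add up to the 42 flights.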
Descriptive statistics - measures
• Measures of Location
• Measures of Variability
• Skewness and Kurtosis
• Association between two variables
Measures of Location
• Arithmetic Mean
• Median
• Mode
• Percentiles
• Quartiles
Arithmetic mean
• The mean of a data set is the average of all the data values.
• Sample mean: $\bar{x} = \frac{\sum x_i}{n}$
• Population mean: $\mu = \frac{\sum x_i}{N}$
Mean – example
• Average delay in flight departure

$\bar{x} = 1354/42 = 32.2381$ minutes
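The arithmetic can be checked directly from the 42 delays; a short sketch:

```python
# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]

total = sum(delays)          # sum of all observations
n = len(delays)              # number of observations
mean_delay = total / n       # sample mean

print(total, n, round(mean_delay, 4))  # 1354 42 32.2381
```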

Median
• It is the middle item in a data set that is
arranged in ascending/descending order
• If there are n observations, the median is the
(n+1)/2-th observation.
Computation rule
• If n is odd, (n+1)/2 is an integer, so the median is that observation
• If n is even, use the average of the (n/2)-th and (n/2 + 1)-th
observations
Example
• The 42 sorted observations:
0 0 0 0 4 5 8 10 12 12 13 13 15 20
22 23 25 25 26 27 34 38 40 40 42 44 45 45
45 46 47 48 48 50 50 50 53 55 56 56 67 95
• n = 42 is even, so the median is the average of the 21st and 22nd observations
= (34 + 38)/2
= 36
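The odd/even rule can be written out as a short function. This is a sketch of the rule as stated in the slides; the name `median` is illustrative.

```python
# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]


def median(values):
    """Median by the (n+1)/2 rule, averaging the two middle values if n is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the (n+1)/2-th observation (index mid in 0-based terms).
        return ordered[mid]
    # Even n: average of the (n/2)-th and (n/2 + 1)-th observations.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median(delays))  # 36.0
```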
Mode
• The mode is the most frequently occurring value
– The mode in the example is 0 (it occurs four times)
• The greatest frequency can occur at two
or more different values.
• If the data have exactly two modes, the
data are bimodal.
• If the data have more than two modes, the
data are multimodal.
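Because the greatest frequency can be shared, it helps to return every value that attains it. A minimal sketch using `collections.Counter` from the Python standard library (the helper name `modes` is illustrative):

```python
from collections import Counter

# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]


def modes(values):
    """Return all values that occur with the greatest frequency, sorted."""
    counts = Counter(values)
    top = max(counts.values())
    return sorted(v for v, c in counts.items() if c == top)


print(modes(delays))           # [0]
print(modes([1, 1, 2, 2, 3]))  # [1, 2]  (bimodal)
```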
Percentiles and Quartiles
• Given any set of ordered numerical observations
– The Pth percentile in the ordered set is that value below which lie P% (P percent) of the observations in the set.
– The position of the Pth percentile is given by (n + 1)P/100, where n is the number of observations in the set.
Example
• Calculate the 45th percentile of the airline
delay data
– The position of the 45th percentile is
45 × (42 + 1)/100 = 19.35
– Value of the 45th percentile
= 19th observation + 0.35 × (20th observation − 19th observation)
= 26 + 0.35 × (27 − 26)
= 26.35
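The position rule and the interpolation step can be sketched as follows. The function name `percentile` and the clamping at the two ends of the data (when the position falls before the first or after the last observation) are assumptions added for the sketch, not part of the slides.

```python
# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]


def percentile(values, p):
    """Pth percentile using the (n + 1)P/100 position rule with interpolation."""
    ordered = sorted(values)
    n = len(ordered)
    pos = (n + 1) * p / 100      # 1-based position, possibly fractional
    k = int(pos)                 # whole part: the k-th observation
    frac = pos - k               # fractional part used for interpolation
    if k < 1:                    # assumption: clamp to the smallest value
        return ordered[0]
    if k >= n:                   # assumption: clamp to the largest value
        return ordered[-1]
    lower, upper = ordered[k - 1], ordered[k]
    return lower + frac * (upper - lower)


print(round(percentile(delays, 45), 2))  # 26.35
```

Statistical packages use several slightly different position rules, so their percentiles may not match this rule exactly.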
Quartiles
• Quartiles are special names for particular percentiles
• Q1 = 25th percentile
• Q2 = 50th percentile = median
• Q3 = 75th percentile
Measures of Variability
• Range
• Interquartile Range
• Variance
• Standard Deviation
• Coefficient of Variation
Range
• The range of a data set is the difference
between the largest and smallest data values.
• It is the simplest measure of variability.
• It is very sensitive to the smallest and largest
data values.
• Example from airline delay data
Range = 95 – 0 = 95 minutes
Interquartile range
• The interquartile range of a data set is the
difference between the third quartile and the first
quartile.
• It is the range for the middle 50% of the data.
• It overcomes the sensitivity to extreme data
values.
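As an illustration, the quartiles and interquartile range of the delay data can be computed with the (n + 1)P/100 position rule from the percentile slides. The values below are computed here, not quoted from the slides, and the helper name `percentile` is illustrative.

```python
# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]


def percentile(values, p):
    """Pth percentile using the (n + 1)P/100 position rule with interpolation."""
    ordered = sorted(values)
    n = len(ordered)
    pos = (n + 1) * p / 100
    k = int(pos)
    if k < 1:                    # assumption: clamp at the ends of the data
        return ordered[0]
    if k >= n:
        return ordered[-1]
    return ordered[k - 1] + (pos - k) * (ordered[k] - ordered[k - 1])


q1 = percentile(delays, 25)      # first quartile
q3 = percentile(delays, 75)      # third quartile
iqr = q3 - q1                    # range of the middle 50% of the data

print(q1, q3, iqr)  # 12.75 48.0 35.25
```

The single 95-minute delay stretches the range but leaves the interquartile range unaffected, which is the robustness the slide describes.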
Variance
• The variance is a measure of variability
that utilizes all the data.
• It is based on the difference between the
value of each observation ($x_i$) and the
mean ($\bar{x}$ for a sample, $\mu$ for a population).
• Population variance: $\sigma^2 = \frac{\sum (x_i - \mu)^2}{N}$
• Sample variance: $s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}$
Standard deviation
• The standard deviation of a data set is the
positive square root of the variance.
• It is measured in the same units as the
data, so it is easier than the variance
to compare with the mean.
• If the data set is a sample, the standard
deviation is denoted s.
• If the data set is a population, the standard
deviation is denoted σ (sigma).
Coefficient of Variation
• The coefficient of variation indicates how large the
standard deviation is in relation to the mean.
• If the data set is a sample, the coefficient of variation
is computed as follows:
$\frac{s}{\bar{x}} \times 100$
• If the data set is a population, the coefficient of
variation is computed as follows:
$\frac{\sigma}{\mu} \times 100$
Example
• Variance
= 465.89 minutes squared

• Standard Deviation
= 21.585 minutes

• Coefficient of Variation
= (21.585/32.2381) × 100 = 66.95%
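These three figures can be reproduced from the raw delays with the sample formulas (divisor n − 1). A minimal sketch, with illustrative variable names:

```python
import math

# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]

n = len(delays)
mean = sum(delays) / n

# Sample variance: squared deviations from the mean, divided by n - 1.
sample_var = sum((x - mean) ** 2 for x in delays) / (n - 1)

# Sample standard deviation: positive square root of the variance.
sample_sd = math.sqrt(sample_var)

# Coefficient of variation: standard deviation as a percentage of the mean.
cv = sample_sd / mean * 100

print(round(sample_var, 2), round(sample_sd, 3), round(cv, 2))  # 465.89 21.585 66.95
```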
Skewness
• Skewness characterizes the degree of
asymmetry of a distribution around its
mean
– Positively skewed
– Symmetric or unskewed
– Negatively skewed
[Figures: three example distributions, one negatively skewed, one symmetric and one positively skewed]
Skewness - measure
Skewness of a distribution is measured by
$\gamma_1 = \frac{\sum (X - \mu)^3}{N\sigma^3}$
For a given data set you may use
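One possibility, not necessarily the formula the slide had in mind, is to apply the distribution measure above directly to the data, treating the observed values as the whole population (divisor N for both the third moment and the variance). The function name `skewness` is illustrative, and sample-adjusted versions used by spreadsheet software will give slightly different numbers.

```python
import math

# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]


def skewness(values):
    """Third central moment divided by sigma cubed (population divisor N)."""
    n = len(values)
    mu = sum(values) / n
    sigma = math.sqrt(sum((x - mu) ** 2 for x in values) / n)
    return sum((x - mu) ** 3 for x in values) / (n * sigma ** 3)


print(skewness([1, 2, 3, 4, 5]))   # symmetric data: skewness is zero
print(skewness([0, 0, 0, 10]))     # long right tail: positive skewness
print(skewness(delays))            # the airline delay data
```

A positive value indicates a longer right tail, a negative value a longer left tail, and a value near zero an approximately symmetric distribution.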
Kurtosis
• Kurtosis characterizes the relative
peakedness or flatness of a symmetric
distribution compared to the normal
distribution
– Platykurtic (relatively flat)
– Mesokurtic (normal): not too flat and not too peaked
– Leptokurtic (relatively peaked)
[Figures: three example distributions, one platykurtic, one mesokurtic and one leptokurtic]
Kurtosis - measure
• Kurtosis for a distribution is measured by $\beta_2 - 3$, where
$\beta_2 = \frac{\sum (X - \mu)^4}{N\sigma^4}$
For a given data set you may use
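One possibility, not necessarily the formula the slide had in mind, is to apply the distribution measure above directly to the data, treating the observed values as the whole population (divisor N throughout). The function name `excess_kurtosis` is illustrative, and sample-adjusted versions used by spreadsheet software will give slightly different numbers.

```python
# Departure delays (minutes) of the 42 flights, as listed on the data slide.
delays = [
    10, 12, 45, 13, 8, 40, 13, 0, 0, 20, 45, 0, 95, 38,
    67, 4, 47, 55, 0, 56, 5, 45, 50, 27, 50, 15, 26, 34,
    12, 25, 48, 40, 25, 50, 42, 48, 53, 44, 23, 56, 46, 22,
]


def excess_kurtosis(values):
    """beta_2 - 3, where beta_2 is the fourth central moment over sigma^4."""
    n = len(values)
    mu = sum(values) / n
    var = sum((x - mu) ** 2 for x in values) / n      # sigma squared
    m4 = sum((x - mu) ** 4 for x in values) / n       # fourth central moment
    beta_2 = m4 / var ** 2                            # sigma^4 = (sigma^2)^2
    return beta_2 - 3


print(excess_kurtosis([1, 2, 3, 4, 5]))  # flat, evenly spread data: negative
print(excess_kurtosis(delays))           # the airline delay data
```

Subtracting 3 makes the normal distribution the reference point: negative values suggest a flatter (platykurtic) shape and positive values a more peaked (leptokurtic) one.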
Association between two variables
Delay Passengers Delay Passengers Delay Passengers
53 65 56 51 50 68
40 61 42 50 0 72
46 53 25 57 38 74
0 65 13 57 55 68
22 45 40 54 45 73
5 58 8 54 15 63
44 68 27 65 48 68
12 65 67 57 0 55
12 56 48 62 10 45
25 50 4 50 50 71
13 70 45 61 56 64
50 73 0 59 26 60
45 63 34 63 47 61
23 56 95 49 20 48
Association between two variables
• Scatter plot
• Covariance
• Correlation Coefficient
Scatter Plot
• Scatter Plots are used to identify any
underlying relationships among pairs of
data sets.
• The plot consists of a scatter of points,
each point representing an observation.
Scatter Plot
[Figure: scatter plot "Delay vs Passengers", with passengers on the horizontal axis (0 to 80) and delay on the vertical axis (0 to 100)]
Covariance
• The covariance is a measure of the linear
association between two variables.
• Positive values indicate a positive
relationship.
• Negative values indicate a negative
relationship
Covariance
• If the data sets are samples, the covariance
is denoted by
$s_{xy} = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{n - 1}$ (= 20.42 in the airline example)
• If the data sets are populations, the
covariance is denoted by
$\sigma_{xy} = \frac{\sum (x_i - \mu_x)(y_i - \mu_y)}{N}$
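A minimal sketch of the sample covariance formula follows. For brevity it uses only the first five (delay, passengers) pairs from the table, so its result is an illustration of the calculation and not the full-sample figure quoted above; the function name `sample_covariance` is illustrative.

```python
# First five (delay, passengers) pairs from the table; a subset for illustration.
pairs = [(53, 65), (40, 61), (46, 53), (0, 65), (22, 45)]
delay = [d for d, _ in pairs]
passengers = [p for _, p in pairs]


def sample_covariance(xs, ys):
    """Sum of products of deviations from the means, divided by n - 1."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)


print(sample_covariance(delay, passengers))
print(sample_covariance([1, 2, 3], [2, 4, 6]))   # 2.0, variables move together
print(sample_covariance([1, 2, 3], [6, 4, 2]))   # -2.0, variables move oppositely
```

The covariance of a variable with itself is its sample variance, which is a quick sanity check on any implementation. Its size depends on the units of both variables, which is why the correlation coefficient is usually reported alongside it.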
Correlation Coefficient

• The coefficient can take on values between -1 and +1.

• Values near -1 indicate a strong negative linear relationship.
• Values near +1 indicate a strong positive linear relationship.
• If the data sets are samples, the coefficient is
$r_{xy} = \frac{s_{xy}}{s_x s_y}$ (= 0.121 in the airline example)

• If the data sets are populations, the coefficient is
$\rho_{xy} = \frac{\sigma_{xy}}{\sigma_x \sigma_y}$
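A minimal sketch of the sample correlation coefficient, built from the sample covariance and the two sample standard deviations. For brevity it uses only the first five (delay, passengers) pairs from the table, so its result is an illustration of the calculation and not the full-sample figure quoted above; the function name `sample_correlation` is illustrative.

```python
import math

# First five (delay, passengers) pairs from the table; a subset for illustration.
pairs = [(53, 65), (40, 61), (46, 53), (0, 65), (22, 45)]
delay = [d for d, _ in pairs]
passengers = [p for _, p in pairs]


def sample_correlation(xs, ys):
    """r_xy = s_xy / (s_x * s_y), using n - 1 divisors throughout."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    s_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    s_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    return s_xy / (s_x * s_y)


r = sample_correlation(delay, passengers)
print(r)
print(sample_correlation([1, 2, 3], [2, 4, 6]))   # perfect positive linear relationship
print(sample_correlation([1, 2, 3], [6, 4, 2]))   # perfect negative linear relationship
```

Unlike the covariance, the coefficient is unit-free, so a value such as the 0.121 quoted for the airline example can be read directly as a weak positive linear relationship.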
