0% found this document useful (0 votes)

25 views22 pages

Statistic Part 2

Uploaded by

vedalamuparna

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views22 pages

Statistic Part 2

Uploaded by

vedalamuparna

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

II.

Measure of
Dispersion
Dispersion measures the spread or variability of
data

1. Range
2. Quartiles
Curve
3. Interquartile A

Range
4. Variance Curve
B
5. Standard
Deviation Curve
C

Mean
(A,B,C)
1. Range
• Difference between the highest and the lowest observed values in a
dataset
• Easy to understand and find
• Usefulness as a dispersion measure is limited – only 2 values are
considered
• Heavily influenced by extreme values
• Range values may change from one sample to another
• For open-ended class, there is no range
Values Max Min range
22 90 6 84
49
Example
78
6
78
76
44
90
18
63
49
62
2. Quartiles
• Division of data into 4 segments according to the distribution of Lowest
observation
values Q1
• The width of the four quartiles need not be the same
Q
• Each part contains 25% data S.No Data 2
• Quartiles are the highest values in each of the 4 parts 1
2
10
11 Q

• FormulaQ1 =to[(n+1)/4] value,
calculate the
th lower quartile
quartiles: 3 14 3
Q
 Q2 = [(n+1)/2]nd value, middle 1
4
5
16
17 Q4
quartile 6 18 Highest
7 19 observation
 Q3 = [3(n+1)/4]th value, upper 8 21
quartile value interpretation
quartile 9 21
1st 18.75 25% values are <= 18.75 Q 10 23
quartile 2 11 24
2nd 27.5 50% values are <= 27.5 12 26
quartile 13 29
3rd 35.25 75% values are <= 35.25 14 30
quartile 15 32
16 33
4th 45 100% values are <= 45 Q
Excel
quartile
17 34
3 18 35
calculation
quartile formula
19 36
1st quartile = 20 37
QUARTILES(<range>,1) 21 39
2nd quartile = Q 22 40
QUARTILES(<range>,2) 4 23 42
3rd quartile = 24 45
QUARTILES(<range>,3)
3. Interquartile Range
• Approximately measures how far from the median on either side to include one-half of
data
• IQR is the difference between the values of the first and third quartiles

Interquartile
Range

1st quartile 2nd q

uartile 3rd quartile
Lowest Highest
observatio observation
n

Q1 Q2 Q3

Media
n
4. Variance
• Average deviation from some measure of central tendency
• Every population / sample has variance
• Represented by the symbol σ2
• Formula to calculate variance
σ2 = (∑(x - μ)2) / N
• σ2 : population variance
• x : observed value
• μ : population mean
• N : total number of items in population

• Units of variance are squares of units of data – eg: squared miles, squared rupees
etc.
• Not intuitively clear or interpreted in the right way
5. Standard Deviation
• Square root of the average of the squared distances of observation from the
mean
• Represented by the symbol σ
• Formula
σ to calculate Standard
(∑( x - μ)Deviation:
2) /
σ 2
• Units = N same units as that of the data
of SD= are in the
• SD enables to determine, with a high accuracy, the values of the frequency distribution in
relation to
the mean
99 %

95 %
68
% • About 68% data lies within ±1 SD from the
mean
• About 95% data lies within ±2 SD from the
mean
• About 99% data lies within ±3 SD from the
mean

μ- μ - 2σ μ- μ μ+ μ+ μ-
3σ σ
Difference between Standard
Deviation and Variance
Standard deviation and variance are both measures of variability in a distribution,
but they differ in a few ways:
• Definition
Variance is the average of the squared differences between each data point and the
mean. Standard deviation is the square root of the variance.
• Units
Standard deviation is expressed in the same units as the original data, such as
minutes or meters. Variance is expressed in larger units, such as meters squared.
• Interpretation
Standard deviation measures how far apart the numbers in a data set are. A small
standard deviation means the data is tightly grouped around the mean, while a
larger standard deviation means the data is more spread out. Variance gives a value
to how much the numbers in a data set vary from the mean. A significant variance
means the data points are far away from the mean.
• In practice, standard deviation is probably preferred over variance because it has
the same units as the data. Variance is more often used in the background, such as
in theory or deriving something else.
You’re interested in calculating the standard deviation
of the exam scores of a national standardized test to see
if many people scored close to the mean or not. Use the
following dataset.
Test Taker Score
1 20
2 40
3 60
4 60
5 75
6 80
7 70
8 65
9 70
10 90
• In order to solve for the standard deviation, we have to
follow the formula given earlier. Take a look at
the solution below.
Test Taker Score
1 20 -43 1849
2 40 -23 529
3 60 -3 9
4 60 -3 9
5 75 12 144
6 80 17 289
7 70 7 49
8 65 2 4
9 70 7 49
10 90 27 729
63 3660
III. Measure of Association
Measures the relationship (degree and strength) between two variables that are linearly
related

1. Covariance
2. Correlation
3. Coefficient of
Variation
1. Covariance (+ -)
• Covariance is the joint variability of two random variables
• Measures the direction / sign of relationship only (+ or -) and not the
strength
• How X and Y variables are linearly associated, working in tandem
 Eg: Weight lifter training time vs Sprinter training time
• Weight lifter trains more and lifts more weight (+)
• Covariance
• Trainer
measured
trainsas positive,
more and runsnegative or zero
in less time (-)
 Positive: indicates direct or increase linear relationship
 X up - Y up
 X down – Y down
 Negative: indicates indirect or decrease in linear
relationship
 X up - Y down
 X down – Y up
• Covariance can be any number and not restricted to 0 and 1
• Formula
• Sample CoVxy
= Ʃ(x-x’)(y-y’) / n-1 x and y are the 2 random variables
where
• Population CoVxy x̄ and ȳ are the means of the 2 random
ȳ̄
= Ʃ(x-x’)(y- )/ n variables
2. Correlation ( ) o

• Measures the degree to which one variable is linearly related to the other
• 2 measures are used to describe correlation
 Coefficient of Correlation (r)
• 0 ≤ r ≤ -1 : Inverse relationship -> X-increases, Y-decreases
• 0 ≤ r ≤ 1 : direct relationship -> X-increases, Y-increases
• Measures the strength and direction
• Formula (Karl Pearson’s Coefficient of Correlation / Product
moment) r = covariance of x and y / (SD of x)*(SD of y)

r = Cov (xy) / std(x).std(y)

 Coefficient of Determination (r2)

• r2 = r * r
• Measured in percentage
• Eg: r2 = 0.83 means 83% of variation in Y (dependent variable) accounted by X
(independent variables)
• r does not mean anything, r2 conveys the actual meaning
3. Coefficient of Variation
• Relative Standard Deviation
• Measured in %
• Shows variations with relation to the mean
• Does not have any units
• Smaller CoV is better represents better q u atil y
• Formula
CoV = σ / μ

Example: CovA = CovB = 1.02/87.5

Last 15 days, trading of 2 stocks are as follows: 15.35/115 =
Stock A Stock B = 0.011
Average price: 135 Average price: 87.5 0.133
Stock A is more
SD : 15.35 SD : 1.02
risky

Which is more risky ?

Sum of Squares
Total sum of squares is used to denote the amount of variation in the
dependent variable.
Mathematically, the difference between variance and SST is that we
adjust for the degree of freedom by dividing by n–1 in the variance
formula.

SST=∑(yi−¯y) exp2
Where:
• yi – observed dependent variable
• ¯y – mean of the dependent variable

The SST tells us how close sample values are to the mean. As the SST
increases, so does the variability of the data.
Example of Calculating the SST for a
Sample with Low Variability
Calculate the SST for the following data:
{1, 2, 3}
Step 1: The mean of the sample can be calculated by adding up the values in the sample (1
+ 2 + 3) and dividing this sum by the number of values (3). Thus, the mean of this sample is:
y¯=(1+2+3)/3
=6/3
=2
Step 2: Subtract the calculated mean from each value, and square each difference.
1−2 =−1(−1) =1 =1
2−2 =0-0 =0 =0
3−2 =1-1 =1 =1
Step 3: Sum the differences.
SST =1+0+1 =2
Thus, the total sum of squares for the data {1, 2, 3} is 2.
What is the Absolute Deviation
Formula?

M = Σ (x – x̄ )/n
i

where,
M is the average absolute deviation,
x̄ is the mean of data set,
Σ (x – x̄ ) is the summation of deviations
i

from mean,
n is the number of values in data set.
What Is Average Deviation
Formula?
The formula for average deviation is utilized to determine
how much individual observations differ from the mean of
a data set. Presented below is the formula for computing
the average deviation across n observations:

Average Deviation = Σ|xi – x̅|/n

where xi are the data points,

x̅ is the mean, and
n is the number of data points.
Question : Find the Average Deviation for
the data 10,25,30,14,39,18,17. (Use median
to find central point)
Step 1: Find median for the given data.
To find median first we need to sort the given data either
in ascending order or descending order.
Sorted data- 10,14,17,18,25,30,39
Here the size of data set is odd i.e., count=7.
So we have only one middle value18 which is median.
Step 2: Find absolute deviations from data using median.
abs(10-18) = 8
abs(14-18) = 4
abs(17-18) = 1
abs(18-18) = 0
abs(25-18) = 7
abs(30-18) = 12
abs(39-18) = 21
Step 3: Sum of all deviations = 8+4+1+0+7+12+21
=53
Step 4: Find Average Deviation=sum of all deviations/count of values in
data
=53/7
=> 7.57
So Average Deviation within the given data is 7.57
Question : Find the Average Deviation for
the data 10,20,30,40,50 (Use mean/median
to find central point)
Step 1: Find the center point for the given data.
As data is already in sorted order it is preferred to use the
median to find the central point.
Here the size of the data set is odd i.e., count=5.
So we have only one middle value 30 which is the median.
Step 2: Find absolute deviations from data using the median.
abs(10-30)=20
abs(20-30)=10
abs(30-30)=0
abs(40-30)=10
abs(50-30)=20
Step 3: Sum of all deviations=20+10+0+10+20 =60
Step 4: Find Average Deviation=sum of all
deviations/count of values in data
=60/5
=>12
So Average Deviation within the given data is 12
Problem 1. Calculate the average absolute
deviation of the data set, 2, 6, 7, 4, 1.
Solution:
The data set is 2, 6, 7, 4, 1.
Here, n = 5.
Mean of the data, x̄ = (2 + 6 + 7 + 4 + 1)/5
= 20/5
=4
Using the formula we get,
M = Σ (x – x̄ )/n
i

= [|4 – 2| + |4 – 6| + |4 – 7| + |4 – 4| + |4 – 1|]/5
= (2 + 2 + 3 + 0 + 3)/5
= 10/5
=2

Midterms 2024-06-29 13 - 28 - 56
No ratings yet
Midterms 2024-06-29 13 - 28 - 56
169 pages
Unit 1 Computational Statistics
No ratings yet
Unit 1 Computational Statistics
58 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
38 pages
Variance 6 Disperson
No ratings yet
Variance 6 Disperson
68 pages
Measures of Dispersion
50% (2)
Measures of Dispersion
52 pages
Chapter 3 Measures of Variability
No ratings yet
Chapter 3 Measures of Variability
69 pages
Measures of Dispersion Updated
No ratings yet
Measures of Dispersion Updated
38 pages
Nolan S.A. - Heinzen, T. E. Statistics For Behavioral Sciences 2nd Edition
100% (1)
Nolan S.A. - Heinzen, T. E. Statistics For Behavioral Sciences 2nd Edition
710 pages
Biostat Ch-5
No ratings yet
Biostat Ch-5
58 pages
2.5 - Variance and Standard Deviation
No ratings yet
2.5 - Variance and Standard Deviation
14 pages
Measure of Variation
No ratings yet
Measure of Variation
13 pages
Measures of Dispersion The Range Standard Deviation and Variance 1
No ratings yet
Measures of Dispersion The Range Standard Deviation and Variance 1
23 pages
Concept Extract
No ratings yet
Concept Extract
27 pages
Lecture III-Measures of Dispersion
No ratings yet
Lecture III-Measures of Dispersion
33 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
32 pages
3-Measures of Dispersion
No ratings yet
3-Measures of Dispersion
33 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
17 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
3 pages
Standard Deviation
No ratings yet
Standard Deviation
37 pages
Notes Module 5
No ratings yet
Notes Module 5
19 pages
Dispersion
No ratings yet
Dispersion
16 pages
Measures of Dispersion (Autosaved)
No ratings yet
Measures of Dispersion (Autosaved)
64 pages
PS Presentation
No ratings yet
PS Presentation
20 pages
Cha - 4
No ratings yet
Cha - 4
22 pages
Business Statistics: Session 2
No ratings yet
Business Statistics: Session 2
60 pages
Unit 5 BRM
No ratings yet
Unit 5 BRM
17 pages
DISPERSION
No ratings yet
DISPERSION
5 pages
BS Lect 05
No ratings yet
BS Lect 05
35 pages
Measures of Dispersion OR Measures of Variations
No ratings yet
Measures of Dispersion OR Measures of Variations
7 pages
Statistics Computation
No ratings yet
Statistics Computation
27 pages
Notes Stats Quiz 2
No ratings yet
Notes Stats Quiz 2
10 pages
Variance
No ratings yet
Variance
11 pages
Contemporary Math (Statistics - Docx Semi
No ratings yet
Contemporary Math (Statistics - Docx Semi
22 pages
Business Statistics: by Dr. Anugamini Srivastava
No ratings yet
Business Statistics: by Dr. Anugamini Srivastava
51 pages
Statistics 05 013726
No ratings yet
Statistics 05 013726
8 pages
Chapter Four Bio
No ratings yet
Chapter Four Bio
13 pages
Analysis of Variance: Budhi Setiawan, PHD
No ratings yet
Analysis of Variance: Budhi Setiawan, PHD
17 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
6 pages
Methods of Center Measurement: X N X X X
No ratings yet
Methods of Center Measurement: X N X X X
85 pages
Measures of Dispersion or Variability
No ratings yet
Measures of Dispersion or Variability
15 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
79 pages
Class 5.2 B Business Statistics Measures of Dispersion
No ratings yet
Class 5.2 B Business Statistics Measures of Dispersion
63 pages
الثامنة
No ratings yet
الثامنة
14 pages
Chapter Four
No ratings yet
Chapter Four
21 pages
Lec 8 Measures of Dispersion 2
No ratings yet
Lec 8 Measures of Dispersion 2
16 pages
Chapter Four: Measures of Variation
No ratings yet
Chapter Four: Measures of Variation
26 pages
Measures of Dispersion: Unit 1 Part 3
No ratings yet
Measures of Dispersion: Unit 1 Part 3
33 pages
Variance and Standard Deviation
100% (3)
Variance and Standard Deviation
15 pages
Chapter Four
No ratings yet
Chapter Four
21 pages
Measures of Variability For Ungrouped Data
100% (1)
Measures of Variability For Ungrouped Data
16 pages
Measures of Dispersion and Relative Standing
No ratings yet
Measures of Dispersion and Relative Standing
11 pages
Measures of Variability
No ratings yet
Measures of Variability
20 pages
Measure of Dispersion Kurtosi, Skiwness
No ratings yet
Measure of Dispersion Kurtosi, Skiwness
22 pages
Malayo Man, Malapit Din: Outline
No ratings yet
Malayo Man, Malapit Din: Outline
4 pages
SLG 4.2 Measures of Variability
No ratings yet
SLG 4.2 Measures of Variability
5 pages
Name: John Carlo S. Mallabo Year&Course: Bs Cpe: Topic 5
No ratings yet
Name: John Carlo S. Mallabo Year&Course: Bs Cpe: Topic 5
4 pages
Solucionario Econometria Jeffrey M Wooldridge PDF
11% (9)
Solucionario Econometria Jeffrey M Wooldridge PDF
4 pages
Measures of Variation
No ratings yet
Measures of Variation
30 pages
Steps in Factor Analysis
No ratings yet
Steps in Factor Analysis
3 pages
M11n - Lesson 3.2 - PPT - Handout - Measures of Variability - 1sem22-23
100% (1)
M11n - Lesson 3.2 - PPT - Handout - Measures of Variability - 1sem22-23
8 pages
Management Science - Chapter 7 - Test Reveiwer
No ratings yet
Management Science - Chapter 7 - Test Reveiwer
10 pages
Department of Education: 4 QUARTER - Module 1
No ratings yet
Department of Education: 4 QUARTER - Module 1
16 pages
Understanding The Independent-Samples T Test
No ratings yet
Understanding The Independent-Samples T Test
8 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
66 pages
Federal University of Kashere Faculty of Education: FUKU/EDU/20/BIO/0022
No ratings yet
Federal University of Kashere Faculty of Education: FUKU/EDU/20/BIO/0022
6 pages
Regression Analysis 2022
No ratings yet
Regression Analysis 2022
92 pages
Regression Analysis
100% (1)
Regression Analysis
11 pages
10.1055@s 0035 1548890
No ratings yet
10.1055@s 0035 1548890
9 pages
House Price Regression Analysis
No ratings yet
House Price Regression Analysis
15 pages
UBE Automotive MSA System Bias and Linearity Studies
No ratings yet
UBE Automotive MSA System Bias and Linearity Studies
6 pages
Chapter 7: BIOSTATISTICS
No ratings yet
Chapter 7: BIOSTATISTICS
19 pages
Bertrand Et Al. (2004) - How Much Should We Trust Differences-In-Differences Estimates
No ratings yet
Bertrand Et Al. (2004) - How Much Should We Trust Differences-In-Differences Estimates
28 pages
Arora 2019
No ratings yet
Arora 2019
29 pages
Basics of Hypothesis Testing
No ratings yet
Basics of Hypothesis Testing
36 pages
Control Charts Template
No ratings yet
Control Charts Template
14 pages
How To Be A Bayesian in Sas
No ratings yet
How To Be A Bayesian in Sas
9 pages
Data Science in Medicine - Precision & Recall or Specificity & Sensitivity? - by Alon Lekhtman - Towards Data Science
No ratings yet
Data Science in Medicine - Precision & Recall or Specificity & Sensitivity? - by Alon Lekhtman - Towards Data Science
11 pages
4 Measures of Centrality: Mean, Median, Mode, Grouped Data
No ratings yet
4 Measures of Centrality: Mean, Median, Mode, Grouped Data
18 pages
Data Preparation
No ratings yet
Data Preparation
12 pages
Tutorial 7
No ratings yet
Tutorial 7
3 pages
Template For Activity No. 3 T TESTANOVA
No ratings yet
Template For Activity No. 3 T TESTANOVA
6 pages
14622inferenceforsingleproportions 160909005557
No ratings yet
14622inferenceforsingleproportions 160909005557
19 pages
Chapter 6 Multicollinerity
No ratings yet
Chapter 6 Multicollinerity
4 pages
4 Rejection Region For The Population Mean
No ratings yet
4 Rejection Region For The Population Mean
2 pages
Star Test
No ratings yet
Star Test
7 pages
Formulas (Appendix)
No ratings yet
Formulas (Appendix)
2 pages
Exercise TimeSeries
No ratings yet
Exercise TimeSeries
1 page
Calculus: Maths of the Gods
From Everand
Calculus: Maths of the Gods
Bill Todorovich
No ratings yet

Statistic Part 2

Uploaded by

Statistic Part 2

Uploaded by

II.

1st quartile 2nd q

r = Cov (xy) / std(x).std(y)

 Coefficient of Determination (r2)

Example: CovA = CovB = 1.02/87.5

Which is more risky ?

Average Deviation = Σ|xi – x̅|/n

where xi are the data points,

You might also like