Variance StdDev

Variance StdDev(4)

Uploaded by

varsha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views47 pages

Variance StdDev

Variance StdDev(4)

Uploaded by

varsha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 47

What Is Variance?

• The variance is a measure of variability. It is

calculated by taking the average of squared
deviations from the mean.
• Variance tells you the degree of spread in
your data set.
• The more spread the data, the larger the
variance is in relation to the mean.
standard deviation

• The standard deviation is derived from variance

and tells you, on average, how far each value lies
from the mean.
• It’s the square root of variance.
• Both measures reflect variability in a distribution,
but their units differ:
• Standard deviation is expressed in the same
units as the original values (e.g., meters).
• Variance is expressed in much larger units (e.g.,
meters squared)
• Since the units of variance are much larger
than those of a typical value of a data set, it’s
harder to interpret the variance number
intuitively. That’s why standard deviation is
often preferred as a main measure of
variability.
• However, the variance is more informative
about variability than the standard deviation,
and it’s used in making statistical inferences.
steps for calculating the sample
standard deviation:
• Calculate the mean (simple average of the
numbers).
• For each number: subtract the mean.
• Square the result.
• Add up all of the squared results.
• Divide this sum by one less than the number of
data points (N - 1).
• This gives you the sample variance.
• Take the square root of this value to obtain the
sample standard deviation.
• Population Standard Deviation
• The population standard deviation, the standard definition of σ, is used
when an entire population can be measured, and is the square root of
the variance of a given data set.
• In cases where every member of a population can be sampled, the
following equation can be used to find the standard deviation of the
entire population:

• Where xi is an individual value

μ is the mean/expected value
N is the total number of values
• i.e. for the data set 1, 3, 4, 7, 8, i=1 would be 1,
i=2 would be 3, and so on. Hence the summation
notation simply means to perform the operation
of (xi - μ)2 on each value through N, which in this
case is 5 since there are 5 values in this data set.
• EX: μ = (1+3+4+7+8) / 5 = 4.6
σ = √[(1 - 4.6)2 + (3 - 4.6)2 + ... + (8 - 4.6)2)]/5
σ = √(12.96 + 2.56 + 0.36 + 5.76 + 11.56)/5 =
2.577
Sample Standard Deviation

• In many cases, it is not possible to sample every member within a

population, requiring that the above equation be modified so that the
standard deviation can be measured through a random sample of the
population being studied. A common estimator for σ is the sample
standard deviation, typically denoted by s.

Where xi is one sample value

x̄ is the sample mean
N is the sample size
• https://www.thoughtco.com/sample-
standard-deviation-problem-609528
• https://byjus.com/maths/standard-deviation/
Applications of Standard Deviation

• Standard deviation is widely used in experimental and

industrial settings to test models against real-world
data.
• An example of this in industrial applications is quality
control for some products.
• Standard deviation can be used to calculate a minimum
and maximum value within which some aspect of the
product should fall some high percentage of the time.
• In cases where values fall outside the calculated
range, it may be necessary to make changes to the
production process to ensure quality control.
• Standard deviation is also used in weather to determine differences in
regional climate.
• Imagine two cities, one on the coast and one deep inland, that have the
same mean temperature of 75°F. While this may prompt the belief that
the temperatures of these two cities are virtually the same, the reality
could be masked if only the mean is addressed and the standard deviation
ignored.
• Coastal cities tend to have far more stable temperatures due to regulation
by large bodies of water, since water has a higher heat capacity than land;
essentially, this makes water far less susceptible to changes in
temperature, and coastal areas remain warmer in winter, and cooler in
summer due to the amount of energy required to change the
temperature of the water. Hence, while the coastal city may have
temperature ranges between 60°F and 85°F over a given period of time to
result in a mean of 75°F, an inland city could have temperatures ranging
from 30°F to 110°F to result in the same mean.
• Another area in which standard deviation is largely used is finance,
where it is often used to measure the associated risk in price
fluctuations of some asset or portfolio of assets.
• The use of standard deviation in these cases provides an estimate
of the uncertainty of future returns on a given investment.
• For example, in comparing stock A that has an average return of 7%
with a standard deviation of 10% against stock B, that has the same
average return but a standard deviation of 50%, the first stock
would clearly be the safer option, since the standard deviation of
stock B is significantly larger, for the exact same return. That is not
to say that stock A is definitively a better investment option in this
scenario, since standard deviation can skew the mean in either
direction. While Stock A has a higher probability of an average
return closer to 7%, Stock B can potentially provide a significantly
larger return (or loss).
Types of data
Chi-squared test
• A chi-square (χ2) statistic is a test that
measures how a model compares to actual
observed data.
• The data used in calculating a chi-square
statistic must be random, raw, mutually
exclusive, drawn from independent variables,
and drawn from a large enough sample. For
example, the results of tossing a fair coin
meet these criteria.
• Chi-square tests are often used to test
hypotheses. The chi-square statistic compares
the size of any discrepancies between the
expected results and the actual results, given
the size of the sample and the number of
variables in the relationship.
• For these tests, degrees of freedom are used
to determine if a certain null hypothesis can
be rejected based on the total number of
variables and samples within the experiment.
• As with any statistic, the larger the sample
size, the more reliable the results.
• Degrees of Freedom
• Degrees of freedom are the number of independent variables
that can be estimated in a statistical analysis. These value of
these variables are without constraint, although the values do
impost restrictions on other variables if the data set is to
comply with estimate parameters.
What Does a Chi-Square Statistic Tell
You?
• There are two main kinds of chi-square tests: 1.
The test of independence, which asks a question
of relationship, such as, "Is there a relationship
between student gender and course choice?“
2. Goodness-of-Fit
χ2 provides a way to test how well a sample of data
matches the (known or assumed) characteristics
of the larger population that the sample is
intended to represent. This is known as goodness
of fit.
• When considering student gender and course choice,
a χ2 test for independence could be used. To do this
test, the researcher would collect data on the two
chosen variables (gender and courses picked) and
then compare the frequencies at which male and
female students select among the offered classes
using the formula given above and a χ2 statistical
table.
• If there is no relationship between gender and
course selection (that is, if they are independent),
then the actual frequencies at which male and
female students select each offered course should be
expected to be approximately equal, or conversely,
the proportion of male and female students in any
selected course should be approximately equal to
the proportion of male and female students in the
sample.
• A χ2 test for independence can tell us how
likely it is that random chance can explain any
observed difference between the actual
frequencies in the data and these theoretical
expectations.
Goodness-of-Fit

• For example, consider an imaginary coin with exactly a 50/50

chance of landing heads or tails and a real coin that you toss 100
times. If this coin is fair, then it will also have an equal probability of
landing on either side, and the expected result of tossing the coin
100 times is that heads will come up 50 times and tails will come up
50 times.4
• In this case, χ2 can tell us how well the actual results of 100 coin
flips compare to the theoretical model that a fair coin will give
50/50 results. The actual toss could come up 50/50, or 60/40, or
even 90/10. The farther away the actual results of the 100 tosses is
from 50/50, the less good the fit of this set of tosses is to the
theoretical expectation of 50/50, and the more likely we might
conclude that this coin is not actually a fair coin.
When to Use a Chi-Square Test

• A chi-square test is used to help determine if

observed results are in line with expected
results, and to rule out that observations are
due to chance.
• A chi-square test is appropriate for this when
the data being analyzed are from a random
sample, and when the variable in question is a
categorical variable.
• A categorical variable is one that consists of
selections such as type of car, race, educational
attainment, male or female, or how much
somebody likes a political candidate (from
• These types of data are often collected via survey
responses or questionnaires. Therefore, chi-
square analysis is often most useful in analyzing
this type of data. very much to very little).
How to Perform a Chi-Square Test

• These are the basic steps whether you are performing a

goodness of fit test or a test of independence:
1. Create a table of the observed and expected frequencies;
2. Use the formula to calculate the chi-square value;
3. Find the critical chi-square value using a chi-square value table
or statistical software;
4. Determine whether the chi-square value or the critical value is
the larger of the two;
5. Reject or accept the null hypothesis.
Example
Problem1
Q.1
Ans
Q.2
Ans
• Step 1: Formulate the hypotheses
 Null Hypothesis:
H0: There is no significant association between
students’ educational level
and their preference for online or face-to-face
instruction.
or
H0: There is no difference in the distribution of
instructional preferences
between undergraduate and graduate students.
• Alternative Hypothesis:
Ha: There is a significant association between
students’ educational level and
their preference for online or face-to-face
instruction.
or
Ha: There is a significant difference in the
distribution of instructional
preferences between undergraduate and
graduate students

Statistics and Probability Notes Part 1
No ratings yet
Statistics and Probability Notes Part 1
23 pages
Chap2 Data
No ratings yet
Chap2 Data
101 pages
Variance and Standard Deviation
100% (3)
Variance and Standard Deviation
15 pages
Week2 Class3
No ratings yet
Week2 Class3
19 pages
Chi-Square+Test+ +Analysis+of+Variance
No ratings yet
Chi-Square+Test+ +Analysis+of+Variance
19 pages
DAV Unit 4 Material
No ratings yet
DAV Unit 4 Material
49 pages
Measures of Variability - Khyati
No ratings yet
Measures of Variability - Khyati
40 pages
Methods Guide
No ratings yet
Methods Guide
16 pages
Introductory Statistics 8th Ed by Mann (PDFDrive) - 16-20
No ratings yet
Introductory Statistics 8th Ed by Mann (PDFDrive) - 16-20
5 pages
Slideset 2
No ratings yet
Slideset 2
63 pages
Ai Hon 4
No ratings yet
Ai Hon 4
22 pages
Lecture 4
No ratings yet
Lecture 4
38 pages
Define The Null Hypothesis (No Difference Between Sample and Theoretical Distribution) and The Alternative Hypothesis (Difference Exists) .
No ratings yet
Define The Null Hypothesis (No Difference Between Sample and Theoretical Distribution) and The Alternative Hypothesis (Difference Exists) .
21 pages
AYURSURE (Research and Stat) 4
No ratings yet
AYURSURE (Research and Stat) 4
44 pages
Measures of Variability
No ratings yet
Measures of Variability
20 pages
The Derivation and Choice of Appropriate Test Statistic (Z, T, F and Chi-Square Test) in Research Methodology
No ratings yet
The Derivation and Choice of Appropriate Test Statistic (Z, T, F and Chi-Square Test) in Research Methodology
9 pages
Week 6. Chapter 7 Introduction To Inferential Statistics
No ratings yet
Week 6. Chapter 7 Introduction To Inferential Statistics
24 pages
Liv-Stats 2
No ratings yet
Liv-Stats 2
15 pages
Standard Deviation
No ratings yet
Standard Deviation
8 pages
Adstat Final Exam Reviewer2
No ratings yet
Adstat Final Exam Reviewer2
29 pages
4.2 Different Tests
No ratings yet
4.2 Different Tests
31 pages
ML Unit-3
No ratings yet
ML Unit-3
18 pages
Chapter 11 - ANOVA 5
No ratings yet
Chapter 11 - ANOVA 5
36 pages
7 Chi-Square and F
No ratings yet
7 Chi-Square and F
68 pages
PSAI Unit 5
No ratings yet
PSAI Unit 5
25 pages
Unit II TYCS DS
No ratings yet
Unit II TYCS DS
176 pages
Week 017 Measures of Central Tendency
No ratings yet
Week 017 Measures of Central Tendency
15 pages
Chi-Square As A Test For Comparing Variance
No ratings yet
Chi-Square As A Test For Comparing Variance
9 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Econ Review Stat W2 2025
No ratings yet
Econ Review Stat W2 2025
49 pages
Lecture 3
No ratings yet
Lecture 3
14 pages
What Is A Chi-Square Statistic?
No ratings yet
What Is A Chi-Square Statistic?
10 pages
P102 Lesson 4
No ratings yet
P102 Lesson 4
24 pages
Basic Univariate Statistics For Engineers 2019
No ratings yet
Basic Univariate Statistics For Engineers 2019
32 pages
Variance
No ratings yet
Variance
6 pages
Statistics 1
No ratings yet
Statistics 1
9 pages
Statistics From PLTW
No ratings yet
Statistics From PLTW
64 pages
Chapter 5
No ratings yet
Chapter 5
11 pages
Amrch15 17
No ratings yet
Amrch15 17
2 pages
CHAPTERS
No ratings yet
CHAPTERS
17 pages
BDU Biometrics
No ratings yet
BDU Biometrics
122 pages
Statistics - Compendium - DMS IIT DELHI - 2025
No ratings yet
Statistics - Compendium - DMS IIT DELHI - 2025
18 pages
Statistics Unit 9 Notes
No ratings yet
Statistics Unit 9 Notes
10 pages
Essentials of Biostatistics and Research
0% (1)
Essentials of Biostatistics and Research
6 pages
Adstat Final Exam Reviewer2highlighted
No ratings yet
Adstat Final Exam Reviewer2highlighted
29 pages
Chisquare
No ratings yet
Chisquare
10 pages
7 Reference
No ratings yet
7 Reference
1 page
Statistics For Business and Economics
100% (1)
Statistics For Business and Economics
7 pages
Lecture3 - Contingency Analysis
No ratings yet
Lecture3 - Contingency Analysis
16 pages
Chapter 4
No ratings yet
Chapter 4
8 pages
SDM 1 Formula
No ratings yet
SDM 1 Formula
9 pages
Lecture 4. Dispersion
No ratings yet
Lecture 4. Dispersion
6 pages
Chi Square Test
No ratings yet
Chi Square Test
7 pages
Standard Deviation
No ratings yet
Standard Deviation
9 pages
Statistics 1 (Final) / Orthodontic Courses by Indian Dental Academy
No ratings yet
Statistics 1 (Final) / Orthodontic Courses by Indian Dental Academy
15 pages
1 - Chapter (1) Analysis of Data and Its Types Exercise
No ratings yet
1 - Chapter (1) Analysis of Data and Its Types Exercise
10 pages
Basic - Statistics 30 Sep 2013 PDF
100% (1)
Basic - Statistics 30 Sep 2013 PDF
20 pages
Descriptive and Inferential Statistics
No ratings yet
Descriptive and Inferential Statistics
10 pages
Module 1 - Statistical Process Control PDF
No ratings yet
Module 1 - Statistical Process Control PDF
37 pages
Types of Statistical Tests
No ratings yet
Types of Statistical Tests
4 pages
MCQ Business Statistics
50% (2)
MCQ Business Statistics
41 pages
Identification of Outliers (Monographs On Statistics and - D. M. Hawkins (Auth.)
No ratings yet
Identification of Outliers (Monographs On Statistics and - D. M. Hawkins (Auth.)
194 pages
Business Quantitative Techniques
No ratings yet
Business Quantitative Techniques
52 pages
Bharathidasan University-Statistics-QP-Nov-2010
No ratings yet
Bharathidasan University-Statistics-QP-Nov-2010
3 pages
Par Inc Case Problem
No ratings yet
Par Inc Case Problem
2 pages
Unit-5 Bda
No ratings yet
Unit-5 Bda
21 pages
Training Survey For Banks
No ratings yet
Training Survey For Banks
21 pages
The Analysis of Biological Data Michael C Whitlock Dolph Schluter Download
No ratings yet
The Analysis of Biological Data Michael C Whitlock Dolph Schluter Download
76 pages
Vector Error Correction Models
No ratings yet
Vector Error Correction Models
6 pages
Educ 98 Measures of Variability
No ratings yet
Educ 98 Measures of Variability
7 pages
HW 7 Solutions
No ratings yet
HW 7 Solutions
7 pages
Complete Time Series Analysis in Python 1673057003
No ratings yet
Complete Time Series Analysis in Python 1673057003
56 pages
EPS and DPS
No ratings yet
EPS and DPS
11 pages
Module 5 - Data Visualization - File 1
No ratings yet
Module 5 - Data Visualization - File 1
3 pages
KDS 551 Lab
No ratings yet
KDS 551 Lab
1 page
Summary of Frequency Distribution, Cross Tabulation and Hypothesis Testing
No ratings yet
Summary of Frequency Distribution, Cross Tabulation and Hypothesis Testing
3 pages
Individual Household Electric Power Consumption
No ratings yet
Individual Household Electric Power Consumption
29 pages
Worksheet For Dspersion
No ratings yet
Worksheet For Dspersion
7 pages
Theory of Estimation
No ratings yet
Theory of Estimation
21 pages
Corelation
No ratings yet
Corelation
14 pages
May 23
No ratings yet
May 23
21 pages
Correction of Measurement Error - Part 1
No ratings yet
Correction of Measurement Error - Part 1
22 pages
Ba 4 Sem Psychology Statistical Methods and Psychological Testing Winter 2018
No ratings yet
Ba 4 Sem Psychology Statistical Methods and Psychological Testing Winter 2018
9 pages
Homework Week 6 1 6 1
No ratings yet
Homework Week 6 1 6 1
5 pages
Bba 1
No ratings yet
Bba 1
3 pages
Forecast UPC-Level FMCG Demand, Part II: Hierarchical Reconciliation
No ratings yet
Forecast UPC-Level FMCG Demand, Part II: Hierarchical Reconciliation
9 pages
A Classifiers Voting Model For Exit Prediction of Privately Held Companies
No ratings yet
A Classifiers Voting Model For Exit Prediction of Privately Held Companies
6 pages
Lab 11 ANOVA 2way Worksheet
No ratings yet
Lab 11 ANOVA 2way Worksheet
2 pages

Variance StdDev

Uploaded by

Variance StdDev

Uploaded by

What Is Variance?

• The variance is a measure of variability. It is

• The standard deviation is derived from variance

• Where xi is an individual value

• In many cases, it is not possible to sample every member within a

Where xi is one sample value

• Standard deviation is widely used in experimental and

• For example, consider an imaginary coin with exactly a 50/50

• A chi-square test is used to help determine if

• These are the basic steps whether you are performing a

You might also like