Variance and Standard Deviation Math 217 Probability and Statistics
Variance and Standard Deviation Math 217 Probability and Statistics
1
(Absolute value is needed in case c is negative.) It’s S = X1 + X2 + · · · + Xn where each Xi equals 1
easy to show that Var(cX) = c2 Var(X): with probability p and 0 with probability q = 1 − p.
By the preceding theorem,
Var(cX) = E (cX − µ)2
Var(S) = Var(X1 ) + Var(X2 ) + · · · + Var(Xn ).
= E c2 (X − µ)2
In general, the variance of the sum of two random Var(S) = n Var(X) = npq.
variables is not the sum of the variances of the two Taking the square root, we see that the standard
random variables. But it is when the two random deviation of that binomial distribution is √npq.
variables are independent. That gives us the important observation that the
Theorem. If X and Y are independent random spread of a binomial distribution is proportional to
variables, then Var(X + Y ) = Var(X) + Var(Y ). the square root of n, the number of trials.
Proof: This proof relies on the fact that E(XY ) = The argument generalizes to other distributions:
E(X) E(Y ) when X and Y are independent. The standard deviation of a random sample is pro-
portional to the square root of the number of trials
Var(X + Y ) in the sample.
= E((X + Y )2 ) − µ2X+Y
Variance of a geometric distribution. Con-
= E(X 2 + 2XY + Y 2 ) − (µX + µY )2
sider the time T to the first success in a Bernoulli
= E(X 2 ) + 2E(X)E(Y ) + E(Y 2 ) process. Its probability mass function is f (t) =
− µ2X − 2µX µY − µ2Y pq t−1 . We saw that its mean was µ = E(T ) =
1
= E(X 2 ) + 2µX µY + E(Y 2 ) p
. We’ll compute its variance using the formula
− µ2X − 2µX µY − µ2Y Var(X) = E(X 2 ) − µ2 .
= E(X 2 ) − µ2X + E(Y 2 ) − µ2Y X∞
2
E(T ) = t2 pq t−1
= Var(X) + Var(Y )
t=1
q.e.d. = 1p + 22 pq + 32 pq 2 + · · · + n2 pq n−1 + · · ·
The last power series we got when we evaluated
Variance of the binomial distribution. Let S E(T ) was
be the number of successes in n Bernoulli trials, 1
where the probability of success is p. This random = 1 + 2x + 3x2 + · · · + nxn−1 + · · ·
(1 − x)2
variable S has a binomial distribution, and we could
use its probability mass function to compute it, but Multiply it by x to get
there’s an easier way. The random variable S is x
actually a sum of n independent Bernoulli trials = x + 2x2 + 3x3 + · · · + nxn + · · ·
(1 − x)2
2
Differentiate that to get
1+x
= 1 + 22 x + 32 x2 + · · · + n2 xn−1 + · · ·
(1 − x)3