How To Measure Anything 3rd - Ed Ch.9 Examples
How To Measure Anything 3rd - Ed Ch.9 Examples
(From pg 134-135)
This shows the steps on pages 134-135 for computing the upper and lower bounds
of a 90% confidence interval. The "t-stat" is used for small samples. Note that two
excel formulas are introduced here that give the same answers as the steps in the
book: VAR() and TINV(). Th will slightly simplify the calculation. You can use this
spreadsheet to compute ranges for larger samples, too. Just add however many
rows you need and don't forget to change the references in the cells (i.e. "B5:B9")
to capture all the new rows. See how close the answers would be if you just used
the "Mathless" table on the next page.
Squared Differences
Sample Value (step 1.b in book)
1.4 0
1.4 0
1.5 0.01
1.6 0.0400000000000001
1.1 0.0899999999999999
This is just another copy of exhibit 9.3 from the book so you can
quickly compare it to the answers you get in "JellyBeanTstat" page
in the spreadsheel. This is a much simpler approximation than the
t-stat method. As described in the book, you simply use the table to
determine which values in the sample approximate the 90% CI. For
example, if you have 11 random samples, simply take the 3rd largest
and 3rd smallest values to approximate the upper and lower
bounds.
sample size largest/smallest Actual confidence
5 1st 93.8%
8 2nd 93.0%
11 3rd 93.5%
13 4th 90.8%
16 5th 92.3%
18 6th 90.4%
21 7th 92.2%
23 8th 90.7%
26 9th 92.4%
28 10th 91.3%
30 11th 90.1%
Chapter 9: Computing 90% Confidence Intervals with Small Samples
Example follows steps on Pages 202-203 of the 3rd edition of How to Measure Anything
This example follows the steps shown on pages 202-203 of the 3rd edition How to Measure
Anything for computing the upper and lower bounds of a 90% confidence interval. The "t-stat"
is used for small samples. Note that two excel formulas are introduced here that give the same
answers as the steps in the book: VAR() and TINV(). This will slightly simplify the calculation.
You can use this spreadsheet to compute ranges for larger samples, too. Just add however
many rows you need and don't forget to change the references in the cells (i.e. "C5:C9") to
capture all the new rows.
Data
2 4 6 8 10 12 14 16 18 20
20
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
Chapter 9 Example: Computing 90% Confidence with
Catch/Recatch
Example starts on Page 214 of the 3rd edition of How to Measure Anything
This shows the calculations for the estimation of the size of a population
when you can't actually count or even see the entire population.
This controlled experiment example compares each of three groups - the test group, the control
group and the original baseline of both groups before the additional training started. The experiment
showed that the training improved the results in the test group compared to the control group. But
what is the chance this is a random fluke? The three values under "Chance the population order is
different" shows the probability that the test group could really be equal to or worse than the control
group, even though the experiment showed otherwise. We also check the control vs. the baseline
and the test vs. the baseline. This shows us that the control and baseline were virtually identical (it is
almost a coin flip which one is higher) and that the test group is also a significant improvement on
the baseline. It is highly unlikely that the difference is due to chance alone or due to factors other
than the new training program.
Variance of
Chance the
Difference Difference
population order is
Between Means between
different
means
Test vs. Control 0.34 0.145199619328 1.0%
Control vs. Baseline 0.01 0.11725428952 47.6%
Test vs. Baseline 0.35 0.137098166133 0.6%
Chapter 9 Example: Television Ratings Regression Example
Example starts on Page 236 of the 3rd edition of How to Measure Anything
This is the television ratings regression example shown on pages 236-242 of the 3rd
edition of How to Measure Anything. It shows how simple Excel formulas compare to
Excel's regression wizard (see below). These simple Excel formulas only work for
situations where we are correlating a single variable to another. The regression wizard,
however, can work for multiple variables (as explained in the example in the book).
This shows a correction to the book where the slope and intercept were different
because column references in the Excel function were reversed.
Slope: Additional ratings points for one extra week of promotion 0.215453402729
Duration of
Promotion Ratings Points
Period (weeks) Below is the "summary output" table generated by the regression
"Analysis Toolpack" add on in Excel).
2 0.3
SUMMARY
1 1.3
1 1.0 Regression Statistics
1 0.9 Multiple R 0.702952136634
3 1.6 R Square 0.494141706399
2 1.3 Adjusted R Square 0.474685618183
4 1.9 Standard Error 0.543101204661
1 0.3 Observations 28
4 2.2
3 2.1 ANOVA
4 1.2 df SS
5 2.1 Regression 1 7.491305636
3 2.5 Residual 26 7.668931881
5 1.8 Total 27 15.16023752
5 2.3
6 2.5 Coefficients Standard Error
6 2.4 Intercept 0.877331144348 0.226161926
6 1.9 Duration of Promo 0.215453402729 0.042751895
6 1.5
6 2.6
8 2.1
6 2.5
7 3.3
7 1.3
7 1.8
7 2.4
9 3.0
9 2.7
Ratings Points for the Program
12
10
0
0 2 4 6 8 10 12
Duration of Promotion Period (Weeks)
enerated by the regression wizard in Excel's data analysis tool (available if you set up the
SUMMARY OUTPUT
MS F Significance F
7.491306 25.39779 3.027E-05
0.294959
t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
3.879217 0.00064 0.41244865 1.34221364 0.4124486471 1.3422136416
5.039622 3.03E-05 0.12757562 0.30333118 0.1275756249 0.3033311806