Fathom Tutorial
Fathom Tutorial
by Roland W. Meisel
Version 1.0 2002-09-12
Foreword
This tutorial is designed for the teacher who will be using Fathom to teach the Grade 12 Mathematics
of Data Management MDM4U course. It is keyed to the text Mathematics of Data Management,
2002, McGraw-Hill Ryerson. All worked examples and technical extensions that specifically make use
of Fathom are included in this tutorial. Step-by-step keystroke instructions and liberal use of screen
shots will ease the novice along the learning curve for this powerful new technology. The user will find it
helpful to follow through the text as he or she works through this tutorial. Once the user has worked
through the tutorial, he or she will have a solid knowledge of Fathom as applied to the MDM4U course,
including:
* entering, displaying, sorting, and filtering data
* calculating values such as maximum, minimum, mean
* calculating measures of central tendency and measures of spread
* creating histograms
* creating and automating simulations of processes involving random numbers
* scatter plots, lines of best fit, and correlation coefficients
* linear and non-linear regression
* dynamic curve-fitting using sliders
* detecting a hidden variable
* analysing distributions
* creating normal probability plots
About Fathom
Fathom is a powerful dynamic statistical software package published by
An excellent web site, with Fathom resources and links to other web sites, is at www.keypress.com.
2 Fathom Tutorial
Fathom is a statistics software package that offers a variety of powerful data analysis tools in an easyto-use format. This section introduces the most basic features of Fathom: entering, displaying, sorting,
and filtering data. A complete guide is available on the Fathom CD. The real power of this software
will be demonstrated in later chapters with examples that apply its sophisticated tools to statistical
analysis and simulations.
When you enter data into Fathom, it creates a collection, an object that contains the data. It can then
use the data from the collection to produce other objects, such as a graph, table, or statistical test. These
secondary objects display and analyse the data from the collection, but they do not actually contain the
data themselves. If you delete a graph, table, or statistical test, the data still remains in the collection.
Fathom considers a collection as a set of cases. Each case in a collection can have a number of
attributes. For example, the cases in a collection of medical records could have attributes such as the
patients name, age, sex, height, weight, blood pressure, and so on. There are two basic types of attributes,
categorical (such as male/female) and continuous (such as height or weight). The Case Table feature
displays the cases in a collection in a format similar to a spreadsheet, with a row for each case and a
column for each attribute. You can add, modify, and delete cases using a case table.
Example 1: Tables and Graphs
a) Set up a collection for the hockey league standings from Example 3 on page 17.
b) Graph the Team and Points attributes.
Note: The Student e-book contains a Microsoft PowerPoint presentation showing the solutions for this
section. Appendix B of the student text contains detailed information on using Fathom.
Solution for Example 1
Launch Fathom and drag the case table
icon from the shelf to the work area. Click on the
attribute <new>, type the heading Team, and press Enter. Similarly, create attribute columns for Wins,
Losses, Ties, GF, GA, and Points. Enter the data into each attribute column. When you are finished, your
case table will look like the screen shot shown below.
Fathom Tutorial 3
4 Fathom Tutorial
b) Select Add Filter from the Data menu. Click on the plus
sign beside Attributes, as shown in the screen shot at the
right. Double-click on the Points attribute, select the lessthan button, and type 16. Click the Apply button, and then
OK button.
Note that the Filter is listed at the bottom as Points < 16.
c) Click on the HockeyStats collection box, and drag a new
case table onto the work area. Click on the Wins attribute.
Select Hide Attribute from the Display menu. Use the same
method to hide the Losses, Ties, and Points attributes. Rightclick on the GF attribute, and use Sort Descending to rank
the teams. Your result will look like the screen shot at the
right.
Fathom Tutorial 5
Questions:
1. Enter the data from Example 1 into Fathom. Use the built-in functions in Fathom to find the
following (see Appendix B or consult the Fathom Help screen or manual):
a) the mean of goals against (GA)
b) the largest value of goals for (GF)
c) the smallest value of GF
d) the sum of GA
e) the sum of GA and GF for each case
2. a) Set up a new collection with the following student marks:
65, 88, 56, 76, 74, 99, 43, 56, 72, 81, 80, 30, 92
b) Sort the marks from lowest to highest.
ii) Calculate the mean mark.
iii) Determine the median (middle) mark.
Solution for Question 1
a) Double-click on the HockeyStats collection box to open the Inspector. Select the Measures tab.
Double-click on <new> and rename it MeanGA. Right-click on the box under Formula, and select Edit
Formula. Select Functions/Statistical/One Attribute. Double-click on Mean. Move up to Attributes,
and double click on GA. Your final formula will look like the screen shot at the left below.
Click Apply and OK. The Inspector will look like the screen shot at the right above. Note that the mean
of goals against has been calculated under Value.
6 Fathom Tutorial
c) Add the smallest value of goals for GF to the Inspector using the min function under
Functions/Statistical/One Attribute.
d) Add the sum of goals against GA to the Inspector using the sum function under
Functions/Statistical/One Attribute. The final result will look like the screen shot at the right.
e) Add an attribute called GAplusGF. Edit the formula to GA + GF, as shown in the screen shot below.
Fathom Tutorial 7
1.4 Simulations
8 Fathom Tutorial
Solution 4 FathomTM
a) FathomTM has built-in functions to generate random numbers and count the
scores in the simulations.
Launch FathomTM and open a new document if necessary. Drag a new collection
box to the document and rename it MCTest. Right click on the box and create 20
new cases.
Drag a case table to the document. You should see your 20 cases listed. You can
expand the table by dragging the edges if you cannot see them all on the screen.
Rename the <new> column Guess. Right click on Guess and select Edit
Formula. Double-click on Functions, then Random Numbers. Double-click the
randomInteger() function. Enter 1,5 between the brackets of the
randomInteger() function, and click OK. This will fill the Guess column with
random integers between 1 and 5. For the purposes of this simulation, assume that
the correct answer for each question is 1. Scroll down the column to see how many
correct guesses there are in this simulation.
Note: Since these numbers are random, your numbers are unlikely to be the same
as those shown in the screen shot at the right.
b) You can run a new simulation by pressing Ctrl-Y, which will fill the Guess
column with a new set of random numbers. Try this a couple of times.
Better still, you can set FathomTM to repeat the simulation 100 times automatically
and keep track of the number of correct guesses. Continue below to see how this is
done.
First, set up the counting
function. Right click on the
collection box and select
Inspect Collection. Select the
Measures tab and rename the
<new> column Score. Then,
right-click the column below
Formula and select Edit
Formula. Expand Functions,
then Statistical. Select count,
enter Guess = 1 between the
brackets, and click OK to count
the number of correct guesses
in your case table.
Click on the MCTest collection box. Now, select Analyse/Collect Measures
from the main menu bar. This will create a new collection called Measures
from MCTest. Click on this new collection box, and drag a new case table to
the document. FathomTM will automatically run five simulations of the multiplechoice test and show the results in this case table.
Fathom Tutorial 9
10 Fathom Tutorial
Fathom Tutorial 11
b) Drag a graph icon to the workspace. Drag the StarWars attribute to the
horizontal axis of the graph. Change the graph to a Box Plot using the dropdown menu in the upper right corner of the graph.
12 Fathom Tutorial
Fathom Tutorial 13
14 Fathom Tutorial
Note the effect on the location and equation of the line of best fit. Note also the changes in the coordinates
of this point in the related case table.
Fathom Tutorial 15
16 Fathom Tutorial
Note that Fathom will assign each slider a generic "V" name.
You can double-click on each of these names in turn, and change
them to a, b, and c.
Right-click on the graph, and select Plot Function. The
expression dialogue box will appear.
Fathom Tutorial 17
Adjust the sliders until the curve makes its best fit to the scatter plot. In most cases, it will not fit
perfectly. You can drag the scale on each slider to obtain a wider or narrower range of values. When you
are finished, your screen should look much like the one shown below. Note: Take your time with this
step. Adjusting the sliders and slider scales properly takes some practice. When you have the correct
values, you can make small adjustments to each slider, and observe the smooth dynamic effects on the
graph.
d) At this point, you have done the curve fit by "eyeball". To get a better measure of the accuracy of the
fit, add a "least squares" column to your case table. Right-click on the LeastSquares attribute, and edit
the formula to:
(Weightlbs (a*Agedays2 + b*Agedays + c))^2
as shown below.
This will calculate the square of the difference between the scatter plot data and the value predicted by
your quadratic fit.
18 Fathom Tutorial
Next, calculate the sum of the squares. Double-click on the Curve Fitting collection box to open the
Inspector. Select the Measures tab. Double-click on <new> and rename it SumOfSquares. Right-click
on the box under Formula, and select Edit Formula. Select Functions/Statistical/One Attribute.
Double-click on sum. Move up to Attributes, and double-click on LeastSquares. Click on Apply and
then OK. The sum of the squares will appear in the Value column, as shown below.
You can now move your sliders, and watch the sum of the squares change as you change your quadratic
fit formula. With some practice, you can dynamically find a good approximation of the least squares fit.
Note that almost any spreadsheet will automatically generate the proper regression formula for you. On
the other hand, using Fathom in this way gives you two bonuses:
1) it is an excellent learning tool, and allows the student to see the effects of changing the parameters of
the curve fit formula in real time, rather than just generating a "black box" answer.
2) Fathom can fit any curve that can be defined. Most spreadsheets have a limited number of curves
that can be used for regression purposes.
Fathom Tutorial 19
20 Fathom Tutorial
Fathom Tutorial 21
Double-click on the Number of Women Jurors collection box to open the Inspector. Select the
Measures tab. Double-click on <new> and rename it Ex. Right-click on the box under Formula, and
select Edit Formula. Select Functions/Statistical/One Attribute. Double-click on sum. Move up to
Attributes, and double-click on xpx. Click on Apply, and then OK. The expected number of women
jurors of 3.33333 will appear in the Value column.
A toy tricycle comes with this label: "Easy-To-Assemble. An adult can complete this assembly in 20 min
or less." Thirty-six adults were asked to complete the assembly of a tricycle, and record their times. Here
are the results:
16
17
26
10
13
23
20
18
24
22
19
20
19
17
8
14
21
17
30
29
21
22
22
32
12
16
18
24
28
25
28
21
22
11
15
20
a) Make a normal probability plot of the data. Are the assembly times normally distributed?
b) Find the mean and standard deviation of the data. What is the probability that an adult can complete
this assembly in 20 min or less? What proportion of adults should complete this assembly within 15 to 30
min?
Solution Using Fathom
Launch Fathom, and open a new document, if necessary. Drag a new collection box to the workspace.
Rename the collection Assembly Times, and create 36 new cases. Drag a new case table to the
workspace. Name the first column Times, the second column zTimes, and the third column Quantiles.
Enter the time data in the first column. Sort it into ascending order. Edit the formula in the second
column to zScore(Times). This will calculate the z-scores for the data.
Edit the formula in the third column to normalQuantile((uniqueRank(Times)-0.5)/36/0/1. The 0.5 in
the formula ensures that the normal quantile is calculated for the midpoint of each of the 36 segments of
the distribution. This formula will calculate the expected z-score if the data were indeed distributed
normally. The uniqueRank() function returns the "row number" of the sorted data. Note that most of the
22 Fathom Tutorial
quantile z-scores in the following screen shot are different from the z-scores for the corresponding data.
Your case table will look like the following screen shot.
Fathom Tutorial 23
a) Drag a new graph to the workspace. Drag the Times attribute to the horizontal axis, and the Quantiles
attribute to the vertical axis to generate a normal probability plot. Select Least Squares Fit from the
Graph menu. The linear correlation coefficient for Times and Quantiles is 0.995, indicating that the data
are likely normally distributed.
b) Double-click on the collection to open the Inspector. Choose the Measures tab. Create four measures:
Mean, StdDev, P20orLess, and P15to30. Use the mean, standard deviation, and normalCumulative
functions to calculate the mean, the standard deviation, and the answers to question 3, as shown in the
previous screen shot.
24 Fathom Tutorial