Bio111 Lab Manual 1 Fall 2021
Bio111 Lab Manual 1 Fall 2021
INTRODUCTION
Webster’s New World Dictionary (2010) defines science as “systematized knowledge derived
from observation, study, and experimentation carried on in order to determine the nature or principles of
what is being studied; the systematized knowledge of nature and the physical world.” So, science seeks
to understand the physical world. Therefore, questions about or dependent upon supernatural
phenomena (by definition), or questions regarding ethics or morality, are outside the domain of science.
Likewise, Webster’s definition specifies a method of inquiry – observation, study, and experimentation.
You may know this process as ‘the scientific method.’
In fact, there is no single, monolithic ‘scientific method.’ There are several ways to gain
information about the physical world through systematized observation, study, and experimentation. In
fact, for some questions, observation may be sufficient. These are called ‘descriptive studies.’ For
instance, to answer the question “do humans have tails?” you just have to look – there is no true
‘experiment’ that is conducted. Of course, the validity of the conclusion is still dependent on the rigor
and completeness of the investigation, and rigorous observations may lead you to modify your question
to be more specific. If you simply use common sense and your own biased impressions (i.e., without
applying systematic observation), you might say “of course not – humans do not have tails.” That is
why common sense is not science – it is neither rigorous nor specific. (For instance, common sense
led humans to believe that the Earth was flat, stationary, and the center of the solar system.) If you
observed humans in a systematic, unbiased way, you would find that almost all humans are born
without external tails. However, there are a few individuals who are, indeed, born with a small external
tail. And, barring other rare mutations, human embryos have post-anal tails that develop into internal
tailbones. So, observations of humans at different stages of development, and observations of internal
anatomy, might lead you to answer this simple question about the physical world in a rather specific
manner couched in qualifiers: “Like all Chordates, human embryos typically develop a post-anal tail.
Typically, this develops into post-anal coccygeal vertebrae. At birth, these tail bones usually do not
protrude from the body as an external tail, except in very rare cases.” Therefore, many facts in science
are ‘just’ observations; but they are observations taken in an unbiased, systematic manner. You might
think humans do not have tails, but that is because your nonscientific impressions were based on
biased considerations of only the external anatomy of most postnatal humans. In addition, external tails
are often surgically removed shortly after birth, so observations of older children and adults gives a bias
impression of the frequency of this condition at birth.
Our definition then lists study as a scientific method of inquiry. This is the least specific of all,
and demands further elaboration. In science, we equate ‘study’ with logical, rational thought based on
the principle of natural causality. This means that when we observe a phenomenon in the physical
world and we decide to try to explain it scientifically, we seek a rational explanation for that occurrence
as a function of natural causes. We do this for two reasons. First, we have no reliable, repeatable
method for inquiries based on supernatural explanations, and no way to distinguish between alternate
supernatural ‘hypotheses’ with evidence from the physical world. (Did Yahweh make it rain? Zeus?
2
Vishnu? Odin? There is no evidence from the physical world that can distinguish one supernatural
hypothesis from the other; as such, it is a moot issue scientifically). A second reason for seeking a
rational explanation is that describing the natural causality of events can potentially give us the ability
to control those events by inducing these cause-effect relationships ourselves. In other words, science
WORKS…. It is a useful, predictive tool for understanding and manipulating the physical universe.
Through science, for example, humans have come to understand electricity. We can control it and
cause electrical impulses to pass through thin pieces of metal—causing it to glow. Although many
questions lie outside the domain of science, science is the most appropriate and reliable tool to gain a
correct understanding of cause-effect relationships about the physical universe. The first step in
assessing causality is to describe relationships between variables; ‘when ‘X’ happens, it seems that ‘Y’
then occurs’. This is a correlational pattern – the occurrence of one event correlates with the
occurrence of another. However, this DOES NOT define causality… event ‘Z’ could cause both ‘X’ and
‘Y’ and that could be why X and Y co-occur.
Also, it is important to realize that quantification is typical of a scientific approach. In science,
we often represent what we observe in mathematical terms because these can be relatively precise
and less sensitive to subjective interpretation than other languages. Although scientific facts may also
be described in prose (‘narrative data’), or as relative, semi-quantitative categorical terms (‘high,
medium, low’), eventually a more precise study will demand quantitative data. Biology is a science,
and you must be comfortable with mathematics to do it successfully.
The final method mentioned in the definition is experimentation. The goal of an experiment is
to create a situation in which a potential causal link between two correlated variables can be tested. To
do this, other variables must be controlled, randomized, or measured so that their potential effects can
be nullified or described. There are several types of experiments, too; from laboratory experiments to
field experiments, to post-dictive experiments. Laboratory experiments provide the greatest control of
the experimental environment, and so give the greatest sensitivity for observing a causal relationship
between two variables. However, the goal of science is to describe how the physical (i.e. ‘natural’)
world operates, and the laboratory environment may be so unnatural that what you observe is a
function of this unnatural environment itself, and is not what actually happens in the ‘real world.’ Such
results are called laboratory artifacts. Suppose you are studying the ecological interactions between
spiders and lizards on islands in the Caribbean, and you systematically observe that there are few
spiders on islands that have many lizards. You hypothesize that the lizards are eating the spiders and
reducing spider abundance. You bring the system into the laboratory and you conduct an experiment
demonstrating that lizards do eat the spiders. But, maybe the lizards eat the spiders in the laboratory
because there is nothing else to eat. On the islands, maybe lizards rarely eat the spiders; rather, they
out-compete spiders for flies. Lots of lizards mean fewer flies for spiders to eat, and spider populations
decline. Some good field observations could have prevented this premature lab experiment.
Field experiments are more realistic, but the investigator cannot control (or even measure) all of
the variables that are fluctuating in the environment. As such, it may be difficult to ascertain exactly
which variable is responsible for the response you measure. Finally, post-dictive experiments allow us
to test hypotheses about the past. The typical post-dictive hypothesis is: if ‘X’ happened in the past,
then ‘Y’ should be present now. (It is best if you have not observed ‘Y’ yet… otherwise you may
unconsciously be trying to explain ‘Y’ rather than testing the effects of ‘X.’) For instance, the ‘Big Bang’
hypothesis predicted that there should still be a radiational ‘echo’ of the universal explosion, and that it
should be about 2-3oK (degrees above absolute zero temperature, on the Kelvin scale). Two scientists
at Bell Labs, unaware of this prediction, kept finding ‘background radiation’ of this magnitude, no matter
which direction that they placed their antenna. They had confirmed a prediction of the Big Bang
hypothesis…and with subsequent confirmations of other predictions, this idea was elevated to the
status of a Theory – an explanatory model of how the physical world operates that has been
tested and supported by numerous, independent experiments. Evolutionary Theory is another
example. In 1859, Darwin suggested that species alive today are descended from common ancestors
that lived in the past. Essentially, all of life is one big family tree, with some species closely related (like
siblings in a nuclear family that share the same parents), and others more distantly related (like second
3
cousins that share great-grandparents). Based on anatomical similarities, Darwin hypothesized that
humans were more closely related to other primates than to cats, birds, or fish. This hypothesis of
relatedness was tested and confirmed in the 1970’s by looking directly at DNA similarity among existing
species.
All true experiments – whether laboratory, field, or post-dictive – involve the rigorous
identification of variables. Today’s first activity will give you some practice identifying and categorizing
variables. Then, you will create and conduct experiments with the response of crickets to light. You
will analyze your data statistically (employing quantification) and draw conclusions based on an
understanding of probability. Throughout the work, you should also gain an appreciation for the fact
that scientific investigations are iterative (you have to keep testing and modifying your hypothesis until it
explains ALL of the related data - a single experiment is not sufficient), cooperative (since no one
person can do ALL of the relevant experiments related to a hypothesis, the sharing of data among
investigators is crucial), and tentative (you will never know for SURE that your hypothesis is absolutely
true since it can never be tested in every conceivable way that controls or excludes every other
variable).
ACTIVITY A: VARIOUS VARIABLES
Introduce yourself to the person sitting next to you – you will be lab partners for today. Read the
following description of a hypothetical experiment conducted to determine the value of a new anti-
cancer drug. Then, with your partner, answer the questions that follow. If directed to do so, review
these answers as a class.
You and your partner are the scientific advisors to the Board of Directors of Pilco, a fledgling
pharmaceutical corporation. Pilco has invested a significant percentage of their research budget in a
1.3 million dollar grant to Dr. Marsh, a medical research scientist at Massachusetts General Hospital.
Dr. Marsh is conducting the first human trials on the value of a new drug, X136. In previous
experiments using human colorectal carcinoma (cancer) cell lines, X136 reduced the rate of malignant
cell division by 75% compared to untreated cells. Trials with mice also demonstrated an anti-cancer
effect. She is hopeful that this drug may provide a non-surgical cure for colon cancer. Pilco was
encouraged by the previous results with the drug. They know that colon cancer is one of the most
common cancers in humans, so there is a large market for a successful, non-invasive treatment. In
other words, a truly effective drug could make the company a lot of money and save many lives.
However, if the experimental results are dubious and they promote an ineffective drug, some patients
may die, the public will lose faith in the company, and their business will fail (not to mention U. S. Food
and Drug Administration liabilities). There is a lot riding on your advice. You must advise the board of
directors regarding: 1) the reliability of the results – was the experiment sound? and 2) what do the
results suggest? Should Pilco invest in the process of seeking FDA approval and the manufacture,
marketing, distribution, and sales of this drug?
The Experiment:
Dr. Marsh identifies 300 patients who are otherwise healthy but have early stages of colon
cancer. She selects the 250 patients who are between the ages of 25-55 for screening. She asks them
if they would like to be in an experiment for a new anti-cancer drug. Even though they are told that they
might not receive the drug in the experiment, 205 agree to participate. Dr. Marsh randomly selects 100
of these individuals for the experiment. She and her colleagues conduct a colonoscopy on each patient
and count the number of malignant polyps seen in a particular region of the colon. Fifty patients are
randomly assigned to the treatment group and will receive the drug in a pill they will take every day for
the next three months. The other fifty are assigned to the ‘placebo’ group, and they will receive a pill
every day that does not contain the drug. The patients do not know to which group they have been
assigned; they are ‘blind’ to the treatment. In addition, these assignments were made by Dr. Marsh’s
graduate student. As such, Dr. Marsh is also ‘blind’ to which patients are in which treatment group.
This is called a ‘double-blind’ experiment. Dr. Marsh tells them to eat their normal diet and maintain
their typical daily routine, and she meets with each patient once a month to see how they are doing. All
subjects say they have taken their pill every day.
4
After three months, Dr. Marsh conducts another colonoscopy on each patient and again counts
the malignant polyps. Now that the data are collected, she can look and see to which group the
patients were assigned. She tabulates the number of patients from each group that had at least 5
fewer polyps after three months, about the same number of polyps (within 5 of the original count), or at
least 5 more polyps than in the original sample. Her results look like this:
1. What does Dr. Marsh measure, as an indicator of the effectiveness of her pill? (This is called
the dependent variable, because the value may depend upon the treatment.)
2. What variable is purposefully changed between the groups by the experimenters? (This is
called the independent variable, and it is the hypothesized ‘cause’ of changes in the
dependent variable.)
3. What variables does Dr. Marsh either hold constant or allow to vary in a narrow range? (These
are controlled variables.)
4. What variables are randomized across the experiment? Why control some variables and
randomize others?
6. What is the purpose of the placebo group? Why do they also receive a pill? Could the
decrease in polyp number in the placebo group be biologically interesting? How would you
test for a ‘placebo effect?’
8. Are you confident that the drug X136 has anti-cancer properties? If there was only a 1%
probability that similar groups would differ as much as these, would you be more confident
that the drug has an effect?
9. Suppose that, when the patients were assigned to groups, all the men were assigned to the
X136 group and all the women to the placebo group. Now, can you attribute the difference in
response solely to the effect of the drug, or do the groups now also differ systematically in
another variable? (This is called a confounding variable).
10. Why employ a blind and double-blind experiment? Explain in terms of confounding
variables.
6
Procedure:
1. Examine your arena. The top is clear plastic and can slide out. There is also an entry port on one
side, with a stopper. Also note a black sleeve, and a wooden rod that fits in the sleeve. There are
three slots in the back of the arena so that you can divide the arena in half or into thirds, and you
have two metal dividers that fit these slots. You will insert the dividers at the end of a run so that
you can count crickets at each end of the maze without them moving back and forth. (Here are
some of the choices you have made so far: this arena… its size, color, composition, etc.; this room
and all of the environmental conditions in it.)
2. Position the arena on the table, under a light source. Cover the half of the arena closest to the light
– this creates a shady region under this cover. Align the arena and cover so that half of the arena
is shady and half is lit. Put a piece of tape on the table to note this position. (Choices made: this
type of light: its quality and quantity. This type of shaded spot: its size, position, and amount of
reflected light.)
3. Pick up the arena, open one end, and shake in the crickets. Be quick, and do not bother to count
the crickets entering the box! Grab any stragglers or escapees and add them through the ‘cork-hole’
in the front of the arena. Give the arena a sideways shake to randomize the position of the crickets
in the box. Quickly place the arena in position, with the cover in position, too. Note the time.
(Choices made: Dumping in the crickets through the top of the box; shaking the box; using multiple
crickets at once.)
4. You will terminate the experiment after five minutes by sliding a divider into the center slot. In
the meantime, observe your study organisms. Note that males have two terminal ‘spines’ coming
off their abdomen – these are cerci. Females have shorter cerci, but a very long spine between
them. This is the ovipositor – an egg-laying device. (choices made: five minute interval; both sexes
present in box at the same time).
5. After five minutes, slide the divider into position and count the number of crickets in each end of the
maze. Place your data in the results table on the next page:
6. What do you conclude? How sure are you that this pattern was not just ‘dumb luck?’ In other
words, how likely is it that crickets DO NOT typically gather in one area over the other, but just by
chance ended up sorting themselves as they did? Knowing this particular probability is very
important. If you conclude that crickets move to one area over the other, but in fact this pattern is
due to chance, then you are wrong. This type of error can have grave consequences. Think about
Dr. Marsh and the Pilco drug company. Suppose they claim that the difference between the
treatment group and the placebo group was caused by their treatment – in other words, they
conclude that the drug works. However, suppose this difference was due to chance. They are
WRONG. The drug doesn’t work, their reputations will be shot, and they will be sued. Hmmm….
Knowing the probability that chance could be responsible for your pattern becomes pretty important!
They should be conservative; they should only say the drug works if the probability that chance
caused the pattern is very low (< 1%). This is the function of most statistical tests: determining the
probability that the pattern you observe could be caused by chance. Also, think about the
physicians who read Dr. Marsh’s report in the Journal of the American Medical Association, and
then have to decide whether to prescribe this drug or not, based on their interpretation of her
results. These physicians need to know something about statistics and probability, too!
8
Methods Summary: Crickets were added to an arena and the arena was gently shaken to
randomize their position. The arena was placed beneath a light source and partially covered,
creating a shaded region and a well-lit region within the arena. After five minutes, the number of
male and female crickets at each end of the maze was determined. The frequency distribution of
total crickets in shaded and lit regions was compared to a 1:1 distribution that we would expect if
there were no tendency for congregation in one of the regions, using a Chi-squared Goodness-of-
Fit test.
8. Read ‘The Researcher’s Toolkit #1: Comparing Observed and Theoretical Distributions
Using the Chi-squared Goodness-of-Fit Test’ on page 87. Then, fill in the table below.
Calculate your Chi-squared value, and compare it with the critical values in the toolkit.
Expected Distribution under this testable hypothesis: If there were no preference, you
would expect 10 in the shaded region and 10 in the lit region. So, to calculate your expected
values, divide your total sample size by 2. Enter your expected and observed values in the
table below.
Table comparing observed and expected distributions
Obs. Exp. O-E (O - E)2 (O – E)2/E
Shaded
Lit
(sum) - - X2 =
How likely is it that any differences occurred by chance? (> 5% or < 5%) _______
Conclusion:
Variables:
Dependent:
Independent:
Controlled:
Uncontrolled:
Confounding:
9
squared difference by some index of sample size. Statisticians divide the squared difference by the
expected value to correct for sample size. So, our value, standardized for sample size, is:
Outcome Category Obs. Exp. O-E (O - E)2 (O – E)2/E
Heads 13 10.5 2.5 6.25 6.25/10 = 0.625
Tails 7 10 -2.5 6.25 6.25/10 = 0.625
(sum) 21 21 1.25
But what does 1.25 mean? This is our Chi-squared (X2) value, and statisticians have determined
how likely a value of at least 1.25 is, just by chance. In fact, they have determined how likely every X2
value is, just by chance, and have tabulated particular values in a Chi-squared table:
Probability
# Categories “df” 0.5 0.1 0.05 0.01 0.001
2 1 0.455 2.706 3.841 6.635 10.828
3 2 1.386 4.605 5.991 9.210 13.816
4 3 2.366 6.251 7.815 11.345 16.266
Common ‘critical’ values for evaluating tests with two outcome categories (‘heads’ and ‘tails’) are
3.841, which will occur by chance 5% of the time, and 6.635, which will occur by chance only 1% of the
time. (REMEMBER: the larger the chi-square value, the greater the difference between observed and
expected values and the LESS LIKELY it is that random chance could produce a difference this large.)
In scientific experiments, it is common to use either the 5% or 1% criteria for evaluating hypotheses.
Our value is 1.25. In the table; using the 2 outcome categories row, find where 1.25 would be. Now,
read the column headings – a value of 1.25 would occur by chance between 10% and 50% of the time,
just by chance. So, a perfectly balanced coin would yield a 13:8 ratio 10-50% of the time. Our
conclusion is that this is a pretty common result for a perfectly balanced coin. So, this study has failed
to show that the coin is unbalanced. If we had done a larger study with a greater number of tosses, we
may have found that the coin is indeed slightly unbalanced, but our study has failed to show this. We
have not proven the coin is balanced, we have simply failed to show that it is unbalanced.
There are a few things you need to keep in mind about the Chi-squared Goodness-of-Fit test:
1) You can compare two or more observed distributions to one another, to test for differences
among various observed distributions. (This is called the Chi-squared Test of Independence,
and is described in a later ‘Researcher’s Toolkit’).
2) Your data must be count (integer) data. If the values you put into your observed outcome table
column contains percentages, fractions, or decimal points, you cannot use the Chi-squared
Goodness-of-Fit test. The expected values, however, may have decimal points.
3) The Chi-squared Goodness-of-Fit test may indicate there are differences between the observed
and expected distributions, but it may not indicate which observed values are significantly
different from which expected values. For instance, if you have three categories (blue, green,
brown), and conclude their distribution does not fit a particular theoretical one, you don’t which
color outcome(s) gave rise to the discrepancy. All you know is that a discrepancy exists
somewhere.
4)
11
Middle area 5
Shaded area 5
Make sure you include the units of measurement in your table. They need only be noted once at
the top of the appropriate columns in the table. (‘min.’ in the second column, above, symbolizes
‘minutes.’) It may be obvious to you now that time was measured in minutes, but how about in 3 years
when you return to these data? Would someone reading your table be able to guess this unit if it were
not explicitly indicated? Better that your information is overly specific, rather than too vague.
Choose the units of measurement so that they can be conveniently written and read. For example,
if you were measuring the mass of the crickets, do not use units of kilograms when grams or milligrams
would be more appropriate. Thus, write 2.3 g instead of 0.0023 kg, and write 18.7 ml instead of 18,700
µl. A long string of zeros is easy to misread or miswrite.
For analyzing data, sometimes a particular table structure is necessary. For instance, you need to
set your table in a particular way in order to conduct the Chi-squared Goodness-of-Fit test:
Outcome category Obs. Exp. O-E (O - E)2 (O – E)2/E
Shaded
Lit
(sum) - - X2 =
However, YOU SHOULD NOT USE SUCH A TABLE TO COLLECT DATA. Doing so will cause the
loss of information that could be important later, such as the time interval and the numbers of males
and females. So, remember that there are tables for collecting data, analyzing data, and reporting data
(that’s next).
14
When reporting data in a formal paper, you may need to restructure the table to reduce some of
that redundancy. For instance, with the insect location data above you would not need to include a
column for ‘Elapsed Time’ in the table. Rather, you could refer to the elapsed time in the title of the
table. It is accepted practice to include a title for the table and place it above the actual table. The title
should be clear enough that the table is generally understandable without having to refer to the text of
the paper itself. But the text of the paper must include mention of the table, directing the reader to it.
Give some thought about how to arrange the data in the table in the most concise and readable
manner. A well designed table can actually help you to see relationships in the data. This is especially
true when large amounts of data are reported. Rarely are raw data reported. Rather, tables of means,
standard deviations, or statistical results are often reported because these directly show the patterns in
the data.
Here are examples of a poor table and a good table, using the same hypothetical data.
male hi 0.053
male lo 0.034
female lo 0.018
female hi 0.026
female lo 0.017
male hi 0.042
male lo 0.027
female hi 0.036
Male 34 53
27 42
15
dark, not too washed out. The diaphragm and light control need to be readjusted for each objective
you use.
Calibrating the ocular micrometer:
1. Arrange the stage micrometer for viewing under low power, and move the slide so that the scale of
the ocular micrometer is seen superimposed on the scale of the stage micrometer. The actual
distance between the smallest adjacent lines of your stage micrometer is 0.01 mm (or 10 microns).
Its entire length is 1.00 mm. Align the left ends of the stage micrometer ruler and the ocular
micrometer ruler (see figure).
2. Using the 4X objective, 30 units on the ocular micrometer corresponds to what length on the stage
micrometer ruler? To what actual distance (using the 4X objective) does one unit on the ocular
micrometer correspond? For example, if 30 ocular micrometer units span a distance of
0.6 mm on the stage micrometer (as shown in the figure below), then each ocular micrometer unit
equals 0.6 mm/30, or 0.02 mm. A more convenient unit for microscopic distances is the micron
(µm), which is 1/1000 of a mm. Thus, in this example each ocular micrometer unit would
correspond to 20 µm. Record this calibration factor (in this example, it would be 20 µm per
micrometer unit).
3. Repeat this type of calibration for the 10X and the 40X objectives.
4. Convert ocular micrometer units to microns by multiplying by the appropriate calibration factor you
determined in steps 2-3. For example, if the calibration factor of the 40X objective were 3.0 µm per
micrometer unit, and the object viewed were measured to be 4 1/2 ocular micrometer units in
diameter, the object diameter would be 4.5 X 3.0 = 13.5 µm.
1 2 3 4 5 6 7
17
in magnitude, or that are related to one another by exponential, or power relationships. Thus, a plot of
y=10X is a curve on a normal graph, but becomes a straight line on a semi-log graph.
BAR CHARTS:
1. A baseline is really not needed, since the base of the bars form a visual baseline.
2. The bars should be wider than the spaces between them. Use white, black or dark gray as the
filling pattern of the bars. Avoid using bold stripes, which are distracting, or detailed patterns which are
hard to distinguish from one another.
Below are examples of good graphs and a chart.
21
Computations:
22
1. So, the first step is to calculate the average value for each sample:
(Xa = group mean; xa = each data value in sample A, na = number of data values in sample A.)
2. Calculate the Standard Error of the Mean (SEM) of each mean, which is the average difference
between the mean and the data points - describing the "spread" about the mean. Since we square
each difference (to avoid negative values), we must take the square-root of the final calculation so that
the SEM is in the same units as the mean. Repeat for SEMb, using Xb, xb, and nb.
3. Compute a t value, which is the ratio we want; the difference between the means in relation to the
variation within the samples:
4. Now, we compare this calculated t value with a critical value from a statistical table, p = 0.05. If your
calculated value is greater than the critical value, then you reject the null hypothesis of equality and
accept the alternative: there is a statistically significant difference between your means.
Sum of both Sum of both Sum of both
Critical t Critical t Critical t
group sizes group sizes group sizes
4 4.30 11 2.26 18 2.12
5 3.18 12 2.23 19 2.11
6 2.78 13 2.20 20 2.10
7 2.57 14 2.18 25 2.07
8 2.45 15 2.16 30 2.05
9 2.37 16 2.14 40 2.02
10 2.31 17 2.13 50 2.01
There a few things to keep in mind about the Student’s t test.
Only two groups can be compared. It is not valid to compare numerous data groups to each other
by repeating the Student’s t test for each comparison. Each data group can be included in only one
Student’s t test.
The Student’s t test assumes that both data groups are normally distributed (i.e., each distribution
follows roughly a bell-shaped curve).
The Student’s t test assumes the variation of the two data groups are roughly equal.
23
A quick look at the row and column totals will show you that, indeed, neither the A gene nor the B gene
was produced exactly ½ the time. But those aren’t the hypotheses we are testing here. We want to
know if the genes assort independently. So, we calculate our expected frequencies by using the
product rule.
Under the hypothesis of independent assortment:
1) the frequency of AB = f(A) x f(B) x N; f(A) = 60/100, f(B) = 58/100, and N = 100
2) f(AB) = 60 x 58 x 100 = (60 x 58 ) = (Row Tot. x Col. Tot.) = 34.8
100 100 1 100 Grand Total
If these genes assorted independently, we should expect to see 34.8 AB offspring. This expectation is
based SOLELY on the hypothesis of independent assortment; no assumptions about the frequencies of
A or B are made, because we used the ACTUAL, OBSERVED frequencies of A and B (not a guess at
½ for each). Now, if the observed results do not fit the expectations from our hypothesis, there is only
one reason – the single hypothesis of independent assortment must be wrong.
Next, compute the other three expected ratios the same way: (RT x CT)/ GT
3) f(Ab) = (60 x 42) / 100 = 25.2
f(aB) = (40 x 58) / 100 = 23.2
f(ab) = (40 x 42) / 100 = 16.8
Then compare these calculated expected values with the observed values, using the same formulae as
the Goodness-of-Fit test.
Phenotype Observed Expected (o-e) (o-e)2 / e
AB 40 34.8 5.2 0.78
Ab 20 25.2 5.2 1.07
aB 18 23.2 5.2 1.17
ab 22 16.8 5.2 1.61
SUM = 4.63
Next, compare your calculated value with the critical value from the table (pg. 14). For the test-of-
independence, however, the degrees of freedom as computed based on the number of columns and
rows in the contingency table. Df = (c – 1)(r – 1) = (2-1)(2-1) = 1.
The critical chi-square, with df = 1 and p = 0.05, = 3.84. Your calculated value is greater than the
critical value, so independently assorting genes would only produce a deviant pattern like yours less
than 5% of the time. Your observed results are very unlikely for independently assorting genes. So, the
correct conclusion, at the 95% confidence level, is to reject the hypothesis of independence.
25