Unit 0 - Statistics Unit Notes Dictated (CLOSED)
Even data that looks qualitative at face value becomes quantitative: a chemical changes to yellow,
but how yellow?
A scientific hypothesis can only be supported or not supported by data, never proven!!!
Data can be skewed through response to stimuli, a requirement of life. Therefore, no study can
remove all noise variables.
Lots of graphs and models. Science conveys the most information in the least amount of text
possible.
Ideally, the amount of data is large enough to represent a good sample size, which should result
in a bell-shaped curve, AKA a normal distribution.
Ideally, the mean should be a perfect representation of the data.
Realities of data:
The problem with plotting the mean: The mean may not accurately represent the data when the
distribution has a positive skew or negative skew.
Problem with the mean: Mean does not indicate spread of data
Rule of thumb: The smaller the spread, the more valid the mean.
Standard deviation represents confidence in the data. A smaller standard deviation means higher
confidence, and vice versa.
Representation of confidence in models happens via error bars, based off standard error, based
off standard deviation. Standard error of mean represents the uncertainty of the mean due to
sample size as well as spread.
Standard error is a better representation of error, taking into account sample size as well as
spread.
As with standard deviation, lower standard error is preferable. A small standard error means the
sample mean is more likely to be close to the true mean.
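To make the link between spread, sample size, and standard error concrete, here is a minimal Python sketch. It assumes the usual definitions (sample standard deviation, and SEM = SD divided by the square root of the sample size); the measurements and the helper name `mean_sd_sem` are made up for illustration and are not from the class data.

```python
import math
import statistics

def mean_sd_sem(values):
    """Return (mean, sample standard deviation, standard error of the mean)."""
    n = len(values)
    mean = statistics.mean(values)
    sd = statistics.stdev(values)   # sample SD (divides by n - 1)
    sem = sd / math.sqrt(n)         # SEM shrinks as the sample size grows
    return mean, sd, sem

# Hypothetical measurements, invented for illustration only.
small_sample = [4.0, 6.0, 5.0, 7.0, 3.0]
large_sample = small_sample * 4     # same values repeated: similar spread, 4x the data

for label, data in [("n=5", small_sample), ("n=20", large_sample)]:
    mean, sd, sem = mean_sd_sem(data)
    print(f"{label}: mean={mean:.2f}  SD={sd:.2f}  SEM={sem:.2f}")
```

The two samples have the same mean and a similar standard deviation, but the larger sample has a noticeably smaller SEM, which is why standard error accounts for sample size as well as spread.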
Data may be valid, but when data is small enough, the question becomes significance.
We know that a mutation may happen; we do not know when, in what cell, or in what gene.
Mutations happen about once every billion nucleotides. Cells have 6 billion nucleotides. There
should be about six mutations per cell cycle.
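The arithmetic behind that estimate, as a tiny Python sketch. The two figures come straight from the notes; the constant names are only illustrative.

```python
# Figures from the notes; integers keep the arithmetic exact.
NUCLEOTIDES_PER_MUTATION = 1_000_000_000   # about one mutation per billion nucleotides
NUCLEOTIDES_PER_CELL = 6_000_000_000       # six billion nucleotides per cell

expected_mutations = NUCLEOTIDES_PER_CELL // NUCLEOTIDES_PER_MUTATION
print(expected_mutations)  # 6
```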
Organisms are ordered, and stand against the entropy of the universe.
In cancer research, cells exposed to carcinogens show a faster rate of mutation.
Standard error bars communicate:
1. How accurately the mean represents the data, accounting for low sample size
The smaller the error bar, the more reliable the data, and vice versa.
Data is significant if results are not due to chance or sample size error.
How does radiation cause cancer? Radiation introduces energy to cell molecules, making them
more likely to do things they would not normally do.
Carcinogens also destabilize cell molecules, though not necessarily via introducing energy into
the system.
Insignificant v Significant
When the SEM is very small, it can be difficult to tell whether the error bars of two groups overlap.
Plotting 2 SE (two standard errors on each side of the mean) makes it easier to ascertain overlap.
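A rough Python sketch of the overlap check. It assumes the common classroom rule of thumb that non-overlapping mean ± 2 SE bars suggest a significant difference, while overlapping bars suggest the difference may be due to chance; this is a visual shortcut, not a formal test. The data and the function names `two_se_interval` and `intervals_overlap` are hypothetical.

```python
import math
import statistics

def two_se_interval(values):
    """Return (low, high) for mean +/- 2 standard errors."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean - 2 * sem, mean + 2 * sem

def intervals_overlap(a, b):
    """True if two (low, high) intervals share any values."""
    return a[0] <= b[1] and b[0] <= a[1]

# Hypothetical measurements, invented for illustration only.
control = [10.0, 11.0, 9.0, 10.0, 10.0]
treated = [14.0, 15.0, 13.0, 14.0, 14.0]
similar = [10.5, 11.5, 9.5, 10.5, 10.5]

for name, group in [("treated", treated), ("similar", similar)]:
    overlap = intervals_overlap(two_se_interval(control), two_se_interval(group))
    verdict = "bars overlap" if overlap else "bars do not overlap"
    print(f"control vs {name}: {verdict}")
```

With these made-up numbers, the treated group's bars sit clear of the control's, while the similar group's bars overlap them.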
Experimental design
Negative control group: Is not exposed to the experimental treatment. Expected to show no response
to treatment. Tests the influence of external factors. Placebo.
Positive control group: Exposed to independent variable. Provides an expected / known result.
Where negative gets placebo, positive gets ibuprofen. Negative is far more common.
"It's an outlier" is no longer the first claim we can make; such a claim must be backed by
statistical analysis.
Alternate hypothesis: assumes the relationship between the variables is real, and not a product of
chance. Significant data. Designated Ha.
In order to move forward with the alternate, we must first reject the null hypothesis (that the
results are due to chance). Rejecting the null implies there is a scientific explanation of the
phenomenon.
Chi-Square
Chi-square is a statistical analysis test to determine whether the discrepancy between the observed
and expected data is significant.
If the discrepancy is not significant, we have failed to reject the null hypothesis, and the
discrepancy can be attributed to random chance.
If the discrepancy is significant, we have rejected the null hypothesis and may argue that the
discrepancy is due to scientific phenomena.
A null hypothesis may only ever be used when you have expected results.
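Chi-Square equation:
\chi^2 = \sum \frac{(O - E)^2}{E}
where O is the observed value and E is the expected value for each category.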
The chi-square value (χ²) is compared to the critical value.
If chi-square is less than the critical value, the discrepancy is not significant. Therefore, we have
failed to reject the null.
If chi-square is greater than the critical value, the discrepancy is significant. Therefore, we have
rejected the null.
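The comparison can be sketched in Python as follows. This assumes the standard chi-square statistic (the sum over categories of (observed - expected)² / expected) and a critical value of 3.84, which is the usual table value for 1 degree of freedom at p = 0.05. The notes do not state the degrees of freedom or the p-value, so both are assumptions here, as are the coin-flip data and the function name `chi_square`.

```python
def chi_square(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Hypothetical example: 100 coin flips; the null hypothesis is a fair coin.
observed = [60, 40]   # heads, tails actually counted
expected = [50, 50]   # heads, tails predicted by the null

# Assumed critical value: 1 degree of freedom (2 categories - 1) at p = 0.05.
CRITICAL_VALUE = 3.84

chi2 = chi_square(observed, expected)
if chi2 > CRITICAL_VALUE:
    decision = "reject the null"
else:
    decision = "fail to reject the null"

print(f"chi-square = {chi2:.2f} -> {decision}")
```

Here each category contributes (10)² / 50 = 2, so chi-square is 4.00, which is greater than 3.84, and the null is rejected. With more categories, the degrees of freedom and the critical value change, so the value must be looked up in a chi-square table.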