0% found this document useful (0 votes)

31 views10 pages

Unit 0 - Statistics Unit Notes Dictated (CLOSED)

Uploaded by

Orion Gjoni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views10 pages

Unit 0 - Statistics Unit Notes Dictated (CLOSED)

Uploaded by

Orion Gjoni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

Class 8/26 statistics unit starts

Importance: Why stats and data?

Science is a discipline based on a collection of quantifiable data.

Science can only ask questions answered by quantifiable data

No qualitative data of any kind e.g. Theology, philosophy

Even qualitative data at face becomes quantitative change in chemical to Yellow but how
yellow?

Scientific hypothesis can only be supported or not supported by data never proven!!!

Structure defines function of everything.

Science is not old; understanding limited by technology.

There is no law in biology, unlike chemistry or physics.

Validity & significance of data

Data is not always true.

Trust, but verify.

Verification will happen via statistical analysis.

End class 8/26.

Class 8/28 Validity and Significance of
Data
Validation & statistical analysis

Not all data is significant or valid, as data can be skewed / flawed.

Conclusions are valid only when supported by valid / significant data.

How to validate data? Statistics

Data can be skewed through response to stimuli, a requirement of life. Therefore, no study can
remove all noise variables.

Statistics remove significant data from noise.

Collection of v&s data requires multiple trials.

When writing procedures always minimum three trials

Greater sample size, better and v&s data

Lots of graphs and models. Science conveys the most information in the least amount of text
possible.

One model / graph > multi- page writeup

Only ever is the mean plotted

Example: Enzymatic activity in the face of temperature swings

Three trials at room temperature plot 1 points: Average of Trials

Three trials at high temperature plot 1 points: Average of Trials

Three trials at low temperature plot 1 points: Average of Trials

Total number of points: 3

Ideally, the amount of data is large enough to represent a good sample size, which should result
in a bell-shaped curve, AKA a normal distribution.
Ideally, the mean should be a perfect representation

The mean is the average across multiple trials.

One point for variable change

Dotted line is the meaning

Consider organisms like ecosystems: Limited resources.

Biggest limited resource is energy.

Realities of data:

Very perfect, unlikely to fit a normal curve exactly

The problem with plotting the mean: Me may not accurately represent the spread, leading to a
positive skew or negative skew

Skew is mitigated via error bars

Problem with the mean: Mean does not indicate spread of data

Data spread impacts validity of the mean as a representative of the data.

Rule of thumb: Be smaller the spread, the more valid the mean.

Quantify via deviation/error

Used to determine the spread of data points from the mean

Equation of standard deviation

Equation of standard error

Standard deviation represents confidence in the data. Smaller standard deviation means higher
confidence, vice versa.

Confidence does not equal proof

Representation of confidence in models happens via error bars, based off standard error, based
off standard deviation. Standard error of mean represents the uncertainty of the mean due to
sample size as well as spread.

Standard error is a better representation of error, taking into account sample size as well as
spread.

As with standard deviation, lower standard error is preferable. Small standard error means the
likelihood of the mean being correct increases.

The mean is meant to be a representative of a population.

End class 8/28

Class 9/3 Error Bars

Error bars in bar graphs
Error bars in line graphs

Error bars and significance

Data may be valid, but when data is small enough, the question becomes significance.

We know that a mutation may happen, we do not know when, what cell, what gene.

Mutations happen every billion nucleotides. Cells have 6 billion nucleotides. There should be six
mutations per cell cycle.

Organisms are ordered, and stand against the entropy of the universe

In cancer research, these cells exposed to carcinogens show a faster rate of mutation
Standard error bars communicate

1. How accurately the mean represents the data accounting for low sample size

The smaller the error bar, the more reliable the data, vice versa

2. How likely is it that there is a significant data between data sets.

What is a significant difference

Data is significant if results are not due to chance or sampling size error

How does radiation cause cancer? Radiation introduces energy to cell molecules, making them
more likely to do things they would not normally do.

Carcinations also destabilize cell molecules, though not necessarily via introducing energy into
the system.

If error bars overlap, the data is not significant.

Insignificant v Significant

Benefit to using 2SE

When the sem is very small, it can be difficult to distinguish if there is overlap.
2SE makes it easier to ascertain overlap.

Always mention if graph uses SE or 2SE

Experimental design

Key components of experimental design:

1. Independent variable: Variable that is manipulated. Plot on x-axis

2. Dependent variable: Variable that is measured. Plot on y-axis
3. Control group: Group in comparison to experimental. Independent variable is unchanged
dependent variable is measured.
4. Experimental group: Group in which independent variable is manipulated, dependence
variable is measured.
5. Controlled variables: All factors that stay constant between control and experimental
6. Additional components: Large sample size, repeatable procedure, multiple trials.

Positive v negative controls

Negative control group: Is not exposed to the experimental treatments. Provides no response to
treatment. Tests influence of external factors. Placebo.

Positive control group: Exposed to independent variable. Provides an expected / known results.
Where negative gets placebo, positive gets ibuprofen. Negative is far more common.

End class 9/3

Class 9/9 Null/Alternate and Chi-Square
Hetero-hetero cross, ends w/ 75% dominant, 25% recessive

In a population of 200, 150 would be dominant, 50 would be recessive.

Outlier is no longer the first claim we can make; such a claim must be backed by statistical
analysis.

The two kinds of hypotheses

Null hypothesis: assumes the variable has no effect, and that all observations are a product of
chance. Insignificant data. Designated H0.

Alternate hypothesis: assumes the variable & relationships are true, and not a product of
chance. Significant data. Designated Ha.

In order to move forward with the alternate, we must first reject the null, which implies there is a
scientific explanation of the phenomenon.

Chi-Square
Chi-square is a statistical analysis test to determine the significance between the observed and
expected data.

If the discrepancy is not significant, we have failed to reject the null hypothesis, and thus the
discrepancy is due to random chance.

If the discrepancy is significant, we have rejected the null hypothesis and may argue that the
discrepancy is due to scientific phenomena.

A null hypothesis may only ever be used when you have expected results.

Chi-Square equation:
The chi-square value (x2) is compared to the critical value.

Critical value is given by a chart, readable by degrees of freedom and p-value.

If chi-square is less than the critical value, the discrepancy is not significant. Therefore, we have
failed to reject the null.
If chi-square is greater than the critical value, the discrepancy is significant. Therefore, we have
rejected the null.

Generally, the accepted p-value is 0.05, unless stated otherwise.

Degrees of freedom is possible states minus 1.

Steps in a chi-square test

1. Determine null hypothesis
2. Count observed values
3. Determine expected values
4. Calculate the chi-square value
5. Calculate degrees of freedom
6. Select p-value
7. Identify critical values
8. Compare chi-square to critical value
9. Reject, or fail to reject, the null hypothesis.
End class 9/9

End Statistics Unit

Laboratory Quality Control
50% (2)
Laboratory Quality Control
19 pages
41-47 Introductory Biostatistics Notes - Osmosis
No ratings yet
41-47 Introductory Biostatistics Notes - Osmosis
136 pages
Quality Work Life
No ratings yet
Quality Work Life
12 pages
Basic Statistics For Health Sciences
91% (11)
Basic Statistics For Health Sciences
361 pages
Statistics in Details
100% (2)
Statistics in Details
283 pages
ZYJ260
No ratings yet
ZYJ260
78 pages
Portable Radios: Operating Instructions
100% (1)
Portable Radios: Operating Instructions
47 pages
Unit 8 - TQM
No ratings yet
Unit 8 - TQM
37 pages
Data Visualization Notes Ou
No ratings yet
Data Visualization Notes Ou
125 pages
Cobra C1 FastScanManual
No ratings yet
Cobra C1 FastScanManual
64 pages
Psychology and Other Disciplines
No ratings yet
Psychology and Other Disciplines
5 pages
Biostatistics Notes Part 1
No ratings yet
Biostatistics Notes Part 1
9 pages
Evolution of Stars
No ratings yet
Evolution of Stars
3 pages
Collaborative Learning
No ratings yet
Collaborative Learning
7 pages
Stats For Primary FRCA
No ratings yet
Stats For Primary FRCA
7 pages
Chapter 1 A Preview of Business Statistics
No ratings yet
Chapter 1 A Preview of Business Statistics
3 pages
2-Basic Statistics For Pharmacology Practicals
No ratings yet
2-Basic Statistics For Pharmacology Practicals
38 pages
Things To Know PDF
No ratings yet
Things To Know PDF
56 pages
IB372 FA10 Lab01 Intro Statistics Presentation
100% (1)
IB372 FA10 Lab01 Intro Statistics Presentation
75 pages
Department of Education: Daily Lesson Log (DLL)
0% (1)
Department of Education: Daily Lesson Log (DLL)
2 pages
Graph
No ratings yet
Graph
9 pages
Psychology Advanced RM Workbook New 2019
No ratings yet
Psychology Advanced RM Workbook New 2019
23 pages
Wk06 Topic07 1 - 202307
No ratings yet
Wk06 Topic07 1 - 202307
57 pages
Statistics & Relative Risk (PT 1)
No ratings yet
Statistics & Relative Risk (PT 1)
26 pages
Statistics
No ratings yet
Statistics
28 pages
03.22.2021 - L8 Statistics and Least Square
No ratings yet
03.22.2021 - L8 Statistics and Least Square
71 pages
Statistics in ESS
No ratings yet
Statistics in ESS
34 pages
00 - Inrroduction To Statistics
No ratings yet
00 - Inrroduction To Statistics
30 pages
ML ADK Basic of Statistics 1
No ratings yet
ML ADK Basic of Statistics 1
48 pages
Biostatistics Notes
100% (1)
Biostatistics Notes
8 pages
Session 3 Week 2
No ratings yet
Session 3 Week 2
31 pages
RM Module 3
No ratings yet
RM Module 3
30 pages
MIT9 63F09 Lec04
No ratings yet
MIT9 63F09 Lec04
7 pages
Merged Presentation 8614
No ratings yet
Merged Presentation 8614
290 pages
2 - Biostatistics
No ratings yet
2 - Biostatistics
77 pages
RS1 Final Study Guide
No ratings yet
RS1 Final Study Guide
13 pages
11.inferential Statistics March 24
No ratings yet
11.inferential Statistics March 24
74 pages
Unit+0++Day+3+ +Statistics+and+Ethics
No ratings yet
Unit+0++Day+3+ +Statistics+and+Ethics
21 pages
Basic Biostatistics
No ratings yet
Basic Biostatistics
31 pages
Biostatistics Notes: Descriptive Statistics
No ratings yet
Biostatistics Notes: Descriptive Statistics
16 pages
Fiat Hitachi Excavator Ex135w Workshop Manual
100% (1)
Fiat Hitachi Excavator Ex135w Workshop Manual
22 pages
Topic06. Analysis of Differences
No ratings yet
Topic06. Analysis of Differences
63 pages
CONSCI 3940 Final Exam Review
No ratings yet
CONSCI 3940 Final Exam Review
14 pages
01 Statistics Lesson
No ratings yet
01 Statistics Lesson
35 pages
Chapter 2: Statistical Tests, Confidence Intervals and Comparative Studies
No ratings yet
Chapter 2: Statistical Tests, Confidence Intervals and Comparative Studies
75 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
8 pages
Inferential Statistics
No ratings yet
Inferential Statistics
48 pages
Ajay Kumar Garg Engineering College: 27 KM Stone, Delhi-Hapur Bypass Road
No ratings yet
Ajay Kumar Garg Engineering College: 27 KM Stone, Delhi-Hapur Bypass Road
32 pages
DV Unit 1&2 Notes
No ratings yet
DV Unit 1&2 Notes
50 pages
Statistics
No ratings yet
Statistics
30 pages
Bio Statistics
No ratings yet
Bio Statistics
97 pages
Lecture 2 - MAT361 (21 JAN 2025)
No ratings yet
Lecture 2 - MAT361 (21 JAN 2025)
40 pages
COM 201 - Inferential Statistics - 18032022-1
No ratings yet
COM 201 - Inferential Statistics - 18032022-1
58 pages
Introduction: Hypothesis Testing Is A Formal Procedure For Investigating Our Ideas
No ratings yet
Introduction: Hypothesis Testing Is A Formal Procedure For Investigating Our Ideas
7 pages
Hypothesis Testing - The Scientists' Moral Imperative
No ratings yet
Hypothesis Testing - The Scientists' Moral Imperative
34 pages
Bes Summary
No ratings yet
Bes Summary
11 pages
AP Psych Review Video 1.5
No ratings yet
AP Psych Review Video 1.5
5 pages
Week 2 Quantitative Data Analysis
No ratings yet
Week 2 Quantitative Data Analysis
22 pages
Ten Deadly Statistical Traps in Pharmaceutical Quality Control
No ratings yet
Ten Deadly Statistical Traps in Pharmaceutical Quality Control
70 pages
Biostats 2
No ratings yet
Biostats 2
7 pages
Cusat Btech Ece S8 Syllabus
No ratings yet
Cusat Btech Ece S8 Syllabus
4 pages
A Health Information System: Components, Requirements, Practical Use
No ratings yet
A Health Information System: Components, Requirements, Practical Use
6 pages
Nba Lab Details May 2014
No ratings yet
Nba Lab Details May 2014
38 pages
250 Lec 5 Fall 13
No ratings yet
250 Lec 5 Fall 13
42 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
1.safety Inspection Check List
No ratings yet
1.safety Inspection Check List
2 pages
Expe Finals
No ratings yet
Expe Finals
8 pages
1.1 - Statistical Analysis PDF
No ratings yet
1.1 - Statistical Analysis PDF
10 pages
Learner'S Packet (Leap) : Student Name: Section: Subject Teacher: Adviser
No ratings yet
Learner'S Packet (Leap) : Student Name: Section: Subject Teacher: Adviser
7 pages
Key Statistical Ideas For Research Students v2
No ratings yet
Key Statistical Ideas For Research Students v2
4 pages
Statistics: An Introduction and Overview
No ratings yet
Statistics: An Introduction and Overview
51 pages
Lecture 5: Chapter 5 Statistical Analysis of Data Yes The "S" Word
No ratings yet
Lecture 5: Chapter 5 Statistical Analysis of Data Yes The "S" Word
42 pages
Ten Deadly Statistical Traps in Pharmaceutical Quality Control
No ratings yet
Ten Deadly Statistical Traps in Pharmaceutical Quality Control
70 pages
Visual COBOL Question and Answers PDF
No ratings yet
Visual COBOL Question and Answers PDF
33 pages
HL 1 Topic 1 PP Notes
No ratings yet
HL 1 Topic 1 PP Notes
5 pages
Theresa Hughes Data Analysis and Surveying 101
No ratings yet
Theresa Hughes Data Analysis and Surveying 101
37 pages
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Standard Specification For Castings, Austenitic-Ferritic (Duplex) Stainless Steel, For Pressure-Containing Parts
No ratings yet
Standard Specification For Castings, Austenitic-Ferritic (Duplex) Stainless Steel, For Pressure-Containing Parts
6 pages
PP Math6 QTR2W7 Day 1
No ratings yet
PP Math6 QTR2W7 Day 1
14 pages
Def Slide
No ratings yet
Def Slide
9 pages
Introduction To Aerospace Engineering
No ratings yet
Introduction To Aerospace Engineering
5 pages
Quiz 2 PF
No ratings yet
Quiz 2 PF
7 pages
Unit 4 - Week 2: Introduction To Python: Assignment 2
No ratings yet
Unit 4 - Week 2: Introduction To Python: Assignment 2
4 pages
Kowsi Final Project
No ratings yet
Kowsi Final Project
50 pages
MCQ Class 2 MS Word
No ratings yet
MCQ Class 2 MS Word
11 pages
60. Đề Thi Thử TN THPT 2021 - Môn Tiếng Anh - Sở GD & ĐT Hưng Yên - File Word Có Lời Giải
No ratings yet
60. Đề Thi Thử TN THPT 2021 - Môn Tiếng Anh - Sở GD & ĐT Hưng Yên - File Word Có Lời Giải
6 pages
Calculators List Allowed
No ratings yet
Calculators List Allowed
1 page
New Vendor Form
No ratings yet
New Vendor Form
1 page
? Gallery Walk Scoring Rubric
No ratings yet
? Gallery Walk Scoring Rubric
2 pages
Volktek - Solution Catalog For Surveillance Ethernet
No ratings yet
Volktek - Solution Catalog For Surveillance Ethernet
55 pages

Unit 0 - Statistics Unit Notes Dictated (CLOSED)

Uploaded by

Unit 0 - Statistics Unit Notes Dictated (CLOSED)

Uploaded by

Class 8/26 statistics unit starts

Importance: Why stats and data?

Science is a discipline based on a collection of quantifiable data.

Science can only ask questions answered by quantifiable data

No qualitative data of any kind e.g. Theology, philosophy

Structure defines function of everything.

Science is not old; understanding limited by technology.

There is no law in biology, unlike chemistry or physics.

Validity & significance of data

Data is not always true.

Trust, but verify.

Verification will happen via statistical analysis.

End class 8/26.

Not all data is significant or valid, as data can be skewed / flawed.

Conclusions are valid only when supported by valid / significant data.

How to validate data? Statistics

Statistics remove significant data from noise.

Collection of v&s data requires multiple trials.

When writing procedures always minimum three trials

Greater sample size, better and v&s data

One model / graph > multi- page writeup

Only ever is the mean plotted

Example: Enzymatic activity in the face of temperature swings

Three trials at room temperature plot 1 points: Average of Trials

Three trials at high temperature plot 1 points: Average of Trials

Three trials at low temperature plot 1 points: Average of Trials

Total number of points: 3

The mean is the average across multiple trials.

One point for variable change

Dotted line is the meaning

Consider organisms like ecosystems: Limited resources.

Biggest limited resource is energy.

Very perfect, unlikely to fit a normal curve exactly

Skew is mitigated via error bars

Data spread impacts validity of the mean as a representative of the data.

Quantify via deviation/error

Used to determine the spread of data points from the mean

Equation of standard deviation

Confidence does not equal proof

The mean is meant to be a representative of a population.

End class 8/28

Class 9/3 Error Bars

Error bars and significance

2. How likely is it that there is a significant data between data sets.

What is a significant difference

If error bars overlap, the data is not significant.

Benefit to using 2SE

Always mention if graph uses SE or 2SE

Key components of experimental design:

1. Independent variable: Variable that is manipulated. Plot on x-axis

Positive v negative controls

End class 9/3

In a population of 200, 150 would be dominant, 50 would be recessive.

The two kinds of hypotheses

Critical value is given by a chart, readable by degrees of freedom and p-value.

Generally, the accepted p-value is 0.05, unless stated otherwise.

Degrees of freedom is possible states minus 1.

Steps in a chi-square test

End Statistics Unit

You might also like