Cheat Sheet 1

Uploaded by

aizharyk.zhabay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views2 pages

Cheat Sheet 1

Uploaded by

aizharyk.zhabay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Types of data Dependent events: Pr(AandB) = Pr(B)*Pr(A|B) -The study or experiment consists of finite number smaller experiments (trials)

-The study or experiment consists of finite number smaller experiments (trials) each of
Numeric: discrete(1,2,3), continuous(1,1.25,2,.2.8) Pr(AandB) = Pr(A)*Pr(B|A) which has only two possible outcomes, such as dead/alive, diseased/non-diseased,
Non-numeric: nominal(binary,polytomous), ordinal(low-medium-high) Independent events: Pr(AandB) = Pr(A)*Pr(B) success/failure, etc.
1 numeric: box plot, histogram, stem&leaf plot Justification of independence: Pr(A|B) = Pr(A) -The outcomes of the trials are independent.
1 categorical: bar plot, pie chart Pr(B|A) = Pr(B) -The probabilities of the outcomes of the trials remain the same from trial to trial.
1 numeric+1 categorical: side by side box plot, stem&leaf, histogram Pr(X=1) = p , contrary, Pr(X=0) = 1-p
2 numeric: scatter plot Test validity measures
Sensitivity-“Among diseased, how many of them would be tested positive?” , where n = number of
Central tendency measures: mean, median and mode. trials/subjects, x = number of successes, p = probability of success.
Mean is just arithmetic average of all values. Mean = n*p
Specificity-“Among not diseased, how many of them would be tested negative?” Variance= n*p (1−p)
Standard deviation=
Median – a point in the middle of sequence.
PPV-“Among who tested positive, how many of them are actually diseased?” Poisson distribution
For even- take average of (n/2)th and (n/2 +1)th values
For odd- (n/2+0.5) -The probability of an event in a short interval is proportional to the length of the
Mode- the most frequently occurring value interval.
Dispersion or spread of values NPV-“Among tested negatives, how many of them are actually not diseased?” -An infinite number of events can occur in the interval.
Range- a difference between largest and smallest values -In any extremely small portion of the interval, the probability of more than one
Variance-spread between numbers in a data set. occurrence of the event is approximately zero.
-Whether or not an event occurs in an interval is independent of events in all other
intervals. Apart from independence between observations, in Poisson distribution
Standard deviation is an average difference between observations and sample probability of event occurrence in one interval should not affect that of other intervals.
mean. -Mean and variance are equal to each other.

Most sensitive to outliers: mean, variance, sd, range

not sensitive: median, mode, IQR, Q1, Q3 , where e is a constant (~2.718), λ is an average or
"Quantile"-percentill,quartiles,tertiles(33%) expected number of occurrences of the random event in the interval, x is a number of
P25-Q1-lower hinge events in a question.
P50-Q2-median Poisson distribution is a choice when there is a binomial random variable with very
P75-Q3-upper hinge large n and very low probability.
IQR=Q3-Q1
Upper fence = Upper hinge + 1.5*IQR
Lower Fence = Lower hinge – 1.5*IQR
Normal distribution
Continuous random variables-histogram, box plot, QQ plot
the rule “68-95-99.7”
Probability distributions:
1.Discrete random variables include outcomes that are finite and that can be counted.
Binomial distribution is used for binary variables, which contains only two possible
outcomes. Examples are diseased/non-diseased, dead/alive, infected/non-infected,
etc. To standardize any normal distribution:
Poisson probability distribution is utilized to predict the counts of events or rates. For
Symmetric: Mean = Median= Mode example, number of physician visits, vaccination rate during a year, number of suicidal
Left-skewed: Mean < Median < Mode attempts in a last five months. Characteristics of SRS:
Right-skewed: Mean > Median > Mode 2.Continuous random variables are able to take any value and not limited to integers. 1.Randomly selected sample usually has similar distribution shape as the population
Population parameters: μ, σ2, σ Gaussian (Normal) probability distribution describes probability function for continuous where it came from.
Sample estimates, statistics: x̄ , s2, s variables, where not only integers are outcome values. A bell-shaped curve is used to 2.The sample statistics will be trying to approximate the population parameters.
calculate probability of having a value within a specific range. 3. Variability in estimates always exists between different samples, even though they
Probability-a measure of the uncertainty associated with the occurrence of events. came from the same population-Sampling variability
Joint probability – likelihood of collection of events occurring at the same time point. Factorials is a number of possible arrangements of n objects. Sampling Distribution of the Sample Mean
The formula is: n!=n(n-1)(n-2)(n-3)…(2)(1)
0! = 1
Union probability – the probability of events A or B or … or X or all together occur. 4!=4*3!=4*3*2!=4*3*2*1=24 The mean of the Sampling Distribution of the Sample Mean is equal to the population
Permutations-the ways of arranging things in orders. mean:
Addition rule (Union Probability) , where permutations of n objects taken r at a time.
Mutually exclusive events: Pr(AorB)=Pr(A)+Pr(B) Combination is an arrangement of n objects take r at a time without regard to order. The variance of the Sampling Distribution of the Sample Mean equals the population
Non-mutually exclusive events: Pr (A or B) = Pr (A) + Pr (B) – Pr (A and B) variance divided by sample size.
Conditional Probability: Pr(A|B) = Pr(AandB)/ Pr(B) , where is combination of n objects taking r.
Multiplication rule (Joint Probability) Binomial Distribution
Standard deviation of the Sampling Distribution of Sample mean, also known as
Standard Error (SE)
upper one-sided confidence interval:

two-sided confidence interval:

T-Distribution

CLT
The Sampling Distribution of the Sample statistic follows Normality, regardless of the
shape of the population distribution, given that there is a sufficient sample size (n≥25) α=1 - confidence level
or population distribution is normal. α and confidence level are complimentary to each other.
If population distribution is normal- CLT. If population distribution is not normal, we If α is 0.05, then the confidence level is equal to 1-0.05, which is 0.95 or 95%.
need n≥25 at least to apply CLT. Construct 95% CI.
Hypothesis testing If the confidence level is 0.99, then α is 1-0.99, then α is 0.01.
The P-value is the probability of getting a sample statistic (x), given we believe that H0
is likely true by chance alone. Statistical importance is when the statistical test for one problem has a p-value less
Pr(observing data|H0 is true) than α. Rejecting the Null hypothesis and stating that the alternative hypothesis is
If the p-value is less than 0.05 (p<0.05) a statistically significant result. likely true is a statistically significant result.
If the p-value is larger than 0.05 (p>0.05) not a statistically significant result. Biological/Clinical/Public Health importance is a result of finding that makes a
difference in practice.

The significance level or alpha (α) is the probability of rejecting H0 when H0 is true –
Pr(reject H0|H0 is true)(type I error)
P-value <α => Reject H0, accept HA
P-value >α=> fail to reject H0, cannot accept HA
To increase power- decrease Type II error
Critical values are Z-statistics that correspond to the significance level.
1) Increasing α leads to decrease in β
2) The other way to decrease β and increase power is to increase the difference
between 0 and A
3) The next manipulation to increase power is to affect the Standard Deviation of
sample mean
Sample size calculation:
|Z-statistic| > |Critical value| => Reject H0, accept HA
|Z-statistic| ≤ |Critical value| => fail to reject H0, cannot accept HA

Confidence interval includes the population mean with some level of confidence.
Factors that affect the required sample size include:
lower one-sided confidence interval:

STA301 Formulas Definitions 01 To 45
0% (1)
STA301 Formulas Definitions 01 To 45
28 pages
First Course Mathematical Statistics by C. E. Weatherburn
100% (1)
First Course Mathematical Statistics by C. E. Weatherburn
302 pages
Designing Machine Learning Systems With Python - Sample Chapter
100% (1)
Designing Machine Learning Systems With Python - Sample Chapter
31 pages
Basic Statistics For Health Sciences
91% (11)
Basic Statistics For Health Sciences
361 pages
Exercises Solutions
100% (2)
Exercises Solutions
67 pages
AP Statistics Study Guide
100% (2)
AP Statistics Study Guide
12 pages
Ring Frame End Breakage Distribution: D e X 0.75 T
No ratings yet
Ring Frame End Breakage Distribution: D e X 0.75 T
9 pages
Seismic Map For Kuwait PDF
No ratings yet
Seismic Map For Kuwait PDF
6 pages
Statistics ESCP
No ratings yet
Statistics ESCP
383 pages
Biostatistics Lectures
No ratings yet
Biostatistics Lectures
26 pages
TUV SUD Certified Lean Six Sigma Green Belt (CLSSGB) Pre-Study Material ...
No ratings yet
TUV SUD Certified Lean Six Sigma Green Belt (CLSSGB) Pre-Study Material ...
21 pages
Statistical Inference: Prepared By: Antonio E. Chan, M.D
No ratings yet
Statistical Inference: Prepared By: Antonio E. Chan, M.D
227 pages
Research Designe and Basics of Stistics Manish Jain
100% (1)
Research Designe and Basics of Stistics Manish Jain
67 pages
Stats 1 For Students
No ratings yet
Stats 1 For Students
60 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
Final Cheat Sheet 2
No ratings yet
Final Cheat Sheet 2
4 pages
Bio Statistics
No ratings yet
Bio Statistics
97 pages
QM Formula Class
No ratings yet
QM Formula Class
31 pages
STAT100 - Full Course Notes
No ratings yet
STAT100 - Full Course Notes
27 pages
Bio-Stat Class 2 and 3
No ratings yet
Bio-Stat Class 2 and 3
58 pages
Chapter 5:discrete Probability Distributions
No ratings yet
Chapter 5:discrete Probability Distributions
60 pages
Biostatistics Notes Part 1
No ratings yet
Biostatistics Notes Part 1
9 pages
Statistics
100% (1)
Statistics
11 pages
EDA-Discrete Probability Distribution
No ratings yet
EDA-Discrete Probability Distribution
35 pages
Lecture Note On Biostatistics
No ratings yet
Lecture Note On Biostatistics
74 pages
Study Guide - Biostatistics: 35% of Prevmed Exam (With Epi)
No ratings yet
Study Guide - Biostatistics: 35% of Prevmed Exam (With Epi)
14 pages
Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi
No ratings yet
Submitted To: Mrs. Geetika Vashisht College of Vocational Studies University of Delhi
36 pages
9.2.2 Pivotal Quantities Uniform
No ratings yet
9.2.2 Pivotal Quantities Uniform
16 pages
Intro SRM
No ratings yet
Intro SRM
73 pages
Instant Download An Introduction To Generalized Linear Models Annette J. Dobson PDF All Chapter
100% (1)
Instant Download An Introduction To Generalized Linear Models Annette J. Dobson PDF All Chapter
55 pages
Statistical Models: Modeling and Simulation
No ratings yet
Statistical Models: Modeling and Simulation
51 pages
11.inferential Statistics March 24
No ratings yet
11.inferential Statistics March 24
74 pages
Bio Statistics
No ratings yet
Bio Statistics
72 pages
Market Making Under A Weakly Consistent Limit Order Book
No ratings yet
Market Making Under A Weakly Consistent Limit Order Book
37 pages
Statistics През
No ratings yet
Statistics През
46 pages
J. Virtamo 38.3143 Queueing Theory / Queueing Networks 1
No ratings yet
J. Virtamo 38.3143 Queueing Theory / Queueing Networks 1
20 pages
M.SC Ag Fruit Science
No ratings yet
M.SC Ag Fruit Science
21 pages
COM 201 - Inferential Statistics - 18032022-1
No ratings yet
COM 201 - Inferential Statistics - 18032022-1
58 pages
Basics of Statistics
No ratings yet
Basics of Statistics
40 pages
Chapter 5 - RM
No ratings yet
Chapter 5 - RM
22 pages
Econ 41 Syllabus
No ratings yet
Econ 41 Syllabus
3 pages
Biostatistics Notes
No ratings yet
Biostatistics Notes
10 pages
Probability Distribution
No ratings yet
Probability Distribution
16 pages
Probability: Youtube: Learn With Ca. Pranav, Instagram: @learnwithpranav, Telegram: @pranavpopat, Twitter: @pranav - 2512
No ratings yet
Probability: Youtube: Learn With Ca. Pranav, Instagram: @learnwithpranav, Telegram: @pranavpopat, Twitter: @pranav - 2512
23 pages
Probability and Statistics - Practice Tests and Solutions
No ratings yet
Probability and Statistics - Practice Tests and Solutions
46 pages
Review of Chapters 1-5
No ratings yet
Review of Chapters 1-5
21 pages
PSM 2k23
No ratings yet
PSM 2k23
32 pages
8 (1) .Basic Stat Inference
No ratings yet
8 (1) .Basic Stat Inference
41 pages
Math 140 Final Review Notes
No ratings yet
Math 140 Final Review Notes
20 pages
Statistics - The Big Picture
No ratings yet
Statistics - The Big Picture
4 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
Normal Distribution: X e X F
No ratings yet
Normal Distribution: X e X F
30 pages
Binomial Distributions For Sample Counts
No ratings yet
Binomial Distributions For Sample Counts
38 pages
Classification of Data: Objectives: Understand How Data Are Classified. Recognize The Different Types of Data
No ratings yet
Classification of Data: Objectives: Understand How Data Are Classified. Recognize The Different Types of Data
39 pages
2statsnotes 1
No ratings yet
2statsnotes 1
24 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Cognitive Science Unit 3
No ratings yet
Cognitive Science Unit 3
15 pages
Statisitcs
No ratings yet
Statisitcs
22 pages
2.2 Probability
No ratings yet
2.2 Probability
19 pages
Probability Distributions-Sarin B
No ratings yet
Probability Distributions-Sarin B
20 pages
AP Stats Cheat Sheet FINAL
No ratings yet
AP Stats Cheat Sheet FINAL
8 pages
Biostat 2
No ratings yet
Biostat 2
18 pages
LQ1 Notes
No ratings yet
LQ1 Notes
15 pages
GB Academy Equation List
No ratings yet
GB Academy Equation List
16 pages
Statistical Methods
No ratings yet
Statistical Methods
16 pages
Assignment II & III 208
No ratings yet
Assignment II & III 208
9 pages
Theoretical Questions in Basic Business Statistics
No ratings yet
Theoretical Questions in Basic Business Statistics
12 pages
Key of Week1 - Lecture Notes
No ratings yet
Key of Week1 - Lecture Notes
10 pages
DescriptiveStatsFormulas JMP SAS
No ratings yet
DescriptiveStatsFormulas JMP SAS
21 pages
HL AI Probability Distributions Notes RMS
No ratings yet
HL AI Probability Distributions Notes RMS
11 pages
Discrete Distributions - Hypergeometric, Binomial, and Poisson - Engineering LibreTexts
No ratings yet
Discrete Distributions - Hypergeometric, Binomial, and Poisson - Engineering LibreTexts
14 pages
A. Variables:: Types of Distributions
No ratings yet
A. Variables:: Types of Distributions
10 pages
Where and When The Exam Is!!!: BM 1200 Quantitative Methods & Analytics
No ratings yet
Where and When The Exam Is!!!: BM 1200 Quantitative Methods & Analytics
11 pages
Question Bank
No ratings yet
Question Bank
8 pages
Prob Stats 2037-Exam-1-2014: Printed: October 6, 2014
No ratings yet
Prob Stats 2037-Exam-1-2014: Printed: October 6, 2014
7 pages
Comando Svy Stata
No ratings yet
Comando Svy Stata
3 pages
Statistics and Probability Reviewer
No ratings yet
Statistics and Probability Reviewer
7 pages
Definition of Median
No ratings yet
Definition of Median
6 pages
Jacobi Iterative Solution of Poisson's Equation in 1D
No ratings yet
Jacobi Iterative Solution of Poisson's Equation in 1D
11 pages
Review Exam 1
No ratings yet
Review Exam 1
3 pages
A Short Introduction To Probability
No ratings yet
A Short Introduction To Probability
22 pages
Probstats Reviewer
No ratings yet
Probstats Reviewer
3 pages
Stats Midterms Cheat Sheet
No ratings yet
Stats Midterms Cheat Sheet
3 pages
Statistics S1 Summary: X X X S
No ratings yet
Statistics S1 Summary: X X X S
3 pages
Sheet2 (4) Cambridge
No ratings yet
Sheet2 (4) Cambridge
3 pages
Binomial Poisson
No ratings yet
Binomial Poisson
5 pages

Cheat Sheet 1

Uploaded by

Cheat Sheet 1

Uploaded by

Types of data Dependent events: Pr(AandB) = Pr(B)*Pr(A|B) -The study or experiment consists of finite number smaller experiments (trials)

Most sensitive to outliers: mean, variance, sd, range

two-sided confidence interval:

You might also like