Lect w7 f2023
Lect w7 f2023
V-clip Timestamps
(m13a/aa, slides 2 and 3) 0:33 intro to h-test terminology; 1:13 “null” vs “alternative”; 3:50 “right” and
“wrong” side; 4:50 one-sided/tailed vs two-sided/tailed alt-hypos; 5:40 reject/fail to reject/ support; 6:18 Type
1or2 error and Type1 explained; 8:06 Type 2 error and “alpha” versus “beta”; 8:33 alpha “tolerance” (“alpha for
testing” / “significance level for the test) and “P-value” and “sig”/”significance”; 10:45 Power of a test; 11:45
first part of how to choose alpha; 12:35 intro to hypo testing procedure (emphasis on picking correct test and
stating correct conclusion). m13aa general form and meaning of test statistics and interpretation of P-value.
(m13b; slide 3… and some 4,5) 0:00 step1 = defining pop stat of interest; 3:04 state null hypo and name
test and importance of these steps; 4:33 stating alt hypo and its importance; 6:05 setting alpha and more
about how/why to choose values lower/higher than 0.05; 8:03 assumptions checking step and the
“randomness” assumption; 8:55 NORMALCY ASSUMPTION (often called “sample size”, but this isn’t really
what it is!); 9:45 other assumptions, equality of variance; 10:28 converting SPSS or table
“significance”/”p”/”tail-area” test result to the REAL p-value of your test outcome.
(m13bb; slides 4,5) 0:00 how to use my 2 slides with 11 tests summarized; 1:43 special explanation of
paired t-test notation and logic; 4:10 univariate Chi-squared versus Z-tests; 4:50 univariate anova versus
indep or pooled t-test; 5:10 Bivariate tests
Asking For Information Gathering Information Assessing Information
– Example: based on a sample of Dutch nationals, is the average difference of age 10 and age 20 heights (average teenage
growth) for Dutch people Dutch person greater than the supposed/known American average of 55cm
• X1 = age 10 height in cm, x2 = age 20 height; x diff = x2-x1; m0 = 55; H1: mdiff > m0
• Large (one-)Sample Z-test for proportion
– H0: pA = po (= phypothesized = pref )
– Example: based on a sample of Dutch nationals, is the percentage of Dutch persons taller than 200cm the supposed/known
American average of 10%?
• pA = proportion over 2 meters; p0 = 0.10; H1: px > p0
• Univariate 2 (more than two category proportions/counts of interest)
– H0 : pA = pAo ; : pAB = pBo ; : pC = pCo ; ….. [exact reference value for each category]
– Example: based on a sample of Dutch nationals, are the proportions of blonds, brunettes, redheads, and ‘others’ the same as
the American 10%, 20%, 5%, and 65%
• Stats of interest are the Dutch population proportions of each of the 4 hair color types, Expected counts (needed for test
statistic formula) are n*0.1, n*0.2, n*0.05, n*0.65; H1: at least one population proportion differs from the hypothesized value.
• Univariate ANOVA 2 (more than two population means of interest)
– H0: m1 = m2 = m3 … ; H1: at least one inequality of pop means.
– Example: based on a sample of Dutch nationals, are the average IQs of blonds/brunettes/redheads/others the same?
ShortList of Tests: Bivariate
• (two) Independent Sample T-test for means (different pop
variances)
– H0: m1 = m2 (or m1 - m2 = 0)
– Example: based on a samples of Dutch and Americans nationals, is the average Dutch person taller than the average
American?
• X = height in cm; m is pop mean of x; pop 1 is Dutch, pop2 American; H1: m1 > m2
• Two Sample Z-test for proportions
– H0: p1 = p2 (or p1 - p2 = 0)
– Example: based on a samples of Dutch and Americans, is the percentage of Dutch persons taller than 200cm greater
than that for Americans?
• p = population proportion over 2 meters; pop A American, pop D Dutch, H1: pD > pA
• (Linear) correlation test
– H0: correlation r = 0 (or regression slope b = 0)
– Example: based on a samples of Dutch, do taller people have higher IQs?
• X is height, y is IQ; r is linear correlation of IQ with height for the population, b is the slope of the population
linear relationship for determining IQ based on height. H1 : r > 0 : or b > 0