PSYB07 Final Notes
PSYB07 Final Notes
Chisquare assumptions
1. The IV consists of mutually exclusive and exhaustive categories; the
measure is frequency
2. Independence of observations (i.e., one observation per subject; subjects
fit into only one of the mutually-exclusive categories)
3. Expected frequencies for each category are 5 or greater. This is because
the χ2 statistic (which we will cover next) is calculated using frequencies,
which are continuous, and approximates a continuous variable when N is
large
Distribution
1. One-tailed
2. All values are positive because you are squaring the values
3. Begins to look normal when df approached infinity
a. Df = k-1 where k is the number of categories
PHI
- This sounds suspiciously like a correlation... That’s because it is one! Phi is
the measure of association between two binary variables. A Pearson
correlation coefficient estimated for two binary variables will return the phi
coefficient.
Correlation and regression
➔ Both are statistical techniques to determine the relationships between two
variables
Correlation coefficient
● Pearson’s correlation coefficient of a sample = r
● Correlation of a population = p (rho)
Testing a correlation
● You can use a t-test to assess whether or not a relationship actually exists
between two variables
Assumptions
● Dependent and independent variables are continuous
● The DV and IV are normally distributed
● No outliers in DV or IV; no bivariate outliers
○ Correlations are not resistant to outliers
● DV and IV are linearly related
● Correlation must be significant to run regression
Cohen’s D Pearson’s r
Formula
Process of calculations
- Test the correlation using a t-test
Regression
➔ Associated with observational research
➔ All scores taken into consideration
➔ Y = mx+B
➔ Line of best fit
➔ Residuals
◆ The residuals refer to the difference between your observed y-scores, and the y-scores
generated by your equation
◆ How far off line is at predicting values
ANOVA Test
T-test limitations
● We can only compare two group means at a time
● We can only analyze one IV at a time
● You increase the Type 1 error rate when you do too many tests
Assumptions
1. The dependent variable is continuous and the independent variable is categorical
2. Data and observations are independent
3. DV is normally distributed to all groups
4. Variances are homogeneous for all groups
5. No outliers beyond +/- 4 standard deviations in all groups s
Variance
- Error estimate of variance does not change regardless of if the null is true or not
Treatment estimate