Week 4 Team Lecture
Week 4 Team Lecture
Descriptive points/measures
of central
statistics tendency
extent of association
between/among
variables
4
A point of central tendency is a point around
which the data revolve, a middle number around which
the data regarding a particular variable seem to hover.
5
“The mode is the single
number or score that
occurs most frequently.”
6
What is the mode in the
following data set?
3 4 6 7 7 9 9 9 9 10 11 11 13 13 13 15 15 21 26
Mode = 9
7
The median is the numerical center of a set of
data; to facilitate our discussion, we will call each
piece of data in the set a “score.”
8
What is the median in the
following data set?
69 70 72 73 75 76 77 78 79 81 83 83 84 85 86
87 89 90 91 92
Median = 82
9
“…the mean [is] the fulcrum point
for a set of data: it represents the
single point at which the two sides
of a distribution ‘balance.’”
11
“…the geometric mean… is computed
by multiplying all of the scores together
and then finding the Nth root of the
product.”
12
Arithmetic Mean vs.
Geometric Mean
Attribute Arithmetic Mean Geometric Mean
The sum of all values
The nth root of the
Definition divided by the number of
product of n values.
values.
Applicable to any set of Applicable to positive
Applicability
numerical data. values only.
Can be calculated with Can also be calculated
Weighted Mean
weighted values. with weighted values.
Outliers have a significant Outliers have a lesser
Effect of Outliers
impact on the mean. impact on the mean.
Represents the average Represents the central
Interpretation
value of a set of data. tendency of a set of data.
Sum of values divided by Nth root of the product of
Calculation
the count of values. values.
13
“The configuration of the data
dictates the measure of central
tendency most appropriate for
that particular situation.”
14
“The nature of the data—the facts
of life—governs the statistical
technique, not the other way
around.”
15
“… we can often make reasonable predictions about
a population based on our knowledge of central
tendency…
17
“… data are more similar if they cluster about the mean.
Scatter them, and they lose some of their uniformity;
they become more diverse, more heterogeneous. As
specific data points recede farther from the mean, they
lose more and more of the quality that makes them
‘average.’”
18
The simplest measure of variability is the range,
which indicates the spread of the data from lowest
to highest value:
19
The interquartile range is equal to
Quartile 3 (the 75th percentile point) minus
Quartile 1 (the 25th percentile point)
21
Average Deviation
25
“More useful in statistical analyses are
standard scores. Simply put, a standard score
tells us how far an individual’s performance is
from the mean with respect to standard
deviation units.”
26
z-score
27
Example: Mary’s
Extroversion Test Result
28
Converting a Z-Score to Another
Scale
29
Mary’s Score as an IQ Score
30
“… statistical manipulation of
the data is not, in and of itself,
research.”
31
Limitation of Measures of Central
Tendency
32
“The statistical process by which we
discover whether two or more variables
are in some way associated with one
another is called correlation.”
33
correlation
coefficient for
two variables
Directio
Strength
n
34
“The most widely used statistic for determining
correlation is the Pearson product moment
correlation, sometimes called the Pearson r. But
there are numerous other correlation statistics as
well. As is true for measures of both central tendency
and variability, the nature of the data determines the
technique that is most appropriate for calculating
correlation.”
35
36
37
“Not all relationships take a linear
form… Always keep in mind that the
nature of the data governs the
correlational procedure that is
appropriate for those data. ”
38
“Correlation does not
necessarily indicate
causation…”
39
“… inferential statistics allow us to draw
inferences about large populations from
relatively small samples...”
42
“… Different samples—even when each one has been randomly
selected from the same population—will almost certainly yield
slightly different estimates of the overall population. The
difference between the population mean and a sample mean
constitutes an error in our estimation.”
43
“… the standard deviation
for the distribution of
sample means—is called
the standard error of the
mean…”
44
Statistically, when all of the samples are of a
particular size (n), the standard error of the
mean for any variable being measured is
related to the standard deviation for the
variable itself in the following formula:
45
46
Point Versus Interval
Estimates
47
“… A research hypothesis is a reasonable conjecture,
an educated guess, a theoretically or empirically based
prediction. Its purpose is a practical one: It provides a
logical framework that guides a researcher while
designing a research study and collecting data.”
48
This process of comparing observed data
with the results we would expect from
chance alone is called testing the null
hypothesis.
49
statistically
significant
result
1-in-20 1-in-100
probabilit criterion
y
50
“… if we incorrectly reject the null
hypothesis—we are making a Type I
error (also called an alpha error)…”
52
53
54
References: