Principles of High Quality Assessment and Reliability
Principles of High Quality Assessment and Reliability
Recap
3. VALIDITY
◾ Something valid is something fair.
◾ A valid test is one that measures what it
• is supposed to measure.
• Types of Validity
◾ Face: What do students think of the test?
◾ Construct: Am I testing in the way I taught?
◾ Content: Am I testing what I taught?
◾ Criterion-related: How does this compare
with the existing valid test?
Tests can be made more valid by making
them more subjective (open items).
Validity – appropriateness, correctness, meaningfulness and
usefulness of the specific conclusions that a teacher reaches
regarding the teaching-learning situation.
Questions:
◾ Will the test take longer to design than apply?
◾ Will the test be easy to mark?
The problem:
The solution:
◾Informed consent
◾Anonymity and
Confidentiality
1. Gathering data
2. Recording Data
3. Reporting Data
ETHICS IN ASSESSMENT – “RIGHT AND
WRONG”
Questions:
◾ Can we trust the results of the test?
◾ Would we get the same results if the tests were
taken again and scored by a different person?
• With a large standard error, the band of scores is wide, and we have less
confidence in our obtained score.
• If the standard error is small, the band of scores will be narrow, and we
will have greater confidence that our obtained score is a dependable
measure of the characteristic.
Standard Error of Measurement
The relationship
between the reliability
coefficient and the
standard errors of
measurement can be
seen in the table, which
presents the standard
errors of measurement
for various reliability
coefficients and
standard deviations.
Standard Error of Measurement
• The main difficulty encountered with the standard error occurs when we
want to compare two assessments that use different types of scores.
Factors Influencing Reliability
Measures
Factors Influencing Reliability Measures
Objectivity • Methods
The sizeof of Estimating
the
Reliability
reliability
• The objectivity of an
coefficient is
assessment refers related to the
0 to the degree to 0 method of
which equally
3 competent scores 4 estimating
reliability.
obtain the same
• The variation in the
results. size of the
• The test items are reliability
of the objective coefficient
type and the resulting from the
resulting scores are
method of
not influenced by estimating
the scorers’ reliability is directly
judgment or opinion. attributable to the
type of consistency
included in each
Reliability of Assessments
Evaluated in Terms of a
Fixed Performance Standard
Fixed Performance Standard
• The most natural approach to reliability is to evaluate the consistency
with which students are classified as performing above or below the
standard.
Fails to meet
Meets standard
standard
Meets
2 20
standard
Assessment A
Fails to meet
7 1
standard
• Decision is important
• Decision is final
• Decision is irreversible
• Decision is unconfirmable
• Decision concerns individuals
• Decision has lasting consequences
Reliability Demands and Nature of the
Decision
Low reliability is demanded when the