Week 1& 2 Testing
Week 1& 2 Testing
Assessment and
Evaluation
Defining Terms
• When we are talking about giving people a test and recording
scores etc., we would normally refer to this as an assessment
procedure.
• If, on the other hand, we are talking about looking back over a
course or a lesson and deciding what went well, what was learnt
and how people responded, we would prefer the term 'evaluate'
as it seems to describe a wider variety of data input (testing, but
also talking to people and recording impressions and so on).
EVALUATION
• It's not enough to be clear about what you want people to learn and to
design a teaching programme to achieve the objectives. We must also
have some way of knowing whether the objectives have been achieved.
• That's called testing.
• 6. Objective testing…. • G. Finding the best and worst students in the class
by giving them a test
• 7. Initial evaluation…. • H. A placement test
• Alternate response
• This sort of item is probably most familiar to language teachers as a True /
False test. (Technically, only two possibilities are allowed. If you have a True /
False / Don't know test, then it's really a multiple-choice test.)
• Multiple-choice
• This is sometimes called a fixed-response test. Typically, the correct answer
must be chosen from three or four alternatives. The 'wrong' items are called
the distractors.
Types of Test Items
• Structured response
• In tests of this sort, the subject is given a structure in which to form the answer.
Sentence completion items of the sort which require the subject to expand a sentence such as
He / come/ my house / yesterday / 9 o'clock into He came to my house at 9 o'clock yesterday
are tests of this sort as are writing tests in which the test-taker is constrained to include a
list of items in the response.
• Free response
• In these tests, no guidance is given other than the rubric and the subjects are free to write or
say what they like.
• A hybrid form of this and a structured response item is one where the subject is given a
list of things to include in the response but that is usually called a structured response test,
especially when the list of things to include covers most of the writing and little is left to the
test-taker's imagination.
TEST TYPES WHAT THE TESTS ARE EXAMPLE
INTENDED TO DO
Diagnostic tests
discover learners' strengths and a test set early in a programme to
weaknesses for planning purposes plan the syllabus
Proficiency tests test a learner’s ability in the public examinations and placement
language regardless of any course tests
they may have taken
Barrier tests a special type of test designed to a pre-course test which assesses the
discover if someone is ready to take learner's current level with respect
a course to the intended course content
WAYS OF TESTING AND
MARKING
Methodology Description Example Comments
The argument is that this
kind of test is more reliable
Testing a particular skill by Testing whether someone because it tests the
Direct testing getting the student to can write a discursive essay outcomes, not just the
perform that skill by asking them to write one individual skills and
knowledge that the test-
taker needs to deploy
• 8. Achievement test
Reliability
Validity
• If you are writing a test for your own class or an individual
learner or group of students for whom you want to plan a
course, or see how a course is going, then validity is most
important for you.
• You will only be running the test once and it isn't important that
the results are correlated to other tests. All you want to ensure
is that the test is testing what you think it's testing so
the results will be meaningful.
• There are five different sorts of validity to consider. Here they
VALIDITY
Construct validity
• At what level?
Concerning what topics?
For which audiences?
In what style?
In what register?
In what length of text?
and so on.
Concurrent validity
• If, for example, you have a well established proficiency test, such as
one administered by experienced examination boards, you may feel
that you would be better served with a shorter test that gave you the
same sort of data.
• This may be less important to you but if your test predicts well how
learners perform in the examination proper, it will tell you more than
if it doesn't.
Concurrent Validity