Principles of Language Assessment
There are five cardinal criteria for "testing a test": practicality, reliability, validity, authenticity, and washback.
1. Practicality
A practical test is not excessively expensive, stays within appropriate time constraints, is relatively easy to administer, and has a scoring procedure that is specific and time-efficient.
Example: A test with 20 listening items based on an audio recording, 80 multiple-choice items on grammar, vocabulary, and reading comprehension, and an accompanying scoring grid is considered practical.
2. Reliability
A reliable test is consistent and dependable.
• If you give the same test to the same student on two different occasions, the results should be similar.
• Inter-rater reliability is compromised when two or more scorers yield inconsistent scores on the same test because of inattention, inexperience, or preconceived biases; in other words, when the scorers are not applying the same standards. (A numerical sketch of rater agreement follows this list.)
• Intra-rater reliability is threatened by unclear scoring criteria, fatigue, bias toward particular "good" and "bad" students, or carelessness. A teacher's scoring of the first tests in a stack can differ from the last ones because the teacher grows tired, producing inconsistent evaluation across tests. Careful specification of analytical scoring criteria can increase rater reliability.
• Test administration reliability concerns the conditions in which the test is administered. In a listening comprehension test with a lot of noise from the street, students sitting next to the windows cannot hear well. Other threats include photocopying variations (too dark, too blurry), the amount of light in different parts of the room, variations in temperature, and the condition of chairs and desks.
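To make rater reliability concrete, here is a minimal sketch in Python (the scores are hypothetical, purely for illustration) comparing two raters who scored the same ten essays on a 1-to-5 band scale. Percent agreement is the naive index; Cohen's kappa corrects it for agreement expected by chance.

# Hypothetical bands assigned by two raters to the same ten essays.
rater_a = [4, 3, 5, 2, 4, 3, 5, 1, 2, 4]
rater_b = [4, 2, 5, 2, 3, 3, 5, 1, 2, 5]

# Percent agreement: how often the raters assign the identical band.
n = len(rater_a)
p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

# Chance agreement: probability that both raters pick the same band
# at random, given how often each rater actually used each band.
bands = set(rater_a) | set(rater_b)
p_chance = sum((rater_a.count(c) / n) * (rater_b.count(c) / n) for c in bands)

# Cohen's kappa: agreement above chance (1.0 means perfect agreement).
kappa = (p_observed - p_chance) / (1 - p_chance)

print(f"Percent agreement: {p_observed:.2f}")
print(f"Cohen's kappa:     {kappa:.2f}")

A kappa well below 1.0 would signal that the two raters are not applying the same standards and that the scoring criteria need to be specified more carefully.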
3. Validity
A valid test measures what it is intended to measure.
Example:
• A valid test of reading ability actually measures reading ability, not 20/20 vision.
• To measure writing ability, one might ask students to write as many words as they can in 15 minutes and then simply count the words for the final score. This is easy to administer, but it is not a valid test of writing ability without some consideration of the comprehensibility and organization of what was written.
How is the validity of a test established?
There is no final, absolute measure of validity, but there are certain aspects that one can take into consideration:
• Examine the extent to which a test calls for performance that matches that of the
course or unit of study being tested
• Consider how well a test determines whether or not students have reached an established set of goals or a level of competence
• Statistical correlation with other related but independent measures (a minimal sketch follows this list)
• Focus on the consequences of a test, or even on the test-takers' perception of validity
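As an illustration of the statistical-correlation point above, here is a minimal sketch in Python (the scores and the instrument names are hypothetical, not from the source) that correlates students' scores on a teacher-made reading test with their scores on an independent, established measure of the same ability. A strong positive correlation is one piece of criterion-related evidence of validity.

import math

# Hypothetical paired scores: a teacher-made test and an
# independent measure of the same reading ability.
classroom_test = [62, 75, 80, 55, 90, 70, 85, 66]
external_measure = [58, 72, 84, 50, 93, 68, 80, 70]

def pearson_r(xs, ys):
    """Pearson correlation coefficient between paired score lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)

# r close to +1 supports the claim that both instruments measure the
# same underlying ability; r near 0 undermines it.
print(f"r = {pearson_r(classroom_test, external_measure):.2f}")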
Types of evidence to establish the validity of a test
1. Content-Related Evidence
• If a test samples the subject matter about which conclusions are to be drawn, and if it requires the test-taker to perform the behavior that is being measured, it can claim content-related evidence of validity
• You can usually identify content-related evidence if you can clearly define the achievement that you are measuring
Example:
• If you are trying to assess a person's ability to speak a second language in a conversational setting, asking the learner to answer paper-and-pencil multiple-choice questions requiring grammatical judgments does not achieve content validity. A test that requires the learner to actually speak within some sort of authentic context does.
• Highly specialized and sophisticated testing instruments may have questionable content validity.
• Standard language proficiency tests lack content validity since they do not require the full spectrum of communicative performance on the part of the learner.
• What such proficiency tests lack in content validity, they may gain in other forms of evidence.
Understand the difference between direct and indirect testing:
• Direct testing: the test-taker actually performs the target task (for example, speaking in order to assess oral production).
• Indirect testing: the test-taker performs a task that is only related to the target task (for example, marking stressed syllables in a list of written words to assess pronunciation).
The most feasible rule of thumb for achieving content validity is to test performance directly.
2. Criterion-Related Evidence
• The extent to which the criterion of the test has actually been reached
• Specified classroom objectives are measured, and implied predetermined levels of
performance are expected to be reached
• In the case of teacher-made classroom assessments, criterion-related evidence is
demonstrated through a comparison of results of an assessment with results of some
other measure of the same criterion
Example: In a course unit where students are expected to produce voiced and voiceless stops orally in all phonetic environments, the results of one teacher's unit test might be compared with an independent assessment of the same phonemic proficiency.
• Criterion-related evidence falls into one of two categories: concurrent validity and predictive validity.
• Concurrent validity: a test's results are supported by other concurrent performance beyond the assessment itself.
Example: the validity of a high score on the final exam of a foreign language course will be sustained by actual proficiency in the language.
• Predictive validity: an assessment predicts a test-taker's likelihood of future success, as in placement or admissions tests.
3. Construct-Related Evidence
• A construct is any theory, hypothesis or model that attempts to explain
observed phenomena in our universe of perceptions
• In the field of assessment, construct validity asks: Does this test actually tap into the theoretical construct as it has been defined?
• Standardized, large-scale tests adhere to the principle of practicality, and because they must sample a limited number of domains of language, they may not be able to contain all the content of a particular skill.
• The TOEFL, for example, has until recently not attempted to sample oral production, yet oral production is clearly an important part of the ability the test claims to measure.
4. Consequential Validity
• Encompasses the consequences of a test, including its impact on the preparation of test-takers, its effect on the learner, and the social consequences of its interpretation and use.
• The extent to which students view the assessment as fair, relevant, and useful for improving learning.
5. Face Validity
• The degree to which a test looks right and appears to measure the knowledge or abilities it claims to measure.
• Students may feel that a test is not testing what it is supposed to test.
• Face validity means that the students perceive the test to be valid.
Face validity will be high if students encounter a test that:
• Has clear instructions
• Has appropriate timing
• Contains no surprises (an expected format with familiar tasks)
• Is logically organized
The other side of this issue reminds us of the psychological state of the learner: confidence and low anxiety are important ingredients in peak performance.
4. Authenticity
• The degree of correspondence of the characteristics of a given language test task to the features of a target language task.
• The students have to perform tasks that were included in the previous classroom lessons and that represent the objectives of the unit.
• Are classroom objectives identified? Compare:
Valid objective: Students will produce yes/no questions with final rising intonation.
Invalid objectives: "Students should be able to demonstrate some reading comprehension" or "Practice vocabulary in context." These are invalid because they are ambiguous and no standards of performance are implied.
5. Washback
• The effect of testing on teaching and learning.
• When you return a written test, consider giving more than a number as your feedback.