100% found this document useful (1 vote)

325 views

Language Testing

This document discusses various types of language tests and testing methods. It describes proficiency tests, achievement tests, diagnostic tests, and placement tests. It also outlines direct versus indirect testing, discrete point versus integrative testing, norm-referenced versus criterion-referenced tests, objective versus subjective testing, reliability, validity, common test tasks, and analyzing test results.

Uploaded by

Danu Angga Vebriyanto

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

325 views

Language Testing

Uploaded by

Danu Angga Vebriyanto

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Language testing

A test is a sample of an individuals behaviour/performance on the basis of which inferences are made about the more general underlying competence of that individual. Language tests refer to any kind of measurement/examination technique which aims at describing the test takers foreign language proficiency, e.g. oral interview, listening comprehension task, free composition writing.

1. Kinds of tests and testing

Proficiency tests Proficiency tests aim to measure students' L2 competence regardless of any training they previously had in the language. In these tests, designers specify what the candidates should be able to do to pass the test. Achievement tests Achievement tests assess whether learners have acquired specific elements of language that they were taught in the language course they took part in. There are two types of achievement tests: final tests at the end of the course and progress tests during the course. Diagnostic tests

Diagnostic tests help identify learners' strengths and weaknesses in L2. Their main aim is to help teachers decide what needs to be taught to students.
Placement tests

With the help of placements tests students can be placed in the learning group that is appropriate for their level of competence.

Direct versus indirect testing Direct tests: candidates are required to perform the skill the test intends to measure. Indirect tests want to measure skills that underlie performance in a particular task.

Discrete point versus integrative testing

In discrete point tests every item focuses on one clear-cut segment of the target language without involving the others Typical test format: written multiple-choice test. In integrative tests candidates need to use a number of language elements at the same time in completing the test tasks. For example: essay writing, dictation, cloze test.

Norm referenced tests

In norm-referenced tests, candidates performance is assessed in comparison with that of the other candidates. For these reasons the cut-off points (line between fail and pass) are determined after the test results are obtained from the group of students based on the distribution of the scores.
TOTAL
50

Criterion referenced tests

Criterion-referenced tests compare all the testees to a predetermined criterion. In such tests everybody whose achievement comes up to the pre-set criterion will receive a pass mark, while those under it will fail. The criteria are often set in terms of tasks that students have to be able to perform (e.g. to interact with an interlocutor with ease; to ask for information and understand instructions).

Frequency

Std. Dev = 11,96 Mean = 53,1 N = 270,00

,0 75,0 70,0 65,0 60,0 55,0 50,0 45,0 40,0 35,0 30,0 25,0 20

TOTAL

Common European Framework of References for Languages

Proficient user C2 Can understand with ease virtually everything heard or read. Can summarise information from different spoken and written sources, reconstructing arguments and accounts in a coherent presentation. Can express him/herself spontaneously, very fluently and precisely, differentiating finer shades of meaning even in more complex situations. C1 Can understand a wide range of demanding, longer texts, and recognise implicit meaning. Can express him/herself fluently and spontaneously without much obvious searching for expressions. Can use language flexibly and effectively for social, academic and professional purposes. Can produce clear, well-structured, detailed text on complex subjects, showing controlled use of organisational patterns, connectors and cohesive devices.

Independent user B2Can understand the main ideas of complex text on both concrete and abstract topics, including technical discussions in his/her field of specialisation. Can interact with a degree of fluency and spontaneity that makes regular interaction with native speakers quite possible without strain for either party. Can produce clear, detailed text on a wide range of subjects and explain a viewpoint on a topical issue giving the advantages and disadvantages of various options. B1 Can understand the main points of clear standard input on familiar matters regularly encountered in work, school, leisure, etc. Can deal with most situations likely to arise whilst travelling in an area where the language is spoken. Can produce simple connected text on topics which are familiar or of personal interest. Can describe experiences and events, dreams, hopes and ambitions and briefly give reasons and explanations for opinions and plans.

Basic User

A2 Can understand sentences and frequently used expressions related to areas of most immediate relevance (e.g. very basic personal and family information, shopping, local geography, employment). Can communicate in simple and routine tasks requiring a simple and direct exchange of information on familiar and routine matters. Can describe in simple terms aspects of his/her background, immediate environment and matters in areas of immediate need. A1 Can understand and use familiar everyday expressions and very basic phrases aimed at the satisfaction of needs of a concrete type. Can introduce him/herself and others and can ask and answer questions about personal details such as where he/she lives, people he/she knows and things he/she has. Can interact in a simple way provided the other person talks slowly and clearly and is prepared to help.

Objective testing versus subjective testing

The scoring of a task is objective if the rater does not have to make a judgement because the scoring is unambiguous. For example: multiple choice test. In subjective test tasks, raters have to make a judgement when assessing candidates' performance. For example: marking of an essay.

Reliability
Consider the tasks in the 199 exam: 1. C-test 2. Gap-fill task 3. Summary writing Decide whether these tasks are direct or indirect, subjective or objective, integrative or discrete point tasks.
Reliability is the extent to which a test is free of random measurement error and produces consistent results when administered under similar conditions. This means that a reliable test is not affected by circumstances outside the test (e.g. the people who administer and mark the test, the time and place of the test) Types of reliability: internal consistency: whether the test items are related to each other and measure the same ability parallel or alternate form reliability: how well parallel or alternate forms of the same test measure the same ability

Validity
test-retest reliability: whether test-takers perform similarly each time they complete the test intra-rater reliability: whether the same rater assesses the test-takers' performance in the same way each time he/she evaluates the test inter-rater reliability: whether two raters assess the test-takers' performance in the same way
Validity is the extent to which a test measures what it is supposed to measure and nothing else. content validity: whether the test measures the ability it intends to measure; concurrent validity: whether the test takers' performance in a test correlates with their results in a different type of test; predictive validity: whether the test results accurately predict future performance; construct validity: whether the test appropriately represents the theory of language competence it is based on; face validity: whether the test looks as if it measures what it is supposed to measure.

About the validity of the C-test

The item-related strategies used by the participants Type of strategy Percentage of total Lexical 12.41 Syntactic 9.97 Morphological 3.71 Textual 5.36 Background knowledge 0.83 Translation 6.48 Counting the number of letters 15.31 No strategy used - automatically filled in 45.87 Total 100

3. Types of frequently used objective test tasks

Multiple choice. It consists of a stem: 1. He ______________ three letters since 9 o'clock. And options, one of which is correct and the others are distractors. A writes B has written C has been written D had written Cloze test It is a continuous text in which every Nth word is mechanically deleted. N is usually between five and ten. The examinees have to fill in these blanks. It aims to test reading comprehension, syntax and vocabulary.

C-test In the C-test the second half of every second word is left out. C-tests can provide a rough measure of learners' global level of proficiency. Dictation The basis of the procedure is that each individual dictated chunk is long enough (10-25 words) to exceed the learners short-term memory, and so the forgotten items have to be filled in from the context and the learners knowledge of the language. Editing The editing test is the is reverse of the cloze test. For example: extra words extra are inserted put placed gone into to a text, and testees are is required to crossing cross these out.

Matching Candidates are given a list of possible answers which they have to match with another list of words. For example: Match the words on the left with those on the right to make other English words. 1 head A partner 2 room B wife 3 business C master 4 house D mate Ordering In ordering tasks, candidates have to put a group of words, sentences or paragraphs in order. For example: Put the following words in order to complete the sentence: went yesterday I cinema friend to with.

The oral proficiency interview

Ideally the oral proficiency interview consists four phases: 1. Warm-up: usually not marked; 2. Level-check: getting an approximate idea of the learners proficiency level and the topics he/she feels comfortable in; 3. Probes: actual rating starts only at this stage, the interviewee is pushed up to or beyond his/her level of competence; 4.Wind-up: rounding off the interview by turning back to activities within the learners ability so as not to send him/her away with a feeling of failure.

Analysis of test results

The three most simple analyses of test results are the following: 1. Distribution curve shows the number of students scoring within a particular range.

Std. Dev = 3,25 Mean = 8,3 0 0,0 2,0 4,0 6,0 8,0 10,0 12,0 14,0 N = 61,00

Score

Statistical features of good tests

2. Facility value expresses the proportion of students who responded correctly to an item. For example: if 100 students took part in a test, and 54 of them got the item right, the facility value is 0.54. 3. Discrimination index expresses how well an item can discriminate between good and bad students. Ranges from 1 to - 1.

The distribution curve should be bellshaped. Facility values should be between 0.3 and 0.7 (or in more lenient approaches to test design 0.2-0.8). Discrimination indices should be above 0.4 (or in more lenient approaches to test design above 0.3).

Washback
Washback is the effect tests have on teaching and learning. A beneficial washback effect can be if a so far neglected skill (e.g. listening) is put into the focus of teaching as a result of the introduction of a test where scores in this skill are important in determining the candidates' grades. A negative washback effect can be if most of the time in lessons in secondary schools is spent on practising multiple choice tests. Tests have effect on those who take the test, the teachers who prepare the students for the tests, the teaching materials (e.g. course-books), the society and the educational system.

1. Explain the difference between proficiency and achievement tests; b) diagnostic and placement tests; c) direct and indirect tests; d) subjective and objective tests; e) norm-referenced and criterion referenced tests; f) integrative and discrete point tests. 2. What is reliability? List the various types of reliability. 3. What is validity? List the various types of validity. 4. What are the most frequently used objective test tasks? 5. What are the most frequent statistical measures of test performance? 6. What effects can tests have on teaching and learning?

OceanofPDF.com the Mental Game of Trading - Jared Tendler
0% (1)
OceanofPDF.com the Mental Game of Trading - Jared Tendler
1 page
Chapter 1 - Assessment Concepts and Issues
No ratings yet
Chapter 1 - Assessment Concepts and Issues
21 pages
Assessment Concepts and Issues
No ratings yet
Assessment Concepts and Issues
13 pages
Fulcher, G - Practical Language Testing
No ratings yet
Fulcher, G - Practical Language Testing
3 pages
You're Sexy When You Touch Like That
100% (2)
You're Sexy When You Touch Like That
15 pages
Baal Kadmon Tarot Magick - Harness The Magickal Z
92% (12)
Baal Kadmon Tarot Magick - Harness The Magickal Z
89 pages
Brainwave Frequency Glossary
No ratings yet
Brainwave Frequency Glossary
9 pages
Mistakes and Feedback
No ratings yet
Mistakes and Feedback
18 pages
Language Tests
No ratings yet
Language Tests
16 pages
Language Assessment and Proficiency Standards
100% (1)
Language Assessment and Proficiency Standards
3 pages
The Effect of PQRST Method On The Student
No ratings yet
The Effect of PQRST Method On The Student
30 pages
Discourse Types
No ratings yet
Discourse Types
17 pages
Teaching Speaking Skills
No ratings yet
Teaching Speaking Skills
6 pages
Assessing Writing
No ratings yet
Assessing Writing
13 pages
Teaching Language Skills
No ratings yet
Teaching Language Skills
6 pages
ELT 201 - Structure of English
No ratings yet
ELT 201 - Structure of English
8 pages
English Grammar: Ma. Martha Manette A. Madrid, Ed.D. Professor
No ratings yet
English Grammar: Ma. Martha Manette A. Madrid, Ed.D. Professor
32 pages
Cohesive Features in Argumentative Writing Produced by Chinese Undergraduates
No ratings yet
Cohesive Features in Argumentative Writing Produced by Chinese Undergraduates
14 pages
Grammar Learning Strategy Inventory
No ratings yet
Grammar Learning Strategy Inventory
3 pages
G1 - Chapter 9 - Testing Writing
No ratings yet
G1 - Chapter 9 - Testing Writing
42 pages
Individual Differences in Grammar
No ratings yet
Individual Differences in Grammar
7 pages
Towards Acquiring Communicative Competence Through Writing
67% (3)
Towards Acquiring Communicative Competence Through Writing
27 pages
History of Language Teaching
No ratings yet
History of Language Teaching
14 pages
Testing Speaking Skills Term Paper
No ratings yet
Testing Speaking Skills Term Paper
10 pages
The Role of Repetition in CLIL Teacher Discoursce
No ratings yet
The Role of Repetition in CLIL Teacher Discoursce
10 pages
Subject Assignment: Content & Language Integrated Learning: General Information
100% (2)
Subject Assignment: Content & Language Integrated Learning: General Information
37 pages
Social Factors and SLA - Pps
100% (1)
Social Factors and SLA - Pps
36 pages
ELT 1 Module 3
No ratings yet
ELT 1 Module 3
4 pages
Lecture 1 Sense Relations - Lecture Notes
No ratings yet
Lecture 1 Sense Relations - Lecture Notes
2 pages
Unit 12 Teaching Writing
No ratings yet
Unit 12 Teaching Writing
44 pages
First and Second Language Acquisition
No ratings yet
First and Second Language Acquisition
3 pages
Four Skills Presentation
No ratings yet
Four Skills Presentation
28 pages
Week 1 Introduction To The Teaching of Grammar: Prepared By: Lee Sow Ying
No ratings yet
Week 1 Introduction To The Teaching of Grammar: Prepared By: Lee Sow Ying
30 pages
Assessing Listening
No ratings yet
Assessing Listening
23 pages
Anthology Review Pedagogical Grammar
No ratings yet
Anthology Review Pedagogical Grammar
17 pages
Evaluation and Assessment: Group 6
No ratings yet
Evaluation and Assessment: Group 6
23 pages
2017 01JZamoraIRey If
No ratings yet
2017 01JZamoraIRey If
10 pages
Testing Pragmatic Competence in A Second Language
100% (1)
Testing Pragmatic Competence in A Second Language
22 pages
Language Testing-Principles of Language Assessment
No ratings yet
Language Testing-Principles of Language Assessment
15 pages
English Language Teaching Principles Practice (ELT/PP Year II) - Module II
No ratings yet
English Language Teaching Principles Practice (ELT/PP Year II) - Module II
90 pages
Baker & Westrup (2003)
No ratings yet
Baker & Westrup (2003)
171 pages
Speaking Assessment Rubrics PDF
No ratings yet
Speaking Assessment Rubrics PDF
2 pages
Liu & Braine 2005 Cohesive Features in Argumentative Writing Produced by Chinese Undergraduates
100% (2)
Liu & Braine 2005 Cohesive Features in Argumentative Writing Produced by Chinese Undergraduates
14 pages
Introduction Authentic Materials
No ratings yet
Introduction Authentic Materials
3 pages
ETH 342-De 5
No ratings yet
ETH 342-De 5
8 pages
Summary Chapter 7 and 9 - Nguyen Thi My Duyen - 1967 012 049
No ratings yet
Summary Chapter 7 and 9 - Nguyen Thi My Duyen - 1967 012 049
21 pages
CD Course Assessment
No ratings yet
CD Course Assessment
18 pages
Guidelines For Language Classroom Instruction
100% (1)
Guidelines For Language Classroom Instruction
14 pages
3 - Basic Principles of Assessment
No ratings yet
3 - Basic Principles of Assessment
20 pages
Course Outline - Semantics - Eng
No ratings yet
Course Outline - Semantics - Eng
5 pages
FP005 - Teaching Pronunciation Practice Activity
No ratings yet
FP005 - Teaching Pronunciation Practice Activity
9 pages
Maintaining Appropriate Student Behavior
No ratings yet
Maintaining Appropriate Student Behavior
2 pages
Basic Steps To Write An Academic Essay
No ratings yet
Basic Steps To Write An Academic Essay
3 pages
Assessing Speaking Group 6
100% (1)
Assessing Speaking Group 6
26 pages
Interlanguage Pragmatics in SLA
100% (1)
Interlanguage Pragmatics in SLA
6 pages
The Role of Pragmatics in Second Language Teaching PDF
No ratings yet
The Role of Pragmatics in Second Language Teaching PDF
63 pages
Testing Assessing Teaching SUMMARY
No ratings yet
Testing Assessing Teaching SUMMARY
6 pages
Structure of English
No ratings yet
Structure of English
17 pages
Harmer, J (2007) Chapter 6 Describing Teachers. The Practice of English Language Teaching. PP 107-108. England Pearson Education Limited.
No ratings yet
Harmer, J (2007) Chapter 6 Describing Teachers. The Practice of English Language Teaching. PP 107-108. England Pearson Education Limited.
3 pages
Speaking Skills Assessment Rubric
No ratings yet
Speaking Skills Assessment Rubric
1 page
Semantics, Pragmatics and Discourse
No ratings yet
Semantics, Pragmatics and Discourse
4 pages
Language Assessment Assignment - Observation and Reflection
100% (1)
Language Assessment Assignment - Observation and Reflection
13 pages
Tugas Language Testing Pak Halim
No ratings yet
Tugas Language Testing Pak Halim
16 pages
Uses of Language Test
67% (3)
Uses of Language Test
27 pages
Marylina Serio Resume Eportfolio
No ratings yet
Marylina Serio Resume Eportfolio
2 pages
Discipline and Your Child: Tips To Avoid Trouble
No ratings yet
Discipline and Your Child: Tips To Avoid Trouble
3 pages
When I Was
No ratings yet
When I Was
30 pages
PR1-SCRIPT
No ratings yet
PR1-SCRIPT
3 pages
Benefits of Achieving Customer Service Excellence
No ratings yet
Benefits of Achieving Customer Service Excellence
19 pages
IEPA 034 - Week 4 Vocabulary List - AVL 46-60 (5)
No ratings yet
IEPA 034 - Week 4 Vocabulary List - AVL 46-60 (5)
2 pages
Prof Ed 7 6
No ratings yet
Prof Ed 7 6
11 pages
Understanding Millennials
No ratings yet
Understanding Millennials
36 pages
Chapter 4: Emotions and Moods
No ratings yet
Chapter 4: Emotions and Moods
2 pages
Data Collection Methods: Observation
No ratings yet
Data Collection Methods: Observation
9 pages
COP1000C Computer Programming
No ratings yet
COP1000C Computer Programming
11 pages
Sustainability 12 03630
No ratings yet
Sustainability 12 03630
18 pages
Aesthetic Paradigms of Kant and Schelling
No ratings yet
Aesthetic Paradigms of Kant and Schelling
29 pages
Cum Sa Elaborezi Un Eseu Profi
No ratings yet
Cum Sa Elaborezi Un Eseu Profi
11 pages
How Can You Describe A Good Leader? How Does A Good Leader Contribute To The Overall Performance of The Group in Achieving Their Objectives?
No ratings yet
How Can You Describe A Good Leader? How Does A Good Leader Contribute To The Overall Performance of The Group in Achieving Their Objectives?
2 pages
Qustion 2020-Man QB MCQ - MAN-22509 - VI SEME 2019-20
No ratings yet
Qustion 2020-Man QB MCQ - MAN-22509 - VI SEME 2019-20
15 pages
Chapter 2 Cultural Diversity - B
No ratings yet
Chapter 2 Cultural Diversity - B
43 pages
Schizophrenia
No ratings yet
Schizophrenia
4 pages
Employer Branding
No ratings yet
Employer Branding
29 pages
A Stylistic Approach To The God of Small Things PDF
100% (1)
A Stylistic Approach To The God of Small Things PDF
75 pages
How Does Our Brain Process Language
100% (1)
How Does Our Brain Process Language
2 pages
G. Litavrin, Family Relations and Family Law in The Byzantine Countryside of The Eleventh Century - An Analysis of The Praktikon of 1073, DOP 44 (1990)
No ratings yet
G. Litavrin, Family Relations and Family Law in The Byzantine Countryside of The Eleventh Century - An Analysis of The Praktikon of 1073, DOP 44 (1990)
8 pages
Children Literature
No ratings yet
Children Literature
3 pages
Decision Making System
No ratings yet
Decision Making System
11 pages
Poetry Lesson Plan-Free Verse 1
No ratings yet
Poetry Lesson Plan-Free Verse 1
3 pages