
Forms of a TEST

Formats or Types of Questions
Refers to how the test questions are structured:

1. Selected response: It is a test format where the test-taker selects an answer from a set of given options, rather than generating their own response.
• Multiple Choice
• True/False
• Matching
Best Used For:
• Testing knowledge, comprehension, and recall
• Measuring large amounts of content efficiently
• Standardized testing (like SAT, TOEFL, etc.)

Limitations:
• Doesn’t assess deep thinking or problem-solving well
• Encourages guessing
• Hard to design good distractors (wrong answer options)
2. Limited Production: It is a test format where the test-taker produces a short, somewhat open-ended or limited response rather than selecting from given options. It falls between selected response (like multiple choice) and extended response (like essays).

• Fill-in-the-Blanks or Cloze tests
• Short Answer
• Labeling Diagrams
✅ Pros:
• Less chance of guessing
• Encourages active recall
• Useful for checking accuracy
• Often has one correct answer, allowing fairly objective grading

❌ Cons:
• Harder to grade than multiple choice
• Can still be ambiguous
• Not ideal for assessing complex thinking
3. Extended Production: This is a type of test that requires the test-taker to produce a longer, more detailed response, often involving higher-order thinking, explanation, or creative expression.

• Essay Questions
• Reports or Research Summaries
• Narrative or Creative Writing
• Problem-Solving with Justification
• Language Production Tasks (speaking or writing tasks in language testing)
Pros and Cons

✅ Pros:
• Reveals deep understanding, analysis, synthesis
• Encourages critical and original thinking, and creativity
• Can integrate multiple skills or content areas
• Measures communication skills and argumentation

❌ Cons:
• Time-consuming to grade
• Responses may vary widely between test-takers
• Grading can be subjective
• Not ideal for large-scale testing
4. The Portfolio: It is a type of authentic, performance-based assessment where a student collects and submits a series of works over time to demonstrate learning, growth, and skills.

What can be included in a portfolio?
• Essays or research papers
• Artwork or design projects
• Lab reports or science experiments
• Presentations or videos
• Journals or reflective writing
• Drafts showing revisions and improvements
• Test results or skill checklists
Pros and Cons

✅ Pros:
• Shows real progress, growth over time, and depth in learning
• Encourages reflection and self-evaluation (student ownership of learning)
• Allows for personalized demonstration of learning
• Useful for project-based, practical, or creative work

❌ Cons:
• Time-consuming to compile and assess
• Requires clear criteria to be fair
• May lack standardization
• Subjective if rubrics aren't clear
TEST ITEMS
• Test items are the individual questions or tasks that make up a test or assessment. Each item is designed to measure a specific piece of knowledge, skill, or competency in the learner.
CLASSIFICATION OF TEST ITEMS
Test items can be classified based on various criteria, including:

1. Response Type
• Selected Response: the test-taker selects from given options. Examples: multiple choice, True/False, matching.
• Constructed Response: the test-taker generates their own answer. Examples: fill-in-the-blank, short answer, essay.

2. Production Level
• Limited Production: short, specific responses. Examples: fill-in-the-blank, short answer.
• Extended Production: long, open-ended responses. Examples: essays, reports, portfolios.
3. Scoring Method
• Objective Items: have clear, correct answers; easy to score. Examples: multiple choice, True/False.
• Subjective Items: require judgment to score. Examples: essay, creative writing, oral responses.

4. Test Format or Structure
• Cloze/Deletion: words or parts are removed from a text; students fill in the gaps.
• Matching: pairs of related items to be matched.
• Problem-solving: requires reasoning, steps, and a final answer.
• Performance-Based: real-world tasks (e.g., presentations).
5. Learning Domain (Bloom’s Taxonomy)
• Knowledge: recall facts. Suitable item types: MCQs, True/False, fill-in-the-blank.
• Comprehension: understand meaning. Suitable item types: matching, short answer.
• Application: apply knowledge to new situations. Suitable item types: problem-solving.
• Analysis: break down into parts. Suitable item types: essay, case study.
• Synthesis: combine ideas creatively. Suitable item types: projects, extended writing.
• Evaluation: make judgments. Suitable item types: critiques, reflections.
ITEM DIFFICULTY
Item difficulty (p-value)

• Item difficulty refers to how easy or hard a test question (item) is for a group of test-takers.
• It is quantified by the proportion of students who answered the item correctly.
Formula:

Item Difficulty (p) = (Number of students who answered correctly) ÷ (Total number of students who answered the item)

Values range from 0 to 1.
• A higher p-value = easier item
• A lower p-value = more difficult item
Interpretation:
Ideal test construction includes items with moderate difficulty (around 0.5) to ensure a good spread of performance.

p-value: Interpretation
• 0.90 – 1.00: Very easy
• 0.70 – 0.89: Easy
• 0.40 – 0.69: Moderate (ideal range)
• 0.20 – 0.39: Difficult
• 0.00 – 0.19: Very difficult
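As a quick illustration, the p-value formula and the interpretation bands above can be sketched in Python (the function names are my own, not from the source):

```python
def item_difficulty(responses):
    """Proportion of test-takers who answered the item correctly (p-value).

    `responses` is a list of booleans: True if a student answered correctly.
    """
    if not responses:
        raise ValueError("need at least one response")
    return sum(responses) / len(responses)

def classify_difficulty(p):
    """Map a p-value to the interpretation bands shown above."""
    if p >= 0.90:
        return "Very easy"
    if p >= 0.70:
        return "Easy"
    if p >= 0.40:
        return "Moderate (ideal range)"
    if p >= 0.20:
        return "Difficult"
    return "Very difficult"

# 18 of 30 students answered the item correctly:
answers = [True] * 18 + [False] * 12
p = item_difficulty(answers)
print(p, classify_difficulty(p))  # 0.6 Moderate (ideal range)
```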
ITEM DISCRIMINATION
Item discrimination (D-index)

• It indicates how well a test item distinguishes between high-performing and low-performing students.
• A good item should be answered correctly more often by students who do well on the whole test.
Common Method (Upper-Lower Group Method):

Divide students into two groups:
• Upper group: top 27% of performers
• Lower group: bottom 27% of performers

Discrimination Index (D) = (Correct in Upper Group − Correct in Lower Group) ÷ n

where n is the number of students in each group.
Interpretation:
A negative discrimination index means low-performing students are more likely to get the item right than high-performing ones, which suggests a flawed or trick question.

D-value: Interpretation
• 0.40 and above: Excellent discriminator
• 0.30 – 0.39: Good
• 0.20 – 0.29: Acceptable
• 0.00 – 0.19: Poor (revise or discard)
• Negative: Problematic (possibly flawed or misleading)

Ideal Test Items:
• Moderate difficulty (p ≈ 0.4 – 0.7)
• High discrimination (D ≥ 0.3)
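A minimal sketch of the upper-lower group method in Python (the 27% group size follows the convention above; the function name and tie handling are illustrative assumptions):

```python
def discrimination_index(scores, item_correct, fraction=0.27):
    """Upper-lower group D-index for a single item.

    `scores`: each student's total test score.
    `item_correct`: 1/0 (or True/False) per student for this item,
    in the same order as `scores`.
    """
    # Pair each student's total score with their result on this item,
    # then sort by total score (ascending).
    paired = sorted(zip(scores, item_correct), key=lambda pair: pair[0])
    n = max(1, int(len(paired) * fraction))  # size of each group
    lower = paired[:n]    # bottom performers
    upper = paired[-n:]   # top performers
    correct_upper = sum(c for _, c in upper)
    correct_lower = sum(c for _, c in lower)
    return (correct_upper - correct_lower) / n

scores = [95, 90, 80, 75, 70, 60, 55, 50, 40, 30]
item   = [1,  1,  1,  0,  1,  0,  1,  0,  0,  0]
print(discrimination_index(scores, item))  # 1.0 (excellent discriminator)
```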
Why Are These Important?
• They help identify which items to keep, revise, or discard
• They improve test validity and fairness
• They ensure the test effectively differentiates between levels of ability
Test Method Effect
Test Method Effect refers to how the format or method of testing itself can influence a test-taker’s performance, independently of the test-taker’s actual knowledge or skills.

It is not the content being tested that causes differences in scores; it is how the test is designed or delivered.

This matters because if a test is supposed to measure someone’s ability, but performance is affected by the test method, the results can become misleading or unfair.
Examples of Test Method Effects:
1. Test Format
• A student may know the content well but perform worse on multiple-choice than on short answer due to anxiety or confusion with distractors.
2. Mode of Delivery
• Online vs. paper-based testing can affect performance depending on a student’s familiarity with technology.
3. Time Constraints
• A test may not truly reflect knowledge if the method (e.g., time-limited) causes stress or penalizes slower test-takers.
4. Cultural Bias
• A test with culturally specific content may confuse students from different backgrounds, affecting scores even if they have the required skills.
Consequences of Test Method Effects:
•Reduces the validity of the test (i.e., it may not be measuring what it claims to
measure)
•Affects fairness across test-takers
•May result in false positives or negatives (overestimating or underestimating
ability)

How to Minimize Test Method Effects:


•Use clear and simple language
•Provide practice examples or familiarization sessions
•Choose the most appropriate item format for the skill being tested
•Avoid cultural bias in examples or contexts
•Ensure the test environment is consistent and accessible
•Use multiple types of items to assess the same objective
Scoring a TEST
• Scoring is the process of assigning numerical
or qualitative values to a student’s responses
on a test. The goal is to quantify performance
so it can be interpreted, compared, and
evaluated.
Types of Scoring

1. Objective Scoring
• Used for items with clearly correct answers
• No judgment needed
• Fast, reliable, and consistent

Examples:
• Multiple Choice
• True/False
• Fill-in-the-blank (if an exact answer is expected)
• Matching

Scoring method:
Correct = 1 point, Incorrect = 0 points
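The objective rule above (Correct = 1 point, Incorrect = 0 points) amounts to comparing each response against an answer key; a minimal sketch:

```python
def score_objective(responses, answer_key):
    """Sum 1 point for each response that matches the answer key."""
    return sum(1 for given, key in zip(responses, answer_key) if given == key)

# A hypothetical 4-item multiple-choice test:
print(score_objective(["b", "a", "c", "d"], ["b", "c", "c", "d"]))  # 3
```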
2. Subjective Scoring
• Used for items that require judgment or interpretation
• Based on rubrics or scoring guides
• Can vary between scorers if not standardized

Examples:
• Essays
• Oral responses
• Project work
• Open-ended or problem-solving questions

Scoring method:
Use rubrics with levels (e.g., 1–5 or 0–10) based on specific criteria
3. Holistic vs. Analytic Scoring (for subjective items)
• Holistic: scored on overall impression. Use when quick judgments are needed, e.g., language fluency.
• Analytic: a separate score for each component (e.g., grammar, content, structure). Use when detailed feedback is needed or tasks are complex.
Holistic
Advantages:
• Considers all aspects.
• Easier to design, train graders, and grade.
• Higher reliability because there is more agreement as a whole.
Disadvantages:
• Halo effect: influence of one aspect over another.
• Compensatory scoring.
• Cannot be used for diagnostic reasons.
• Validation is weaker (two students can have the same score but unequal profiles).

Analytic
Advantages:
• More detailed feedback.
• Strengths and weaknesses can be understood better.
• No carry-over from one aspect to another.
Disadvantages:
• Not all parts equal to the whole can be included.
• Difficult to prepare.
• Language has to be broken into specific skills (e.g., hard to differentiate between grammar and vocabulary).
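Analytic scoring with a weighted rubric can be sketched as follows (the component names, ratings, and weights here are illustrative assumptions, not from the source):

```python
def analytic_score(ratings, weights):
    """Combine per-component rubric ratings into one weighted score.

    `ratings`: rating per rubric component (e.g., each on a 1-5 scale).
    `weights`: relative weight per component; weights should sum to 1.
    """
    assert abs(sum(weights.values()) - 1.0) < 1e-9, "weights must sum to 1"
    return sum(ratings[component] * w for component, w in weights.items())

# Hypothetical essay rubric: content counts most, then grammar, then structure.
ratings = {"grammar": 4, "content": 5, "structure": 3}
weights = {"grammar": 0.3, "content": 0.5, "structure": 0.2}
print(analytic_score(ratings, weights))
```

Keeping the per-component ratings around (rather than only the combined number) is what lets analytic scoring give the detailed feedback described above.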
Scoring Techniques

Raw Score
• The total number of points earned
  e.g., 42 out of 50
Percentage Score
• (Raw Score ÷ Total Possible Score) × 100
  e.g., (42 ÷ 50) × 100 = 84%
Scaled Score
• Converts raw scores to a consistent scale (e.g., 200–800 on the SAT)
• Useful in standardized testing
Weighted Score
• Some items or sections carry more weight
  e.g., Essay = 40%, MCQ = 60% of total
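The percentage and weighted-score formulas above can be sketched as (section names are illustrative):

```python
def percentage_score(raw, total):
    """(Raw Score / Total Possible Score) * 100."""
    return raw / total * 100

def weighted_score(section_percents, weights):
    """Combine section percentages using their weights (weights sum to 1)."""
    return sum(section_percents[name] * w for name, w in weights.items())

print(percentage_score(42, 50))  # 84.0

# Essay counts for 40% of the total, MCQ for 60%:
print(weighted_score({"essay": 90, "mcq": 80},
                     {"essay": 0.40, "mcq": 0.60}))
```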
Grading a TEST
• Grading a test involves assigning a value
(usually numerical or letter-based) to represent a
student's performance. It reflects how well a
student understood or mastered the material
assessed in the test.
Types of Grading Systems
1. Absolute Grading
• Based on pre-set criteria or score ranges
• Common in schools and standardized exams

Score (%): Grade
• 90–100: A
• 80–89: B
• 70–79: C
• 60–69: D
• Below 60: F
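The score bands above map directly onto a small lookup function; a minimal sketch:

```python
def absolute_grade(percent):
    """Map a percentage to a letter grade using the bands shown above."""
    if percent >= 90:
        return "A"
    if percent >= 80:
        return "B"
    if percent >= 70:
        return "C"
    if percent >= 60:
        return "D"
    return "F"

print(absolute_grade(84))  # B
```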
2. Relative Grading (Norm-Referenced)
• Based on student performance in comparison to peers
• Often uses a bell curve
• Only a certain percentage get each grade

Top 10%: A | Next 20%: B | Middle 40%: C | Next 20%: D | Bottom 10%: F

• Can be unfair in small classes or if all students perform well
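The 10/20/40/20/10 split above can be sketched as a rank-based assignment (a simplified sketch: ties are broken by sort position, which a real grading policy would handle more carefully):

```python
def relative_grades(scores):
    """Assign letter grades by class rank using the 10/20/40/20/10 split."""
    n = len(scores)
    # Rank students from highest to lowest total score.
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)
    # Cumulative share of the class receiving each grade.
    bands = [(0.10, "A"), (0.30, "B"), (0.70, "C"), (0.90, "D"), (1.00, "F")]
    grades = [None] * n
    for rank, student in enumerate(order):
        share = (rank + 1) / n  # fraction of the class at or above this rank
        grades[student] = next(g for cutoff, g in bands if share <= cutoff)
    return grades

# 10 students, scores already in descending order:
print(relative_grades([95, 90, 85, 80, 75, 70, 65, 60, 55, 50]))
# ['A', 'B', 'B', 'C', 'C', 'C', 'C', 'D', 'D', 'F']
```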
3. Criterion-Referenced Grading
• Grades based on specific learning objectives or
rubrics
• Focuses on mastery of content, not how others
performed

4. Pass/Fail Grading
• Used in some practical or qualifying exams
• Students either meet a minimum standard or do not
Steps in Grading a Test
1. Score individual items
   • Objective items (e.g., MCQs): use an answer key
   • Subjective items (e.g., essays): use a rubric for consistent evaluation
2. Calculate raw score
   • Total correct answers or points earned
3. Convert raw score to percentage or grade
   • e.g., 45/50 = 90% = A
4. Apply grading system (absolute, relative, etc.)
5. Give feedback
   • Highlight strengths, weaknesses, and areas to improve
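Steps 1–4 above can be put together in one short sketch (helper names and the point split are illustrative; the scale follows the absolute grading bands earlier):

```python
def grade_test(objective_points, essay_points, total,
               scale=((90, "A"), (80, "B"), (70, "C"), (60, "D"))):
    """Score items, total the raw score, convert to a percent, apply a scale."""
    raw = objective_points + essay_points            # step 2: raw score
    percent = raw / total * 100                      # step 3: percentage
    letter = next((g for cut, g in scale if percent >= cut), "F")  # step 4
    return raw, percent, letter

print(grade_test(30, 15, 50))  # raw 45, 90% of total, grade 'A'
```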

Grading Tools
• Rubrics: Predefined criteria for scoring open-ended
responses
• Answer Keys: For objective questions
• Gradebooks: Digital or paper systems to track scores
• Software: Learning management systems (e.g., Google
Classroom, Canvas) automate grading
Scoring vs. Grading

Scoring:
• Assigning points to responses
• More detailed and raw
• Happens first

Grading:
• Converting scores into a grade (A, B, C, etc.)
• More summarized
• Happens after scoring
