
Lesson 5: Criteria to Consider when Constructing Good Test Items and Performance Tasks
Let’s try this!

Choose the letter of the correct answer.


1. The students of Teacher Louie are very noisy. To keep them busy, they were given whatever test was available in the classroom, and the results were graded as a way to punish them. Which statement best explains whether the practice is acceptable?
a. The practice is acceptable because the students behaved well when they were given the test.
b. The practice is not acceptable because it violates the principle of reliability.
c. The practice is not acceptable because it violates the principle of validity.
d. The practice is acceptable since the results are graded.
Let’s try this!

2. Which is an acceptable practice when evaluating students?
a. Evaluation should be based on information obtained from measuring instruments on cognitive behaviors.
b. The evaluation method should be selected based on the desired trait to measure.
c. Evaluation results should be used to grade students.
d. Evaluation should be done at the end of instruction.
Let’s try this!

3. Teacher Rhodalyn wants to test the reliability of her achievement test in TLE. Which of the following activities will help her achieve her purpose?
a. Administer two parallel tests to different groups of students.
b. Administer two equivalent tests to the same group of students.
c. Administer a single test to two different groups of students.
d. Administer two different tests to the same group of students.
Let’s try this!

4. Mrs. Aluyog developed an achievement test in TLE for Grade 7 students. Before she finalized the test, she asked her head to determine whether the test items were constructed based on the behavior domain to be measured. What characteristic of a test did she establish?
a. Validity
b. Scorability
c. Reliability
d. Administrability
Let’s try this!

5. Mrs. Garcia wants to establish the reliability of her test. However, she has only one form of the test, and she administered it only once. What test of reliability can she do?
a. Test of stability
b. Test of equivalence
c. Test of correlation
d. Test of internal consistency
Validity

• It is the degree to which the test measures what it is intended to measure.
• It is the usefulness of the test for a given purpose.
• It is the most important criterion of a good examination.
• A validity coefficient should be at least 0.5, but preferably higher.
Factors influencing the Validity of the test

• Appropriateness of the test
• Directions
• Reading vocabulary and sentence structure
• Difficulty of items
 The acceptable index of difficulty is 0.2 – 0.8 (above 0.8 means the item is too easy; below 0.2 means it is too difficult).
 The acceptable index of discrimination is 0.3 – 1.0 (below 0.3 means poor discriminatory power). A worked sketch of both indices follows this list.
• Construction of test items
• Length of the test
Factors influencing the Validity of the test

• Arrangement of items
• Patterns of answers

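Both indices above are simple proportions, so they can be computed directly from a scored answer sheet. Below is a minimal Python sketch (the data and function names are hypothetical; numpy is the only assumed dependency): difficulty is the fraction of examinees answering the item correctly, and discrimination contrasts the item's difficulty in the upper and lower 27% of examinees ranked by total score, a common convention.

```python
import numpy as np

def item_difficulty(item_scores):
    """Difficulty index p: proportion of examinees answering the
    item correctly (1 = correct, 0 = wrong). Higher p = easier."""
    return float(np.mean(item_scores))

def item_discrimination(item_scores, total_scores, frac=0.27):
    """Discrimination index D: the item's difficulty in the
    upper-scoring group minus its difficulty in the lower-scoring
    group (upper/lower 27% by total score is a common convention)."""
    n = len(total_scores)
    k = max(1, int(round(n * frac)))
    order = np.argsort(total_scores)        # examinees, lowest total first
    lower, upper = order[:k], order[-k:]
    return float(np.mean(item_scores[upper]) - np.mean(item_scores[lower]))

# Hypothetical data: one item's 0/1 scores and the total test scores
# of ten examinees.
item   = np.array([1, 1, 1, 0, 1, 0, 1, 0, 0, 1])
totals = np.array([38, 35, 30, 12, 28, 10, 33, 15, 9, 26])

p = item_difficulty(item)               # 0.60 -> inside 0.2-0.8, acceptable
D = item_discrimination(item, totals)   # 1.00 -> well above the 0.3 floor
print(f"difficulty = {p:.2f}, discrimination = {D:.2f}")
```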
Ways in Establishing Validity

• Face Validity – examining the physical appearance of the test.
• Content Validity – careful and critical examination of the objectives of the test.
• Criterion-related Validity – the set of scores from the test is correlated with the scores obtained on another external predictor or measure.
Ways in Establishing Validity

• Purposes of Criterion-related Validity
 Concurrent Validity – establishes the present status of the individual by correlating the sets of scores obtained from two measures given concurrently.
 Predictive Validity – establishes the future performance of an individual by correlating the sets of scores obtained from two measures given at a longer time interval.
• Construct Validity – comparing psychological traits or factors that theoretically influence scores on a test.
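To make the correlation step concrete, here is a minimal Python sketch of computing a criterion-related validity coefficient (the scores and variable names are hypothetical; numpy is the only assumed dependency). The same Pearson r serves concurrent or predictive validity; only the timing of the criterion measure differs, and by the rule of thumb above a coefficient of at least 0.5 would be acceptable.

```python
import numpy as np

# Hypothetical scores of eight students on a newly developed test
# and on an external criterion measure (a standardized test given
# concurrently for concurrent validity, or a later measure such as
# final grades for predictive validity).
new_test  = np.array([55, 62, 70, 48, 81, 66, 59, 74], float)
criterion = np.array([58, 60, 73, 45, 85, 64, 55, 78], float)

# The validity coefficient is the Pearson r between the two score sets.
r = np.corrcoef(new_test, criterion)[0, 1]
print(f"criterion-related validity coefficient r = {r:.2f}")
```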
Ways in Establishing Validity

• Types of Construct Validity
 Convergent Validity – the instrument defines a similar trait (e.g., a critical thinking test being developed may be correlated with a standardized critical thinking test).
 Divergent Validity – the instrument describes only the intended trait and not other traits (e.g., the critical thinking test may not be correlated with a reading comprehension test).
Reliability

• It refers to the consistency of scores obtained by the same person when retested using the same instrument or one parallel to it.
• The reliability coefficient should be at least 0.7, but preferably higher.
Factors affecting Reliability

• Length of the test
 The longer the test, the higher the reliability.
 A longer test provides a more adequate sample of the behavior being measured and is less distorted by chance factors such as guessing.
• Difficulty of the test
 An achievement test should be constructed such that the average score is 50 percent correct and the scores range from near zero to near perfect.
 The bigger the spread of the scores, the more reliable the measured differences are likely to be.
 A test is considered reliable if the coefficient of correlation is not less than 0.85.
Factors affecting Reliability

• Objectivity
 Eliminate the bias, opinions, or judgments of the person who checks the test.
• Administrability
 The test should be administered with ease, clarity, and uniformity so that the scores obtained are comparable.
 Uniformity can be obtained by setting a time limit and standard oral instructions.
• Scorability
 The test should be easy to score: the directions for scoring are clear, the scoring key is simple, and provisions for answer sheets are made.
Factors affecting Reliability

• Economy
 The test should be given in the cheapest way, which means that answer sheets should be provided so the test can be given from time to time.
• Adequacy
 The test should contain a wide sampling of items to determine the educational outcomes or abilities, so that the resulting scores are representative of total performance in the areas measured.
Methods of Establishing Reliability

• Test-Retest (measure of stability) – give the same test twice to the same group, with any time interval between tests, from several minutes to several years. Statistical measure: Pearson r.
• Equivalent Forms (measure of equivalence) – give parallel forms of the test with a close time interval between them. Statistical measure: Pearson r.
• Test-Retest with Equivalent Forms (measure of stability and equivalence) – give parallel forms of the test with an increased time interval between forms. Statistical measure: Pearson r.
• Split-Half (measure of internal consistency) – give the test once, then score equivalent halves of the test (e.g., the odd- and even-numbered items). Statistical measures: Pearson r and the Spearman-Brown formula.
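Since the split-half method needs only a single administration, it is easy to compute directly. Below is a minimal Python sketch (hypothetical 0/1 response matrix; numpy is the only assumed dependency): it correlates the odd- and even-numbered half scores and steps the result up to full-test length with the Spearman-Brown formula, r_full = 2·r_half / (1 + r_half).

```python
import numpy as np

def split_half_reliability(responses):
    """Score the odd- and even-numbered items of a single
    administration separately, correlate the two half-test scores
    (Pearson r), then step up to full-test length with the
    Spearman-Brown formula: r_full = 2*r_half / (1 + r_half)."""
    responses = np.asarray(responses, float)
    odd  = responses[:, 0::2].sum(axis=1)   # items 1, 3, 5, ...
    even = responses[:, 1::2].sum(axis=1)   # items 2, 4, 6, ...
    r_half = np.corrcoef(odd, even)[0, 1]   # Pearson r of the halves
    return 2 * r_half / (1 + r_half)        # Spearman-Brown step-up

# Hypothetical 0/1 response matrix: 6 examinees x 8 items
data = np.array([
    [1, 1, 1, 1, 1, 1, 0, 1],
    [1, 1, 1, 0, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1, 0, 1, 0],
    [1, 0, 0, 1, 0, 0, 0, 1],
    [0, 0, 1, 0, 0, 1, 0, 0],
])
print(f"split-half reliability = {split_half_reliability(data):.2f}")
```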
Methods of Establishing Reliability

• Kuder-Richardson (measure of internal consistency) – give the test once, then correlate the proportion/percentage of students passing and not passing a given item. Statistical measures: Kuder-Richardson Formulas 20 and 21.
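The Kuder-Richardson formulas likewise need only one administration. Here is a minimal sketch under the same assumptions (hypothetical 0/1 response matrix; numpy assumed): KR-20 works from each item's passing proportion p and failing proportion q = 1 − p, while KR-21 approximates it using only the mean total score.

```python
import numpy as np

def kr20(responses):
    """KR-20 = (k/(k-1)) * (1 - sum(p*q) / var(total)), where k is the
    number of items, p the proportion passing each item, q = 1 - p,
    and var(total) the variance of examinees' total scores."""
    responses = np.asarray(responses, float)
    k = responses.shape[1]
    p = responses.mean(axis=0)                # proportion passing each item
    q = 1.0 - p                               # proportion failing each item
    var_total = responses.sum(axis=1).var()   # variance of total scores
    return (k / (k - 1)) * (1 - (p * q).sum() / var_total)

def kr21(responses):
    """KR-21 approximates KR-20 from the mean total score M alone:
    KR-21 = (k/(k-1)) * (1 - M*(k - M) / (k * var(total)))."""
    responses = np.asarray(responses, float)
    k = responses.shape[1]
    totals = responses.sum(axis=1)
    m, var_total = totals.mean(), totals.var()
    return (k / (k - 1)) * (1 - m * (k - m) / (k * var_total))

# Hypothetical 0/1 response matrix: 6 examinees x 8 items
data = np.array([
    [1, 1, 1, 1, 1, 1, 0, 1],
    [1, 1, 1, 0, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1, 0, 1, 0],
    [1, 0, 0, 1, 0, 0, 0, 1],
    [0, 0, 1, 0, 0, 1, 0, 0],
])
print(f"KR-20 = {kr20(data):.2f}, KR-21 = {kr21(data):.2f}")
```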
Criteria in Selecting a Good Performance Assessment Task

• Generalizability
 The degree to which students' performance on the task generalizes to comparable tasks.
• Authenticity
 The task reflects what students will be doing in the real world.
• Multiple Foci
 The task measures multiple instructional outcomes or targets.
• Teachability
 The assessment task can itself serve as a learning or teaching task.
Criteria in Selecting a Good Performance Assessment Task

• Feasibility
 The task is realistically implementable in relation to its cost, space, time, and equipment.
• Scorability
 Scoring is well defined and can be easily determined.
• Fairness
 The task is fair to all students.
Guidelines in Grading Students

• Explain your grading system to the students early in the course and remind them of the grading policies regularly.
• Base grades on a predetermined and reasonable set of standards.
• Base your grades on as much objective evidence as possible.
• Base grades on the students' attitude as well as achievement, especially at the elementary and high school levels.
Guidelines in Grading Students

• Base grades on the students' relative standing compared to their classmates.
• Base grades on a variety of sources.
• Become familiar with the grading policy of your school and with your colleagues' standards.
• When failing a student, closely follow school procedures.
• Guard against bias in grading.
• Keep students informed of their standing in the class.
