
Measurement and evaluation

Measurement: the process of obtaining a numerical description of the degree to which an individual possesses a given attribute.
Numbers are assigned through tests and similar instruments.

Test: a tool, procedure, examination, assessment, or measure of an outcome.
Tests are designed to measure any quality, ability, skill, or knowledge.
We test:
Achievement: knowledge
Personality: characteristics
Aptitude: potential to succeed
Ability or intelligence: skill

We test to classify, place, select, and diagnose.

Instructional: assess progress.
Curricular: make decisions about school curricula.
Selection: determine ability.
Placement: group students.
Personal: help individuals make wise decisions for themselves.
Evaluation:
Establishing objectives
Classifying objectives
Defining objectives
Selecting indicators
Comparing data with objectives
Making judgements on worth: how good…?

Evaluation is analyzing information to determine the extent of students' achievement of objectives.

Assessment principles:
Address learning targets
Provide efficient feedback.
Use a variety of assessment procedures.
Ensure that assessments are valid.
Keep records of assessments.
Address the results meaningfully.

Measurement: a quantitative determination of the level of an individual's performance.
Measurement describes a situation; evaluation judges its worth or value.

Assessment:
Gathering information
Pinpoints strengths and weaknesses
Diagnostic and formative
Focus on individual students.

Evaluation:
Setting a value on assessment information
Judgment
Ranks and sorts.
Summative
Focus on group.

Formative test: monitors the attainment of instructional objectives.
Summative test: measures the extent to which a student has attained the desired outcome.
Standardized test: valid, reliable, and objective.
Norm-referenced test: based on the standard level of accomplishment of the whole group taking the test.
Criterion-referenced test: a measuring device with a predetermined level of success for test takers.

Levels of measurement:
Variable: takes on more than one value and can be measured in different ways; the way it is measured determines the level of measurement being used.
Measurement: the assignment of labels to a variable or an outcome. The level represents how much information is being provided by the outcome measure.

Nominal: differences in quality; discrete in nature// example: hair color, nationality, names.
Ordinal: can be ordered or ranked// example: level of education.
Interval: assigns values to outcomes based on a continuum of equal intervals// example: the difference between 100 degrees and 90 degrees is the same as between 60 degrees and 70 degrees.
Ratio: true zero point// example: income, height, weight,
unemployment rate.

Data: pieces of information that you collect and examine about your topic.
Variable: an element that is liable to change.
Statistics: describing and analyzing quantitative data// example: mean.

Measures of central tendency: indices that represent the average score among a group of scores.
Mean, median, mode.
Mean: X̄ = ∑X / n
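A minimal sketch, not from the notes, showing all three measures on a hypothetical score list using Python's built-in statistics module:

```python
# Minimal sketch: mean, median, and mode on made-up test scores.
import statistics

scores = [70, 75, 75, 80, 85, 90, 95]

mean = statistics.mean(scores)      # X-bar = sum of scores / n
median = statistics.median(scores)  # middle score when ordered
mode = statistics.mode(scores)      # most frequent score

print(mean, median, mode)  # 81.43..., 80, 75
```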

Frequency: the number of times something occurs// example: how many times will the coin land tails side up?
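A quick hypothetical illustration of frequency as a count of occurrences, simulating the coin example:

```python
# Minimal sketch: count the tails in 100 simulated coin flips.
import random

flips = [random.choice(["heads", "tails"]) for _ in range(100)]
tails = flips.count("tails")
print(f"The coin landed tails side up {tails} times out of 100")
```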

3 common measures of variability:


Range: difference between highest and lowest score.
Quartile deviation.
Upper quartile: top 25%
Lower quartile: lowest 25%
QD= (Q3-Q1)/2

Variance: the amount of spread amongst scores.


Standard deviation: the square root of variance. Used with interval and ratio data (see the sketch below).
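A minimal sketch, not from the notes, computing the three variability measures on hypothetical scores (treated as a full population, hence pvariance/pstdev):

```python
# Minimal sketch: range, quartile deviation, variance, and SD.
import statistics

scores = [55, 60, 65, 70, 75, 80, 85, 90]

score_range = max(scores) - min(scores)          # highest minus lowest

q1, _, q3 = statistics.quantiles(scores, n=4, method="inclusive")
qd = (q3 - q1) / 2                               # QD = (Q3 - Q1) / 2

variance = statistics.pvariance(scores)          # spread among scores
sd = statistics.pstdev(scores)                   # square root of variance

print(score_range, qd, variance, sd)
```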
Distribution: when a distribution is not normal, it is said to be skewed.
Skewed: not symmetrical; skew can be positive or negative.
Measures of relative position: where a score falls in a distribution relative to all other scores.
How well an individual has scored in comparison to others.
Measures: percentile ranks and standard scores.

The z-score of the mean = 0.
A score 1 standard deviation above the mean has a z-score of 1.
z = (x − μ) / σ, where x is the raw score, μ is the mean, and σ is the standard deviation.

t-score: T = 10z + 50 (multiply the z-score by 10 and add 50).
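A small sketch, with made-up values, converting a raw score to a z-score and then a t-score using the two formulas above:

```python
# Minimal sketch: raw score -> z-score -> t-score.
import statistics

scores = [60, 70, 75, 80, 85, 90, 100]
mean = statistics.mean(scores)   # 80.0
sd = statistics.pstdev(scores)   # ~12.25

x = 90                           # raw score of interest
z = (x - mean) / sd              # SDs above/below the mean (~0.82)
t = 10 * z + 50                  # rescaled: mean 50, SD 10 (~58.2)

print(f"z = {z:.2f}, T = {t:.1f}")
```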

Types of reliability:
Consistency of scores
Consistency among raters
Consistency across time

Theory of reliability:
X = T + E
X is the observed score// what you got on the test.
T is the true score// an accurate reflection of what you really know.
E is measurement error// the day-to-day difference between the true score and the observed score.

Observed score = true score + error.

As error increases, reliability decreases; as error decreases, reliability increases.

Sources of error:
Trait error: did not study.
Method error: lousy instruction, hot room.
Administration errors: inaccurate timing.
Scoring errors: subjective scoring, clerical errors.

Reliability is calculated using a correlation coefficient (rxy).
It ranges between .00 and 1.0; higher = more reliable.
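A rough simulation, not from the notes, tying the two ideas together: observed scores are generated as X = T + E, and reliability is estimated as the correlation rxy between two administrations of the same test. All parameters are invented; statistics.correlation needs Python 3.10+.

```python
# Rough sketch: more error -> lower reliability (correlation).
import random
import statistics

random.seed(1)
true_scores = [random.gauss(75, 10) for _ in range(500)]

for error_sd in (2, 10, 20):
    # each administration adds fresh random error to the true score
    form_a = [t + random.gauss(0, error_sd) for t in true_scores]
    form_b = [t + random.gauss(0, error_sd) for t in true_scores]
    r_xy = statistics.correlation(form_a, form_b)
    print(f"error SD = {error_sd:2d} -> reliability ~ {r_xy:.2f}")
```

As the error term grows, the correlation between the two score sets falls, matching the rule above.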

Types of reliability:

Test-retest: tests whether an exam is reliable over time.
Problems: practice effects, time between tests, and the nature of the sample.

Parallel forms: examines the similarity of two different forms of the same test.

Internal consistency: determines whether items consistently represent one construct.

Interrater: how much two raters agree in their judgements of some outcome.
Interrater reliability = #agreements / #possible agreements
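A minimal sketch of the agreement formula above, with hypothetical rater judgements:

```python
# Minimal sketch: interrater reliability = agreements / possible.
rater_1 = ["pass", "pass", "fail", "pass", "fail", "pass"]
rater_2 = ["pass", "fail", "fail", "pass", "fail", "pass"]

agreements = sum(a == b for a, b in zip(rater_1, rater_2))
possible = len(rater_1)

print(agreements / possible)  # 5 / 6 ~ 0.83
```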

To increase reliability:
Increase standardization of tests
Increase number of items
Delete unclear items.
Moderate difficulty
Minimize effect of external events.

Validity: the tool measures what it claims to measure.


Threats:

Construct underrepresentation: when a test fails to measure an important aspect of the specified construct.
Construct-irrelevant variance: when a test measures characteristics, content, or skills that are unrelated to the test construct.

Types of validity evidence:


Content
Criterion
Construct
Consequential

Content validity: the characteristic of a test made up of items that fairly represent all the items that could be on the test.
Tests with well-established content validity have a list or table detailing the material elements of the construct.

Criterion validity: the characteristic of a test that produces scores correlated with some other measure.
Criterion validity is important to establish for tests that predict future performance or estimate concurrent performance on some other test.
There are two kinds: predictive criterion validity and concurrent criterion validity.
Predictive validity: assesses whether a test reflects a set of abilities in the future// example: entrance exams.

Concurrent validity: assesses whether a test reflects a set of abilities at the current moment// example: licensing exams.
Construct validity: characteristic of a test with scores that reflect the
construct (invisible trait) a test is intended to measure.
Cognitive abilities, intelligence, addiction.

Consequential validity: concerned with unintended social consequences from the use of a test.
The aim is for test use to help society and the people being tested.

Biased test: unfair towards a certain group.

If you can't establish validity:
Redo the questions.
Revisit underdeveloped models.

Validity is more important than reliability.


Steps to develop a classroom test:

Identification and statement of educational objectives is the first step.
Educational objectives: goals describing what you hope the student will learn.
Also referred to as instructional or learning objectives.

Bloom’s taxonomy: knowledge, comprehension, application, analysis,


synthesis, evaluation.
To show compatibility between class instruction and test content, use a table of specifications (test blueprint).
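A minimal sketch of what such a blueprint might look like, crossing content areas with Bloom's taxonomy levels. All topics and item counts are invented:

```python
# Minimal sketch: a test blueprint as content areas x Bloom levels,
# where each cell is the planned number of items.
blueprint = {
    "Measurement": {"knowledge": 4, "comprehension": 3, "application": 2},
    "Reliability": {"knowledge": 3, "comprehension": 4, "application": 3},
    "Validity":    {"knowledge": 3, "comprehension": 3, "application": 5},
}

total_items = sum(sum(row.values()) for row in blueprint.values())
print(f"Planned test length: {total_items} items")  # 30 items
```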

Norm-referenced assessment: compares a student's performance to that of other students.
Criterion-referenced assessment: compares students' performance to an absolute standard or criterion.

Selected response: items require a student to select a response from available alternatives.
Strengths:
Can include many items, which facilitates adequate sampling of the content.
Scored in an efficient, objective, and reliable manner.
Good at measuring lower-level objectives.
Weaknesses:
Difficult to write.
Not able to assess all educational objectives.
Subject to random guessing.

Constructed response: items require the student to construct a response.
Strengths:
Easier to write.
Assess higher order cognitive abilities.
Eliminate random guessing.

Weaknesses:
Can't include many items in a test, so the content domain is not sampled as thoroughly.
More difficult to score in a reliable manner.
Sensitive to feigning.

Suggestions for assembling an assessment:


Adhere to your table of specifications.
Provide clear instructions.
State items clearly.
Include items that contribute to the reliability and validity of your
assessment results.
Multiple-choice items:
The most popular format.
Objective items.
The preferred way of testing achievement-oriented outcomes.
Stem: the question or incomplete statement.

Possible answers: alternatives.
Incorrect alternatives: distractors.

Benefits:
Easy to score.
Easy to analyse.
Flexible.
Easy to create items that match learning objectives.
Can be written at any level of Bloom's taxonomy.

Distractors should be plausible.

No intentional clues should be given.


No inconsistent lengths.
No inconsistent categories.
• Best-answer multiple-choice items: There may be more than one
correct answer, but only one of them is the best.
• Rearrangement multiple-choice items: Here is where the test
taker arranges a set of items in sequential order.
• Interpretive multiple-choice items: Test taker reads through a
passage and then selects a response where the alternatives all are
based on the same passage.
• Substitution multiple-choice items: the test taker selects, from a set of responses, those he or she thinks answer the question correctly.

Strengths:
Versatile
Can be scored in a reliable manner.
Easy to refine using results of item analysis.
Efficient way of sampling content domain.
Weaknesses:
Not effective for measuring all educational objectives.
Not easy to write.
Limit creativity.

Matching items:
Assess a particular topic.
Easy to administer.
Easy to score.
An acceptable tool for assessment.

Good when there are lots of possible answers without repetition.
Matching involves selection; use it when you would otherwise need more than five alternatives for a multiple-choice item or more than two alternatives for a true/false item.

Premises: the statements in one column.
Options: the responses.
Should be reasonable.
List responses in different order than premises.
Place premises in logical order.
Make sure they are on the same page.

Pros:
Easy to score.
Scored in a reliable manner.
Easy to administer to large numbers.
Responses are short and easy.
Allow comparison of ideas.
Cons:
Limited knowledge testing.
Scoring can be a problem.
Good memory is needed.

True/false items:
Used to assess achievement when there is a clear distinction between two alternatives.
Also called binary-choice items.

Use declarative sentences.
Make the choice clear and binary.
Focus on one specific topic.
Avoid statements of opinion.
Give no clues.

Pros:
Reliable and objective scoring.
Efficient.

Cons:
Vulnerable to guessing.
Subject to response sets.
Not easy to write.

The probability of guessing correctly on a true/false item is 50%.
Limited to knowledge-based items.

Correction-for-guessing formula:
CS = R − W
Corrected score = number correct − number incorrect.
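A minimal sketch of this true/false correction, where a wrong answer cancels out a lucky guess. The numbers are made up:

```python
# Minimal sketch: corrected score CS = R - W for binary items.
def corrected_score(right: int, wrong: int) -> int:
    return right - wrong

# 40 right, 10 wrong on a 50-item true/false test:
print(corrected_score(40, 10))  # 30
```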

Constructed response items: essays or short answers.
Known as supply items: the student supplies rather than selects an answer.
Used to assess lower-level thinking skills.
Focus on a certain level of material.

Pros:
Flexible
Minimized guessing
Easy to write.
Cons:
No machine scoring.
Subjective.
Limited cognitive skills assessed.
Questions hard to create.

Essay items:
Higher level thinking.
Informative responses.
There are open-ended and close-ended essay items.
Open-ended: unrestricted.
Close-ended: restricted.

Assess different levels of complexity.


Make sure the question is complete and clear.
Evaluate higher order outcomes.
Have all test takers answer the same questions.
Allow adequate time to answer.
Pros:
Shows how to relate ideas to each other.
Increases security.
Flexibility.
Easy to construct.

Cons:
Emphasize writing.
Tough to write.
Not easy to score.

Use a model correct answer for comparison when scoring.
Grade responses without knowing the identity of the writer.
Take your time.

Developing a rubric:
A systematic scoring guideline to evaluate students’ performance
(papers, speeches, problem solutions, portfolios, cases) using a detailed
description of performance levels.
Gives consistent score.
Makes us more aware of expectations.
Components:
Task description
Criteria
Level of attainment

Rubric:
A flexible tool to measure students' learning related to a specific objective of a task.
Provides reliable, consistent grading.

For teachers:
Provides students with detailed feedback.
Encourages critical thinking.
Helps refine teaching skills.

For students:
Helps them monitor and critique their own work.
Provides informative descriptions of expected performance.

A good rubric is:


Well defined.
Context specific.
Finite and exhaustive.
Ordered.
Related to a common core theme.

Descriptive rubric:
Allows scoring of a task on several different aspects of the task.
Pros: provides judgment on each criterion.
Cons: time-consuming to make.

Holistic rubric:
Single scale with all criteria included in the evaluation being considered
together.
Pros: saves time in scoring.
Cons: no specific feedback.

Use rubrics on:


Projects
Presentations
Portfolios
(performance based)
Sample work should be scored.
More than one evaluator should score papers; if two disagree, a third decides (see the sketch below).
Frequent disagreements mean that the rubric needs to be adjusted.
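A rough sketch of this adjudication rule, with hypothetical rubric scores (1-5):

```python
# Rough sketch: two raters score each paper; when they disagree,
# a third rater's score decides.
papers = [
    {"rater_1": 4, "rater_2": 4, "rater_3": 3},
    {"rater_1": 2, "rater_2": 3, "rater_3": 3},
    {"rater_1": 5, "rater_2": 5, "rater_3": 4},
]

disagreements = 0
for i, p in enumerate(papers, start=1):
    if p["rater_1"] == p["rater_2"]:
        final = p["rater_1"]        # the two raters agree
    else:
        final = p["rater_3"]        # the third rater decides
        disagreements += 1
    print(f"Paper {i}: final score {final}")

# a high rate would signal that the rubric needs adjusting
print(f"Disagreement rate: {disagreements}/{len(papers)}")
```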

Using a rubric with students:
Explain what the test will emphasize.
Inform students how the assessment will be recorded.
Explain how the results will be used.
Make sure the rubric is understandable.
This works best with holistic rubrics.
Provide the rubric in advance.

Differentiated instruction:
Proactive acceptance of and planning for student differences in readiness, motivation, and learning profiles.
Making adjustments throughout the teaching and learning cycle.

Why do we need it?
Autism rates have risen.
One in five children experiences behavioural or emotional difficulty.
Some students live in poverty.
Some students are gifted.

To differentiate instruction, students should have access to a high-quality curriculum and documented assessment.

A classroom is a system of five interdependent elements:
1. Classroom environment
2. Curriculum
3. Assessment
4. Instruction
5. Classroom management

Kinds of assessment:
Formative: adjusting the course of instruction to improve outcomes.
Summative: measuring and evaluating student outcomes.

When to assess for effective differentiation?


Pre-assessment
Formative assessment
Summative assessment.

• Choice is key to the process.
• Learning tasks always consider the students' strengths and weaknesses.
• Groupings of students will vary: some will work better independently, and others will work in various group settings.
• Multiple intelligences are taken into consideration, as are the students' learning and thinking styles. Lessons are authentic to ensure that all students can make connections.
• Project- and problem-based learning are also key in differentiated instruction and assessment.
• Lessons and assessments are adapted to meet the needs of all learners.
• Opportunities for children to think for themselves are evident.

Examples of Differentiated assessment:


Quizzes
Debates
Journals
Peer-evaluations.

Approaches:
Find ways to get to know students better.
Build small-group teaching into daily or weekly routines.
Offer more ways to explore and express learning.
Teach in multiple ways.
Allow students to work alone or with peers.

One lesson plan a month is enough to assess differentiation.
