1.0 Brief Overview of Educational Assessment
Constructs are abstractions of mental processes that are related to behavior or experience (Murphy & Davidshofer).
“Achievement, defined as the extent to which students can demonstrate mastery of a scholastic
curriculum, is the most frequently assessed construct in the classroom” (Chatterji, p. 27).
A test is a sample of behavior obtained under standardized conditions with established rules for scoring (Murphy & Davidshofer, p. 3; Standards, p. 3). (Please note that the terms “test,” “exam,” and “assessment” will be used
interchangeably.) Tests vary in the precision and detail of their scoring, from the exact scoring
of multiple-choice tests to the more subjective judgment entailed by short answer or essay
tests. Tests may be used to assess maximal performance, such as aptitude or achievement
tests (examinees are asked to “do their best”), or may be used to assess typical performance,
such as an attitude survey or personality inventory (respondents are asked to report their typical feelings or behavior). Norm-referenced tests compare an examinee’s score to the scores of a norm group. Norm groups vary, depending on the purpose of the assessment. For
example, the scores of a child on an intelligence test may be compared to a group of children of
the same age, which would indicate the child’s standing compared to their age group. Well-
known norm-referenced aptitude tests are the Scholastic Assessment Test and the Graduate
Record Examinations. Grading “on a curve” is also a norm-referenced procedure in which the
class itself serves as the norm. So, for example, the top 20% may receive an A, the second 20% a B, and so on.
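For illustration, the following Python sketch implements such a curve (a minimal sketch, assuming the five equal 20% bands of the example above; the function name and scores are invented):

    def grade_on_a_curve(scores):
        # Rank examinees from highest to lowest raw score; each grade depends
        # on standing within the class, not on an absolute standard.
        order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
        bands = "ABCDF"
        grades = [None] * len(scores)
        for rank, i in enumerate(order):
            # Map each rank into one of five equal bands (top 20% = A, etc.).
            grades[i] = bands[min(rank * 5 // len(scores), 4)]
        return grades

    scores = [95, 88, 76, 71, 64]
    print(list(zip(scores, grade_on_a_curve(scores))))
    # [(95, 'A'), (88, 'B'), (76, 'C'), (71, 'D'), (64, 'F')]

Because the grade distribution is fixed in advance, some examinees must receive low grades even when every raw score is high; this is a consequence of letting the class itself serve as the norm.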
In contrast, criterion-referenced tests interpret scores in terms of the content mastered. “The focus is on what test takers can do and what they know, not on how
they compare to others” (Anastasi, p. 102). Many educational and licensing tests are criterion-
referenced tests used to establish knowledge or competency. For example, the
typical academic grade scale (90% to 100% = A, 80% to 89% = B, 70% to 79% = C, 60% to 69% = D, and below 60% = F) is criterion-referenced.
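By contrast with the curve, a criterion-referenced scale depends only on the examinee’s own performance, so every student could in principle earn an A. A minimal Python sketch of the scale above (the function name is an invented convenience):

    def criterion_grade(percent_correct):
        # Compare the score to fixed standards, not to other examinees.
        if percent_correct >= 90:
            return "A"
        elif percent_correct >= 80:
            return "B"
        elif percent_correct >= 70:
            return "C"
        elif percent_correct >= 60:
            return "D"
        return "F"

    print([criterion_grade(p) for p in (95, 84, 72, 65, 40)])
    # ['A', 'B', 'C', 'D', 'F']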
Assessment results inform both formative and summative decisions. Formative decisions are used to shape the instructional design or delivery process, e.g., several of my students need additional training on using the calculator. Summative decisions describe the level of learning attained at the end of instruction, e.g., assigning final grades. High-stakes decisions determine “who will and who will not gain access to employment, education, and . . .” (Sackett, Schmitt, Ellingson, & Kabin, p. 302). Although many classroom assessments are used for low-stakes decision making, the stakes tend to be higher when making summative decisions, in which case, greater attention to the technical quality of the assessment is warranted.
Psychometrics is “the science of the assessment of individual differences”; the term often refers to the quantitative aspects of psychological measurement (Whitney & Shultz, p. 425).
Two long-standing hallmarks of a test’s quality are its validity and reliability.¹ “The validity of a
test concerns what the test measures and how well it does so. It tells us what can be inferred
from test scores” (Anastasi, p. 139). “Reliability refers to the consistency of scores obtained by
the same persons when reexamined . . . .” (Anastasi, p. 109). Because an unreliable test cannot measure anything consistently, it cannot be valid; reliability is a necessary but not sufficient condition for validity.
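For illustration, one widely used index of score consistency is Cronbach’s alpha, computed as k/(k − 1) × (1 − sum of item variances / variance of total scores). The Python sketch below is minimal and the item data are invented (rows are examinees, columns are items):

    def cronbach_alpha(item_scores):
        # item_scores: one row of item scores per examinee.
        k = len(item_scores[0])  # number of items

        def variance(values):
            m = sum(values) / len(values)
            return sum((v - m) ** 2 for v in values) / (len(values) - 1)

        item_vars = [variance([row[j] for row in item_scores]) for j in range(k)]
        total_var = variance([sum(row) for row in item_scores])
        # alpha = k/(k-1) * (1 - sum of item variances / variance of totals)
        return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

    data = [
        [1, 1, 1, 0],  # 1 = item answered correctly, 0 = incorrectly
        [1, 1, 0, 0],
        [1, 0, 0, 0],
        [1, 1, 1, 1],
        [0, 0, 0, 0],
    ]
    print(round(cronbach_alpha(data), 2))  # 0.8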
Tests also differ in the manner in which individuals respond to items. Probably the most
common form of testing is with items that call for a written response (even if only indicating an answer choice). Written items may call for a structured response, in which there is only one correct answer (e.g., multiple choice, true/false), or an open response, in which the length and content of the response varies, as with short answer or essay items. Structured-response items allow more of the achievement domain to be covered in a relatively short period of time (improving reliability
and validity), may be administered to large groups, can be quickly graded, and consist of
objectively correct answers (improving reliability). However, if structured-response items are not well written, e.g., the distractors increase the likelihood of correct guessing, both the reliability and validity of the resulting scores suffer.

¹ Precise speakers and writers would note the following: tests are not valid or invalid; the inferences that we make from their use are valid or invalid. Likewise, tests are not reliable or unreliable. Reliability coefficients are specific to a sample of the population, so the best we can say is that the use of the test is likely reliable, particularly across similar populations. It should also be noted that reliability results for criterion-referenced tests are atypical: scores on a criterion-referenced test show less variability, which limits reliability coefficients.
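The footnote’s final point can be illustrated with a short Python sketch (all scores are invented, and pearson_r is a plain Pearson correlation standing in for a test-retest reliability coefficient): restricting the sample to examinees above a mastery cut reduces score variability, and the coefficient drops even though the testing procedure is unchanged.

    def pearson_r(xs, ys):
        # Pearson correlation, used here as a test-retest reliability estimate.
        n = len(xs)
        mx, my = sum(xs) / n, sum(ys) / n
        cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        sx = sum((x - mx) ** 2 for x in xs) ** 0.5
        sy = sum((y - my) ** 2 for y in ys) ** 0.5
        return cov / (sx * sy)

    first  = [55, 62, 68, 71, 75, 80, 84, 88, 92, 97]   # first testing
    second = [61, 57, 73, 66, 80, 74, 89, 83, 97, 92]   # retest
    print(round(pearson_r(first, second), 2))           # ~0.92: wide range

    # Keep only examinees at or above an 80-point mastery cut.
    kept = [(x, y) for x, y in zip(first, second) if x >= 80]
    xs, ys = zip(*kept)
    print(round(pearson_r(xs, ys), 2))                  # ~0.77: restricted range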
Although some argue that structured-response items can be written to assess higher-order cognitive skills, they also admit that training and practice in creating such items is required (Anastasi, p. 417). Most agree that open-response items can be created that require higher-level cognitive functioning, providing access to areas of the achievement domain that structured-response items cannot reach. However, because open-response items take longer to answer and to score, assessments consisting of open-response items allow less of the domain to be tested (reducing reliability and validity).
Because Chatterji was a course text, it is not listed below; the references that follow are non-course sources.
References