Validity and Reliability in Education

Validity and reliability are two important concepts for evaluating data and assessments. Validity refers to how well a measurement tool measures what it intends to measure, while reliability is about consistency and producing comparable outcomes over time. Some key points made in the document are:
- Schools are increasingly using data to inform decisions but must consider the validity of what the data measures and the goals being assessed.
- Validity has to do with how well a measurement matches the purpose or construct it aims to assess, while reliability is about consistency of results.
- Both validity and reliability are important but validity takes precedence, and assessments should be evaluated on both factors when possible.

Uploaded by Jeraldine Repollo

Schools all over the country are beginning to develop a culture
of data, which is the integration of data into the day-to-day
operations of a school in order to achieve classroom, school, and
district-wide goals. One of the biggest difficulties that comes with
this integration is determining what data will provide an accurate
reflection of those goals.

Such considerations are particularly important when the goals of
the school are not stated in terms that lend themselves to
cut-and-dried analysis; school goals often describe the
improvement of abstract concepts like “school climate.”

Schools interested in establishing a culture of data are advised to
come up with a plan before going off to collect it. They need to
first determine what their ultimate goal is and what
achievement of that goal looks like. An understanding of the
definition of success allows the school to ask focused questions to
help measure that success, which may be answered with the
data.

For example, if a school is interested in increasing literacy, one
focused question might ask: which groups of students are
consistently scoring lower on standardized English tests? If a
school is interested in promoting a strong climate of
inclusiveness, a focused question may be: do teachers treat
different types of students unequally?
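
The literacy question above could be explored with a short Python sketch along the following lines. Everything in it is assumed for illustration only: the group labels "A", "B", and "C", the years, the scores, and the working definition of "consistently lower" as a group mean below the overall mean in every year on record. A school would substitute its own records and its own definition.

```python
from collections import defaultdict
from statistics import mean


def consistently_lower_groups(records):
    """Return groups whose mean score is below the overall mean in every year.

    records is a list of (year, group, score) tuples (a hypothetical layout).
    """
    by_year = defaultdict(list)
    by_year_group = defaultdict(list)
    for year, group, score in records:
        by_year[year].append(score)
        by_year_group[(year, group)].append(score)

    groups = {group for _, group in by_year_group}
    result = []
    for group in sorted(groups):
        below_every_year = all(
            (year, group) in by_year_group
            and mean(by_year_group[(year, group)]) < mean(by_year[year])
            for year in by_year
        )
        if below_every_year:
            result.append(group)
    return result


# Hypothetical standardized English test scores.
records = [
    (2022, "A", 80), (2022, "A", 82),
    (2022, "B", 60), (2022, "B", 64),
    (2022, "C", 75), (2022, "C", 70),
    (2023, "A", 78), (2023, "A", 84),
    (2023, "B", 65), (2023, "B", 61),
    (2023, "C", 70), (2023, "C", 68),
]

print(consistently_lower_groups(records))
```

In this made-up data set, one group falls below the overall mean in a single year only, so it is not reported; only a group that is below the mean in both years is. Whether that is the right reading of "consistently" is itself a validity question about the focused question.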

These focused questions are analogous to research questions
asked in academic fields such as psychology, economics, and,
unsurprisingly, education. However, the question itself does not
always indicate which instrument (e.g. a standardized test,
student survey, etc.) is optimal.

If the wrong instrument is used, the results can quickly become
meaningless or uninterpretable, thereby rendering them
inadequate in determining a school’s standing in or progress
toward its goals.

Differences Between Validity and Reliability
When creating a question to quantify a goal, or when deciding on
a data instrument to answer that question, two concepts are
widely regarded by researchers as being of paramount
importance.

These two concepts are called validity and reliability, and they
refer to the quality and accuracy of data instruments.

WHAT IS VALIDITY?
The validity of an instrument is the idea that the instrument
measures what it intends to measure.

Validity pertains to the connection between the purpose of the
research and the data the researcher chooses to quantify that
purpose.

For example, imagine a researcher who decides to measure the
intelligence of a sample of students. Some measures, like physical
strength, possess no natural connection to intelligence. Thus, a
test of physical strength, like how many push-ups a student could
do, would be an invalid test of intelligence.

WHAT IS RELIABILITY?
Reliability, on the other hand, is not at all concerned with intent.
It asks instead whether the test used to collect data produces
consistent results, that is, whether the results could be
replicated.

Because reliability ignores intent, an instrument can be
simultaneously reliable and invalid.

Returning to the example above, if we measure the number of
push-ups the same students can do every day for a week (which, it
should be noted, is not long enough to significantly increase
strength) and each person does approximately the same number
of push-ups on each day, the test is reliable. But, clearly, the
reliability of these results still does not render the number of
push-ups per student a valid measure of intelligence.
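
The push-up example can be made concrete with a short Python sketch. Everything here is an assumption made for illustration: the six students, their push-up counts, the reasoning-test scores, and the use of a Pearson correlation as a rough stand-in for both test-retest reliability and validity evidence. None of it comes from the source.

```python
from math import sqrt
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mx) ** 2 for x in xs))
    spread_y = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical data: one entry per student, same order in every list.
pushups_day1 = [10, 25, 18, 32, 15, 22]
pushups_day2 = [11, 24, 19, 31, 15, 23]
reasoning_scores = [88, 90, 71, 84, 95, 69]

# Consistency across days: a value close to 1 suggests a reliable instrument.
test_retest = pearson(pushups_day1, pushups_day2)

# Relation to the construct of interest: a weak correlation suggests push-ups
# say little about reasoning, however consistent the counts are.
validity_evidence = pearson(pushups_day1, reasoning_scores)

print(f"test-retest correlation: {test_retest:.2f}")
print(f"correlation with reasoning scores: {validity_evidence:.2f}")
```

With these made-up numbers the first correlation is high and the second is weak, which mirrors the reliable-but-invalid case described above.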

Because reliability does not concern the actual relevance of the
data in answering a focused question, validity will generally
take precedence over reliability. Moreover, schools will often
assess two levels of validity:

1. the validity of the research question itself in quantifying the
larger, generally more abstract goal
2. the validity of the instrument chosen to answer the research
question

Conclusion
1. Validity and reliability are meaningful measurements that
should be taken into account when attempting to evaluate
the status of or progress toward any objective a district,
school, or classroom has.
2. If precise statistical measurements of these properties
cannot be made, educators should attempt to evaluate
the validity and reliability of data through intuition, previous
research, and collaboration as much as possible.
3. An understanding of validity and reliability allows educators
to make decisions that improve the lives of their students
both academically and socially, as these concepts teach
educators how to quantify the abstract goals their school or
district has set.

Validity and reliability of assessment methods are considered the two most important
characteristics of a well-designed assessment procedure.
Validity refers to the degree to which a method assesses what it claims or intends to assess. The
different types of validity include:

Validity     Definition
content      the assessment method matches the content of the work
criterion    relates to whether the assessment method is explicit in
             terms of procedures correlating with particular behaviours
construct    relates to whether scores reflect the items being
             tested [5, 13]

Performance-based assessments are typically viewed as providing more valid data than
traditional examinations because they focus more directly on the tasks or skills of practice [2].

Reliability refers to the extent to which an assessment method or instrument measures
consistently the performance of the student. Assessments are usually expected to produce
comparable outcomes, with consistent standards over time and between different learners and
examiners. However, the following factors impede both the validity and reliability of assessment
practices in workplace settings:

- inconsistent nature of people
- reliance on assessors to make judgements without bias
- changing contexts/conditions
- evidence of achievement arising spontaneously or incidentally [2, 13]

Explicit performance criteria enhance both the validity and reliability of the assessment process.
Clear, usable assessment criteria contribute to the openness and accountability of the whole
process. The context, tasks and behaviours desired are specified so that assessment can be
repeated and used for different individuals. Explicit criteria also counter criticisms of
subjectivity [13].

As mentioned in Key Concepts, reliability and validity are closely related. To better
understand this relationship, let's step out of the world of testing and onto a bathroom
scale.

If the scale is reliable it tells you the same weight every time you step
on it as long as your weight has not actually changed. However, if the
scale is not working properly, this number may not be your actual
weight. If that is the case, this is an example of a scale that is reliable,
or consistent, but not valid. For the scale to be valid and reliable, not
only does it need to tell you the same weight every time you step on the
scale, but it also has to measure your actual weight.
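
The bathroom-scale analogy can be sketched in a few lines of Python. The readings, the true weight of 70.0, and the 0.5 tolerance are all hypothetical values chosen for illustration, and treating "spread of repeated readings" as reliability and "average distance from the true weight" as validity is a simplification rather than a formal definition.

```python
from statistics import mean, pstdev


def check_scale(readings, true_weight, tolerance=0.5):
    """Classify a scale from repeated readings of one known weight.

    tolerance is a hypothetical cut-off, in the same unit as the readings.
    """
    spread = pstdev(readings)            # how much readings disagree with each other
    bias = mean(readings) - true_weight  # how far the average is from the truth
    reliable = spread <= tolerance
    # Following the text, a scale only counts as valid if it is also reliable.
    valid = reliable and abs(bias) <= tolerance
    return {"reliable": reliable, "valid": valid}


true_weight = 70.0  # hypothetical, in kilograms

miscalibrated = check_scale([75.1, 74.9, 75.0, 75.2, 74.8], true_weight)
working = check_scale([70.1, 69.9, 70.0, 70.2, 69.8], true_weight)
erratic = check_scale([65.0, 75.0, 70.0, 78.0, 62.0], true_weight)

print("miscalibrated:", miscalibrated)
print("working:", working)
print("erratic:", erratic)
```

The miscalibrated scale comes out reliable but not valid, the working scale is both, and the erratic scale is neither, even though its readings happen to average out to the true weight.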

Switching back to testing, the situation is essentially the same. A test can
be reliable, meaning that the test-takers will get the same score no matter
when or where they take it, within reason of course. But that doesn't mean
that it is valid or measuring what it is supposed to measure. A test can be
reliable without being valid. However, a test cannot be valid unless it is
reliable.
