UNIT 05: Reliability: Module Overview


MODULE OVERVIEW

Reliability refers to the consistency of the test scores obtained by the same persons when they are reexamined with the same test on different occasions, or with different sets of equivalent items, or under other variable examining conditions (Anastasi & Urbina, 1997). This unit will explore the different kinds of reliability coefficients, including those for measuring test-retest reliability, alternate-forms reliability, split-half reliability, and inter-scorer reliability.
DR. EVA MARIE P. GACASAN
DR. GWENDELINA A. VILLARANTE
DEPARTMENT OF PSYCHOLOGY| CEBU NORMAL UNIVERSITY
LEARNING OUTCOMES OF THE MODULE
• Explain the concept of reliability.
• Identify the different reliability estimates.
• Describe the purpose of using and interpreting a coefficient of reliability.
• Discuss reliability and individual scores with respect to the types of standard errors.

LECTURE CONTENT
UNIT 5: RELIABILITY
Reliability refers to consistency in measurement; in everyday language, it is a synonym for dependability.
It is important for us, as users of tests and consumers of information about tests, to know how reliable tests and
other measurement procedures are.
A reliability coefficient is an index of reliability, a proportion that indicates the ratio between the true score
variance on a test and the total variance.
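This ratio can be made concrete with a small simulation (a sketch with invented numbers, not from the module): if each observed score is a stable true score plus random error, the reliability coefficient is the share of total variance that is true score variance.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate 1,000 testtakers: observed score = true score + random error.
true_scores = rng.normal(loc=50, scale=10, size=1000)  # true variance ~ 100
errors = rng.normal(loc=0, scale=5, size=1000)         # error variance ~ 25
observed = true_scores + errors

# Reliability coefficient: true score variance / total observed variance.
# In expectation this is 100 / (100 + 25) = 0.80.
reliability = np.var(true_scores) / np.var(observed)
print(round(reliability, 2))
```

With less error variance the ratio approaches 1; with more, it falls toward 0.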

THE CONCEPT OF RELIABILITY


Because true differences are assumed to be stable, they are presumed to yield consistent scores on repeated
administrations of the same tests as well as on equivalent forms of tests.

SOURCES OF ERROR VARIANCE

Test construction


• Item sampling (or content sampling) refers to variation among items within a test as well as to variation among items between tests.
• Differences are sure to be found in the way the items are worded and in the exact content sampled.
• A testtaker's higher score on one form of a test, for example, could be due to the specific content sampled and the way the items were worded, rather than to any true difference in the ability being measured.

Test administration
• Examples of untoward influences during administration of a test include factors related to the test environment: the
room temperature, the level of lighting, and the amount of ventilation and noise, for instance.
• Other environment-related variables include the instrument used to enter responses and even the writing surface on which responses are entered.
• Test administration also takes into consideration testtaker variables: pressing emotional problems, physical discomfort, lack of sleep, and the effect of drugs or medication can all be sources of error variance.
• Examiner-related variables are potential sources of error variance.

TEST SCORING AND INTERPRETATION


The advent of computer scoring and a growing reliance on objective, computer-scorable items have virtually eliminated error variance caused by scorer differences in many tests.
• In some tests of personality, examinees are asked to supply open-ended responses to stimuli such as pictures,
words, sentences, and inkblots, and it is the examiner who must then quantify or qualitatively evaluate responses.
• Scorers and scoring systems are potential sources of error variance.
• Examiner/scorers occasionally still are confronted by situations where an examinee’s response is in a gray area.

RELIABILITY ESTIMATES
Test-retest Reliability estimates
• One way of estimating the reliability of a measuring instrument is by using the same instrument to measure the
same thing at two points in time.
• Test-retest reliability is an estimate of reliability obtained by correlating pairs of scores from the same people on
two different administrations of the same test. The test-retest measure is appropriate when evaluating the reliability
of a test that purports to measure something that is relatively stable over time, such as a personality trait.
• When the interval between testing is greater than six months, the estimate of test-retest reliability is often referred
to as the coefficient of stability.
• An estimate of test-retest reliability may be most appropriate in gauging the reliability of tests that employ outcome
measures such as reaction time or perceptual judgements.
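As a sketch with hypothetical scores (the numbers are invented for illustration), a test-retest estimate is simply the Pearson correlation between the two administrations:

```python
import numpy as np

# Hypothetical scores for the same five testtakers, tested twice.
time1 = np.array([12, 15, 11, 18, 14])
time2 = np.array([13, 16, 10, 18, 15])

# Test-retest reliability: correlate the pairs of scores.
r_test_retest = np.corrcoef(time1, time2)[0, 1]
print(round(r_test_retest, 2))  # 0.96
```

The closer the coefficient is to 1, the more stable the scores are over the retest interval.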

Parallel-Forms and Alternate-Forms Reliability Estimates
• The degree of the relationship between various forms of a test can be evaluated by means of an alternate-forms or parallel-forms coefficient of reliability, which is often termed the coefficient of equivalence.
• Parallel forms of a test exist when, for each form of the test, the means and the variances of observed test scores are equal.
• Alternate forms are simply different versions of a test that have been constructed so as to be parallel. Alternate forms of a test are typically designed to be equivalent with respect to variables such as content and level of difficulty.
• Developing alternate forms of a test can be time-consuming and expensive.
• On the other hand, using an alternate form minimizes the effect of memory for the content of a previously administered form of the test.
An estimate of reliability can also be obtained from a single administration by examining the relationships among parts of the test; logically enough, this is referred to as an internal consistency estimate of reliability, or an estimate of inter-item consistency.

Split-half estimate
• An estimate of split-half reliability is obtained by correlating two pairs of scores obtained from equivalent halves of a
single test administered once.
• One acceptable way to split a test is to randomly assign items to one or the other half of the test. Another acceptable
way to split a test is to assign odd-numbered items to one half of the test and even-numbered items to the other half.
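The odd-even split described above can be sketched as follows (the 0/1 item responses are invented for illustration). Note that the resulting correlation is the reliability of a half-length test:

```python
import numpy as np

# Hypothetical 0/1 item responses: 6 testtakers (rows) x 6 items (columns).
responses = np.array([
    [1, 1, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 1, 0, 1, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [1, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 0, 0],
])

# Odd-numbered items (columns 0, 2, 4) vs. even-numbered items (1, 3, 5).
odd_half = responses[:, 0::2].sum(axis=1)
even_half = responses[:, 1::2].sum(axis=1)

# Correlation between the two half scores: the split-half estimate
# (of a half-length test, before any length correction).
r_halves = np.corrcoef(odd_half, even_half)[0, 1]
print(round(r_halves, 2))  # 0.39
```

Because halving a test shortens it, this coefficient understates the full test's reliability, which is why a length correction is then applied.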

The Spearman-Brown Formula



• The Spearman-Brown formula allows a test developer or user to estimate internal consistency reliability from a correlation
of two halves of a test.
• Usually, but not always, reliability increases as test length increases. Ideally, the additional test items are equivalent with
respect to the content and the range of difficulty of the original items.
• The Spearman-Brown formula can also be used to determine the number of items needed to attain a desired level of reliability.
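The formula itself is r' = n*r / (1 + (n - 1)*r), where r is the obtained reliability and n is the factor by which the test length changes. A minimal sketch (the function names are my own):

```python
def spearman_brown(r, n):
    """Estimated reliability when test length is multiplied by factor n."""
    return n * r / (1 + (n - 1) * r)

def length_factor_needed(r, r_desired):
    """Factor by which a test must be lengthened to reach r_desired."""
    return r_desired * (1 - r) / (r * (1 - r_desired))

# Correcting a split-half correlation of .70 up to full length (n = 2):
print(round(spearman_brown(0.70, 2), 2))           # 0.82
# A test with r = .60 must be about 2.67 times longer to reach r = .80:
print(round(length_factor_needed(0.60, 0.80), 2))  # 2.67
```

Note that lengthening a test with the same level of item quality raises reliability, but with diminishing returns as r approaches 1.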

Other Methods of Estimating Internal Consistency


• Other methods of estimating internal consistency were developed by Kuder and Richardson (1937) and by Cronbach (1951). Inter-item consistency refers to the degree of correlation among all the items on a scale. A measure of inter-item consistency is calculated from a single administration of a single form of a test.
• An index of inter-item consistency, in turn, is useful in assessing the homogeneity of the test.
• Homogeneity is the extent to which items in a scale are unifactorial; heterogeneity, by contrast, describes the degree to which a test measures different factors. A heterogeneous test is composed of items that measure more than one trait.
• Testtakers with the same score on a homogeneous test probably have similar abilities in the area tested.
The Kuder-Richardson formulas
• Developed by G. Frederic Kuder and M. W. Richardson.
• Where test items are highly homogeneous, KR-20 and split-half reliability estimates will be similar.
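KR-20 applies to dichotomously scored (right/wrong) items: KR-20 = (k/(k - 1)) * (1 - sum(p*q) / total variance), where k is the number of items, p the proportion passing each item, and q = 1 - p. A sketch of the computation using invented 0/1 data (the function name is mine):

```python
import numpy as np

def kr20(responses):
    """KR-20 for 0/1 item data; rows = testtakers, columns = items."""
    k = responses.shape[1]
    p = responses.mean(axis=0)               # proportion passing each item
    q = 1 - p
    total_var = responses.sum(axis=1).var()  # variance of total scores
    return (k / (k - 1)) * (1 - (p * q).sum() / total_var)

# Invented 0/1 responses: 6 testtakers x 6 items.
data = np.array([
    [1, 1, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 1, 0, 1, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [1, 1, 1, 0, 1, 0],
    [0, 1, 0, 0, 0, 0],
])
print(round(kr20(data), 2))  # 0.64
```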

Coefficient alpha



Developed by Cronbach (1951).
• Coefficient alpha may be thought of as the mean of all possible split-half correlations, corrected by the Spearman-Brown formula.
• Coefficient alpha is the preferred statistic for obtaining an estimate of internal consistency reliability.
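Coefficient alpha generalizes KR-20 to items scored on any scale: alpha = (k/(k - 1)) * (1 - sum of item variances / total variance). A sketch with invented Likert-type ratings (the function name is mine):

```python
import numpy as np

def cronbach_alpha(scores):
    """Coefficient alpha; rows = testtakers, columns = items."""
    k = scores.shape[1]
    item_vars = scores.var(axis=0)        # variance of each item
    total_var = scores.sum(axis=1).var()  # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Invented 1-5 ratings: 5 testtakers x 4 items.
ratings = np.array([
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 5, 5, 4],
    [2, 2, 3, 2],
    [4, 4, 4, 5],
])
print(round(cronbach_alpha(ratings), 2))  # 0.94
```

When the items are scored 0/1, this formula reduces to KR-20.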

❖ Essentially, this formula yields an estimate of the mean of all possible split-half coefficients. Coefficient alpha is widely used as a measure of reliability, in part because it requires only one administration of the test.
Inter-scorer reliability
❖ Also referred to as scorer reliability, judge reliability, or observer reliability, inter-scorer reliability is the degree of agreement or consistency between two or more scorers (or judges or raters) with regard to a particular measure.
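The simplest index of inter-scorer reliability is the proportion of cases two scorers score identically; in practice, a correlation coefficient or a chance-corrected statistic such as Cohen's kappa is often reported instead. A sketch with invented scoring decisions:

```python
# Hypothetical 0/1 credit decisions by two scorers on the same 8 responses.
scorer_a = [1, 0, 1, 1, 0, 1, 0, 1]
scorer_b = [1, 0, 1, 0, 0, 1, 1, 1]

# Proportion of responses on which the two scorers agree.
agreement = sum(a == b for a, b in zip(scorer_a, scorer_b)) / len(scorer_a)
print(agreement)  # 0.75
```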

Homogeneity versus heterogeneity of test items


• Recall that a test is said to be homogeneous in items if it is functionally uniform throughout.
• By contrast, if the test is heterogeneous in items, an estimate of internal consistency might be low relative to a
more appropriate estimate of test-retest reliability.

Criterion-referenced test
• A criterion-referenced test is designed to provide an indication of where a testtaker stands with respect to some variable or criterion, such as an educational or vocational objective.
• Scores on criterion-referenced tests tend to be interpreted in pass-fail terms, and any scrutiny of performance on individual items tends to be for diagnostic and remedial purposes.
REFERENCES AND MATERIALS
Main Text:
• Cohen, R. J. & Swerdlik, M. E. (2018). Psychological testing and assessment. New York, NY: McGraw-Hill.

Supplementary Text:
• Anastasi, A. & Urbina, S. (2001). Psychological testing. Singapore: Pearson
Education Asia PTE. LTD.
Other books and materials used in this course (e.g., the Psychological Assessment Report Template) can be found in our Google Classroom.
