0% found this document useful (0 votes)
38 views3 pages

Different Reliability Tests

Uploaded by

Wilbert Piala
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views3 pages

Different Reliability Tests

Uploaded by

Wilbert Piala
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Different Reliability Tests

1. Test- Retest
Test-retest reliability measures the correlation between 2+ results for the same test taken by the
same participants. A high test-retest reliability implies the test has consistent results.
Test-retest reliability assesses the consistency of test results. For example, a test with high test-
retest reliability will produce similar scores if the same participants take it more than once. If
participants take a test with low test-retest reliability, their scores may be very different even
though they take the same test again.
How to Compute:
Administer the test twice to the same group.
Calculate the correlation coefficient (e.g., Pearson r) between the two sets of scores.
How to Interpret/Analyze the Result:
A high correlation (close to 1) indicates strong reliability and stability over time.
A low correlation suggests that external factors or test inconsistencies may have affected the
scores.

2. Equivalent Forms
Parallel form’s reliability (also called equivalent forms reliability) uses one set of questions divided
into two equivalent sets (“forms”), where both sets contain questions that measure the same
construct, knowledge or skill. The two sets of questions are given to the same sample of people
within a short period of time and an estimate of reliability is calculated from the two sets.

Put simply, you’re trying to find out if test A measures the same thing as test B. In other words
you want to know if test scores stay the same when you use different instruments.

Example: you want to find the reliability for a test of mathematics comprehension, so you create a
set of 100 questions that measure that construct. You randomly split the questions into two sets of
50 (set A and set B), and administer those questions to the same group of students a week apart.
Steps:

Step 1: Give test A to a group of 50 students on a Monday.


Step 2: Give test B to the same group of students that Friday.
Step 3: Correlate the scores from test A and test B.
In order to call the forms “parallel”, the observed score must have the same mean and variances.
If the tests are merely different versions (without the “sameness” of observed scores), they are
called alternate forms.
3. Split Half
Split-Half Correlation is defined as a method of estimating reliability by dividing a test into two
halves and measuring the correlation between the scores of each half. This procedure helps assess
the consistency of the test items and their equivalence.
How to Calculate Split Half Reliability
The process is simple.

Take the test and split it in half


Calculate the score of each examinee on each half
Correlate the scores on the two halves
The correlation is best done with the standard Pearson correlation.

Pearson-correlation-formula

This, of course, begs the question: How do we split the test into two halves? There are so many
ways. Well, psychometricians generally recommend three ways:

 First half vs last half


 Odd-numbered items vs even-numbered items
 Random split
4. Kuder-Richardson
The Kuder and Richardson Formula 20 test checks the internal consistency of measurements with
dichotomous choices.
https://real-statistics.com/reliability/internal-consistency-reliability/kuder-richardson-formula-20/

5. Pilot Testing (Cronbach Alpha)

Cronbach’s alpha coefficient measures the internal consistency, or reliability, of a set of survey
items. Use this statistic to help determine whether a collection of items consistently measures the
same characteristic.
https://statisticsbyjim.com/basics/cronbachs-alpha/

You might also like