
Psychometric evaluation of a knowledge-based examination using Rasch analysis


Abstract
Classical Test Theory (CTT) has traditionally been used to carry out post-examination analysis of objective test data. It uses descriptive methods and aggregated data to help identify sources of measurement error and unreliability in a test, in order to minimize them. Item Response Theory (IRT), and in particular Rasch analysis, uses more complex methods to produce outputs that not only identify sources of measurement error and unreliability, but also identify the way item difficulty interacts with student ability. In this Guide, a knowledge-based test is analyzed by the Rasch method to demonstrate the variety of useful outputs that can be provided. IRT provides a much deeper analysis, giving a range of information on the behavior of individual test items and individual students, as well as the underlying constructs being examined. Graphical displays can be used to evaluate the ease or difficulty of items across the student ability range, as well as providing a visual method for judging how well the difficulty of items on a test matches student ability. By displaying data in this way, problem test items are more easily identified and modified, allowing medical educators to iteratively move towards the ‘perfect’ test, in which the distribution of item difficulty is mirrored by the distribution of student ability.

Introduction
The quality of assessment methods and processes is as important as the quality of the teaching and
learning process in any form of educational activity.

Practice points
1. Rasch analysis is a particular method used in IRT.

2. IRT supersedes CTT, in that it takes into consideration the interaction between student ability and item difficulty.

3. The characteristics of a test that fits the Rasch model can be identified, so that test developers can iteratively move towards the ‘perfect’ test.

4. The ‘perfect’ test is one on which the distribution of student ability is perfectly mirrored by the distribution of item difficulty.

Comparing Classical Test Theory with Item Response Theory
This section of the Guide describes and compares the concepts and methods that underpin Classical Test Theory (CTT), which is the more traditional approach to psychometric analysis, and Item Response Theory (IRT), which is a more developed and contemporary approach.

Methods
The Rasch model
Despite the complexity of the statistical and measurement methods used by the Rasch model, the results can answer some simple questions, given below and illustrated in the sketch that follows the list.

1. How well does a student answer a question if we know the student’s ability and the item’s difficulty?

2. What is the probability of a student answering an item correctly, given a measure of item difficulty?

3. If student ability equals item difficulty, what is the probability of answering the item correctly?

4. What is the probability of a less or more able student answering an easy or difficult item?
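The dichotomous Rasch model answers all four questions with a single expression: the probability of a correct response depends only on the difference between student ability (theta) and item difficulty (b), both on the logit scale. The following is a minimal sketch in Python, not taken from the Guide; the function and variable names are illustrative.

import math

def p_correct(theta: float, b: float) -> float:
    # Rasch probability of a correct response:
    # P = exp(theta - b) / (1 + exp(theta - b))
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# Question 3: when ability equals difficulty, the probability is exactly 0.5.
print(p_correct(1.0, 1.0))   # 0.5
# Question 4: a less able student on a hard item, and an able student on an easy item.
print(p_correct(-1.0, 1.0))  # ~0.12
print(p_correct(2.0, -1.0))  # ~0.95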

Unidimensionality
One of the assumptions of Rasch modeling is that a test optimally measures a single underlying
construct; this is termed unidimensionality. For example this underlying single construct can be identified
with cognitive ability in a knowledge-based test or practical performance in an OSCE. Unidimensionalty
implies that all items in a test or all OSCE stations assess a single construct or dimension .
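A common way to probe unidimensionality is a principal component analysis of the standardized Rasch residuals (the check Winsteps performs as "PCA of residuals"). The sketch below is a simplified, hedged version, assuming that ability (theta) and difficulty (b) estimates and the 0/1 response matrix X (students x items) are already available; the threshold of roughly 2 for the largest residual eigenvalue is a commonly cited rule of thumb, not a value from this Guide.

import numpy as np

def largest_residual_eigenvalue(X, theta, b):
    # Expected score for each student-item pair under the Rasch model.
    P = 1.0 / (1.0 + np.exp(-(theta[:, None] - b[None, :])))
    # Standardized residuals: what is left after the Rasch dimension is removed.
    resid = (X - P) / np.sqrt(P * (1.0 - P))
    # Item-by-item correlations of the residuals, then their eigenvalues.
    corr = np.corrcoef(resid, rowvar=False)
    eigvals = np.linalg.eigvalsh(corr)[::-1]  # descending order
    return eigvals[0]  # values much above ~2 hint at a secondary dimension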

Response dependency
Another assumption of Rasch analysis is local independency of items. This means that the probability of
answering one item correctly should be independent of the answer to other items. When the value of an
item is predicted by the value of another item, the assumption of independency is violated. In the context
of the Rasch model, items with a high positive correlation indicate that one of the two questions is
redundant for the test. Correlations greater than 0.50 between items are considered an indication of
response dependency and items should be investigated. For example if item 1 has a correlation coefficient
of 70% with item 2 this indicates a local item dependency between item 1 and item 2, suggesting both
item 1 and item 2 are required for the test..
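The following minimal sketch flags item pairs whose residual correlations exceed the 0.50 threshold mentioned above. It reuses the standardized residual matrix from the previous sketch; the function name and 1-based item labels are illustrative.

import numpy as np

def dependent_pairs(resid, threshold=0.50):
    # resid: standardized Rasch residuals, students x items.
    corr = np.corrcoef(resid, rowvar=False)
    n = corr.shape[0]
    return [(i + 1, j + 1, round(float(corr[i, j]), 2))
            for i in range(n) for j in range(i + 1, n)
            if corr[i, j] > threshold]

# A returned triple such as (1, 2, 0.70) reproduces the example above:
# items 1 and 2 are locally dependent, so one of them is likely redundant.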

Item difficulty invariance


Another feature is ‘item difficulty invariance’, which provides valuable information about the invariance or stability of item values within a test. Invariance in this context means that the properties of an item are not influenced by the ability of the students answering it. A scatter plot of item difficulty values from high- and low-ability students can display a correlation that reveals the extent to which item difficulties vary between the two groups. By inserting 95% confidence interval control limits onto such plots, items that are not invariant (i.e. unstable) with respect to ability can be easily identified. Item difficulty invariance also allows us to identify items that are useful across the ability range, in order to calibrate questions for item banks. This means that assessors will have convenient access to a large number of tested questions which are classified according to student ability and item difficulty. Such questions can also be used for computer adaptive testing (CAT), where the questions administered to students can be modified according to their performance on the previous questions. A simplified invariance check is sketched below.
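As a rough, hedged sketch of the invariance check described above: students are split into low- and high-ability groups on total score, and an item difficulty is estimated within each group, here crudely as the logit of the proportion incorrect rather than by a full Rasch recalibration with confidence bands. All names are illustrative.

import numpy as np

def group_difficulties(X):
    # X: 0/1 response matrix, students x items.
    totals = X.sum(axis=1)
    low = X[totals < np.median(totals)]
    high = X[totals >= np.median(totals)]

    def logit_difficulty(group):
        p = group.mean(axis=0).clip(0.01, 0.99)  # proportion correct per item
        return np.log((1.0 - p) / p)             # higher value = harder item

    return logit_difficulty(low), logit_difficulty(high)

# Plotting the two returned vectors against each other (one point per item)
# should give points close to the identity line for invariant items.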

Response dependency
The local independence assumption is not violated provided that the order of the questions in an examination does not affect their difficulty. Response dependency was assessed for the complete test and for each case.

Reliability and separation estimates


The person separation reliability (PSR) for the whole test was 0.65, with a person separation index (PSI) of 1.37. A PSI value less than 2 indicates that the spread, or separation, of students on the construct being measured was not satisfactory, suggesting that the questions had low discrimination.
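The two reported values are consistent with the standard relationship between separation and reliability (this derivation is ours, not stated in the Guide):

\[
\mathrm{PSI} = \sqrt{\frac{\mathrm{PSR}}{1 - \mathrm{PSR}}} = \sqrt{\frac{0.65}{0.35}} \approx 1.36
\]

which matches the reported 1.37 to within rounding.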

Rasch item fit


Table 3 shows item difficulty, standard error, and item fit for each case. The outfit statistics show that Q11 and Q16 are not within the acceptable range (for both MNSQ and ZSTD), implying that they should be investigated, as they did not contribute towards the underlying test construct.
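A minimal sketch of how such flagging can be automated is given below. The MNSQ range of 0.7-1.3 and the |ZSTD| cut-off of 2.0 are commonly cited conventions, assumed here rather than taken from Table 3, and exact cut-offs vary between authors.

def misfitting(items, mnsq_lo=0.7, mnsq_hi=1.3, zstd_max=2.0):
    # items: iterable of (label, outfit_mnsq, outfit_zstd) tuples.
    return [label for label, mnsq, zstd in items
            if not (mnsq_lo <= mnsq <= mnsq_hi) or abs(zstd) > zstd_max]

# Example (hypothetical values): an item with outfit MNSQ 1.6 or ZSTD 3.1
# would be flagged for investigation, as Q11 and Q16 were here.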

Participants
The examination data used in this Guide were processed from results obtained from 355 medical students in their final clinical knowledge-based exam. We used Winsteps software (Linacre 2011) to produce simulated modifications of the data to create examples for the purposes of this Guide. We did not require approval from our research ethics committee, as this study was carried out using data acquired from normal exams within the curriculum, with the goal of monitoring the quality of individual questions in order to improve student assessment.

Data collection
Knowledge-based test
The simulated knowledge-based questions were used to assess the cognitive performance of students in this study. The test consisted of 43 questions assessing two clinical cases. Case 1 consisted of 24 questions on Clinical Laboratory Sciences and Case 2 consisted of 19 questions on chronic illness in General Practice. Each question was marked dichotomously, i.e. students received 1 mark if they answered the question correctly and 0 if they answered incorrectly. The maximum score for Case 1 and Case 2 was 24 and 19, respectively. There was no negative marking for incorrect answers. Students responded to the questions through an online assessment system (Rogō, University of Nottingham) during a normal summative examination.
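The marking scheme described above amounts to simple dichotomous scoring with no penalty for wrong answers; the sketch below illustrates it, with an entirely hypothetical answer-key format.

def score_case(responses, key):
    # responses, key: equal-length lists of answer strings, one per question.
    marks = [1 if r == k else 0 for r, k in zip(responses, key)]
    return marks, sum(marks)  # per-item marks and total (max 24 for Case 1, 19 for Case 2)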

Psychometric software
The Rasch measurement model (Rasch 1980) was used to analyse the different response patterns obtained, using Winsteps software.

Results
In this section, we demonstrate the results of the Rasch analysis of our simulated exam data under the headings previously discussed.
