Computerized Adaptive and Multistage Testing With R Using Packages Catr and MSTR Unrestricted Download
Computerized Adaptive and Multistage Testing With R Using Packages Catr and MSTR Unrestricted Download
Visit the link below to download the full version of this book:
https://medipdf.com/product/computerized-adaptive-and-multistage-testing-with-r-
using-packages-catr-and-mstr/
vii
viii Foreword
test, many questions must be answered and many decisions must be made. For
instance, is my item bank big enough given the foreseen number of respondents
and lifespan of the test, is the test length appropriate for the targeted reliability,
and are the item parameters adequate for the targeted population, to mention a
few questions. The website of the International Association for Computerized
Adaptive Testing (IACAT) adds some more issues: content balancing (to create
a test where the content matter is appropriately balanced), item exposure control
(to prevent compromising of the test), combining multiple scales, and proctoring
of administration via the Internet. To complicate things, all these aspects usually
interact. The simulation studies such as those supported by the R packages can
help answering such questions and support the decisions. The choice of R is a
good one, motivated by the fact that it has become the most versatile and flexible
open-source platform for statistical analyses and creation of graphical output.
For most developers and users of statistical tools, R has become the standard of
industry. It offers practitioners the opportunity to add their own functionality to
existing applications and provides a common ground for exchanging their software.
Therefore, I am sure that this book and the simulation software accompanying it
will be extremely helpful for designing adaptive tests.
This book takes its origin from an inspiring discussion between the authors during
the meeting of the International Association for Computerized Adaptive Testing
(IACAT) in Cambridge (UK), October 2015. From this early project, several drafts
were written, re-organized, reviewed, and updated to end up with the current version
of the book.
The authors wish to express their sincere acknowledgments to the many people
who took some of their precious time to read the provisional chapters and make
insightful comments: Hong Jiao (University of Maryland), Kim Fryer, Lixiong Gu,
Sooyeon Kim, Yanming Jiang, Longjuan Liang, Guangming Ling, Yuming Liu,
Manfred Steffen, Fred Robin, Jonathan Weeks, Meng Wu (Educational Testing
Service). The authors want to express their gratitude to Cees Glas who accepted to
preface this book long before it came out in its final version. Eventually, special
thanks to Jim Carlson from Educational Testing Service for his comments and
suggestions throughout the whole book, and to Andrew Cantine (technical editor,
ACTNext) for carefully editing the writing of this book.
ix
Contents
xi
xii Contents
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
Acronyms
xv
List of Figures
xvii
xviii List of Figures
xix
xx List of Tables
Table 8.1 Module sizes by stage for the six possible MST designs
using the 2PL item bank . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
Table 8.2 Means and variances of item discrimination parameters in
each module and for each MST design . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
Table 8.3 Means and variances of item difficulty
parameters in each module and for each MST
design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142
Chapter 1
Overview of Adaptive Testing
Traditionally, linear tests have been the most common way of measuring test takers’
knowledge, skills, and abilities, especially in educational assessments. However,
over the last two decades, computer science and technology have advanced rapidly
and the demand for computer-based tests (CBT) has greatly increased. In particular,
computerized adaptive tests have been used in many real-world settings, due to their
efficiency and precision in testing. More recently, computerized multistage tests
(MST) have become very popular for their features and efficiency.
regardless of whether the items are very easy or very difficult for them. Test takers
with a relatively high (or low) level of knowledge or ability would still need to
answer the easy (or difficult) items. Because it is likely that low ability test takers
will get all difficult items wrong and high ability test takers will get all easy items
correct, these easy and difficult items contribute little information toward measuring
test takers’ abilities at the higher and lower ends of measurement scale, respectively.
The result is that linear tests require large numbers of items in order to obtain
uniformly good precision in the final scores (Rudner, 1998).
A computer-based test is a test administered on a computer and can be a linear,
adaptive, or multistage test. A linear CBT is similar to a traditional, linear, paper-
and-pencil test, except that it is administered via computer. Therefore, a CBT enjoys
many of the advantages associated with computerization, such as the flexibility of
test scheduling and the efficiency of test administration, test assembly, and score
reporting, but suffers the same limitations as a linear paper-and-pencil test.
1.1.2 CAT
1.1.3 MST