Lecture 1
Lecture 1
The line between what constitutes testing and what constitutes assessment is not always
very clear.
The assessor may meet with the assessee or others before the formal
assessment in order to clarify aspects of the reason for referral. The assessor
prepares for the assessment by selecting the tools of assessment to be used.
For example, if the assessment is in a corporate or military setting and the
referral question concerns the assessee’s leadership ability, the assessor may
wish to employ a measure (or two) of leadership.
Typically, it the assessor’s own past experience, education, and training that
play a key role in the specific tests or other tools to be employed in the
assessment. Sometimes an institution in which the assessment is taking
place has prescribed guidelines for which instruments can and cannot be
used. In most every assessment situation, particularly situations that are
relatively novel to the assessor, the tool selection process may be informed
by some research in preparation for the assessment.
Consider, for example, the term medical test, which refers to a device or
procedure designed to measure some variable related to the practice of
medicine (including a wide range of tools and procedures such as X-rays,
blood tests, and testing of reflexes). In a like manner, the term psychological
test refers to a device or procedure designed to measure variables related
to psychology (for example, intelligence, personality, aptitude, interests,
attitudes, and values). Whereas a medical test might involve analysis of a
sample of blood, tissue, or other parameters, a psychological test almost
always involves analysis of a sample of behavior.
Psychological tests and other tools of assessment may differ with respect to a
number of variables such as -
Content - The content (subject matter) of the test will, of course, vary with the
focus of the particular test. But even two psychological tests purporting to
measure the same thing—for example, personality —may differ widely in item
content. This is so because what is deemed important in measuring
“personality” for one test developer might be entirely different for another
test developer; different test developers employ different definitions of
“personality.” Additionally, different test developers come to the test
development process with different theoretical orientations.
Format - The term format pertains to the form, plan, structure, arrangement,
and layout of test items as well as to related considerations such as time
limits. Format is also used to refer to the form in which a test is administered:
computerized, pencil-and-paper, or some other form.
Administration procedures - Tests differ in their administration procedures. Some tests,
particularly those designed for administration on a one-to-one basis, may require an
active and knowledgeable test administrator. Alternatively, some tests, particularly
those designed for administration to groups, may not even require the test
administrator to be present while the test takers independently do whatever it is the
test requires.
Scoring and interpretation procedures - Tests differ in their scoring and interpretation
procedures. In testing and assessment, scores can be defined as a code or summary
statement, usually but not necessarily numerical. in nature, that reflects an evaluation
of performance on a test, task, interview, or some other sample of behavior. Scoring is
the process of assigning such evaluative codes or statements to performance on tests,
tasks, interviews, or other behavior samples. Some scores result from the simple
summing of responses (such as the summing of correct/ incorrect or agree/disagree
responses), and some scores result from the application of more elaborate procedures.
Technical quality - Tests differ with respect to their technical quality i.e. psychometric
soundness of a test (psychometrics may be defined as the science of psychological
measurement).
2. The Interview
4. Case History
Data Case history data refers to records, transcripts, and other accounts in
written, pictorial, or other form that preserve archival information relevant to
an assessee. Case history data may include files maintained at institutions
and agencies such as schools, hospitals, employers, religious institutions, and
criminal justice agencies. Other examples of case history data are letters and
written correspondence, photos and family albums, newspaper and magazine
clippings, and home videos, movies, and audiotapes etc. Case history data is
a useful tool in a wide variety of assessment contexts.
5. Behavioral Observation
Behavioral observation, as it is employed by assessment professionals, may be
defined as monitoring the actions of others or oneself by visual or electronic means
while recording quantitative and/or qualitative information regarding the actions.
Behavioral observation is often used as a diagnostic aid in various settings such as
inpatient facilities, behavioral research laboratories, and classrooms. In addition to
diagnosis, behavioral observation may be used for selection purposes. Sometimes
researchers venture outside of the confines of clinics, classrooms, workplaces, and
research laboratories in order to observe behavior of humans in a natural setting—
that is, the setting in which the behavior would typically be expected to occur. This
variety of behavioral observation is referred to as naturalistic observation.
6. Role-Play
Tests Role play may be defi ned as acting an improvised or partially improvised part in
a simulated situation. A role-play test is a tool of assessment wherein assessees are
directed to act as if they were in a particular situation. Assessees may then be
evaluated with regard to their expressed thoughts, behaviors, abilities, and other
variables.
7. Computers as Tools
Computers can serve as test administrators (online or offline) and as highly efficient
test scorers. Computers can derive not only test scores but patterns of test scores.
The term computer assisted psychological assessment (CAPA ) refers to the
assistance computers provide to the test user, not the test-taker. Another term
computer adaptive testing (CAT), is a reference to the computer’s ability to tailor
the test to the test-taker’s ability or test-taking pattern. For example, on a
computerized test of academic abilities, the computer might be programmed to
switch from testing math skills to English skills after three consecutive failures on
math items.
8. Other Tools
•Video recordings of actual occurrences
•Tools traditionally associated with medical health, such as thermometers, blood
pressure measuring instruments
In general, there has been no shortage of innovation on the part of psychologists in
devising measurement tools, or adapting existing tools, for use in psychological
assessment.