Assessment Notes
Assessment Notes
What Is Intelligence?
We may define intelligence as a multifaceted capacity that manifests itself in different
ways across the life span. In general, intelligence includes the abilities to:
■ acquire and apply knowledge ■ reason logically
■ plan effectively ■ make sound judgments and solve
problems
■ grasp and visualize concepts ■ pay attention
■ be intuitive ■ find the right words and thoughts
with facility
■ cope with, adjust to, and make the most of new situations
Intelligence Defined: Views of the Lay Public
Research 1
For the nonpsychologists, the behaviors most commonly associated with intelligence
were “reasons logically and well,” “reads widely,” “displays common sense,” “keeps an open
mind,” and “reads with high comprehension.”
Research 2 and 3
In another study (Siegler & Richards, 1980), students enrolled in college developmental
psychology classes were asked to list behaviors associated with intelligence in infancy,
childhood, and adulthood
A study conducted with first-, third-, and sixth-graders
Intelligence Defined: Views of Scholars and Test Professionals
Galton (1883) believed that the most intelligent persons were those equipped with the
best sensory abilities.
For Binet, these components included reasoning, judgment, memory, and abstraction
David Wechsler: Intelligence, operationally defined, is the aggregate or global capacity
of the individual to act purposefully, to think rationally and to deal effectively with his
environment.
History of IQ test
Paul Broca (1824-1880) and Sir Francis Galton (1822-1911)
could determine intelligence by measuring the size of the human skull.
Wilhelm Wundt (1932-1920) used introspection - the human ability to reflect on their
own thoughts
Now these ideas are outdated
• In 1904 a French professional group for child psychology, was called upon by the French
government to appoint a commission on the education of retarded children. Binet, being
an active member of this group, found the impetus for the development of his mental
scale.
• In 1905 Binet and Simon developed the first Intelligence Test known as the Binet-Simon
Scale. The test was developed in order to identify children with learning disabilities so
that they might be placed in a special class.
• 1908, H.H. Goddard
• 1916 Lewis Terman
• The revised Stanford-Binet scale measured five weighted factors and consists of both
verbal and nonverbal subtests.
The original Simon-Binet Intelligence Scale was composed of 30 tasks, including items on
memory, vocabulary, verbal ability, and reasoning as:
• Le Regard (coordination in the movement of the head and the eyes which is associated
with the act of vision)
• Prehension Provoked by a Tactile Stimulus (coordination exists b/w a tactile stimulus
of the hand, and the movement of seizing and carrying to the mouth)
• Prehension Provoked by a Visual Perception (coordination exists b/w the sight of an
object and its prehension, when the object is not placed in contact with the hand of the
subject).
• Recognition of Food (subject can make the distinction by sight b/w familiar food and
what cannot be eaten)
• Quest of Food Complicated by a Slight Mechanical Difficulty (to bring into play a
rudiment of memory, an effort of will, and a coordination of movements)
1-Verbal Knowledge of Objects 2-Verbal Knowledge of Pictures
3- Naming of Designated Objects 4- Comparison of Two Weights
5- Immediate Comparison of Two Lines of Unequal Lengths
6-Repetition of Three Figures 7- Suggestibility
8-Verbal Definition of Known Objects 9-Definitions of Abstract Terms
10- Repetition of Sentences of Fifteen Words
11- Comparison of Known Objects from Memory 12- Exercise of Memory on Pictures
13- Drawing a Design from Memory 14- Immediate Repetition of Figures
15- Resemblances of Several Known Objects Given from Memory
16- Comparison of Lengths 17- Five Weights to be Placed in Order
18- Gap in Weights 19- Exercise upon Rhymes
20- Verbal Gaps to be Filled
21- Synthesis of Three Words in One Sentence 22- Reply to an Abstract Question
23- Reversal of the Hands of a Clock 24- Paper Cutting
REVISIONS OF BINET INTELLIGENCE SCALE
• April 1905: Development of Binet-Simon Test announced at a conference in Rome
• June 1905: Binet-Simon Intelligence Test introduced
• 1908 and 1911: New Versions of Binet-Simon Intelligence Test
• 1916: Stanford–Binet First Edition by Terman
• 1937: Second Edition by Terman and Merrill
• 1973: Third Edition by Merrill
• 1986: Fourth Edition by Thorndike, Hagen, and Sattler
• 2003: Fifth Edition by Roid
STANFORD-BINET 5th ADDITION
• Fifth Edition is based in the schooling process to assess intelligence.
• It is also capable of measuring multiple dimensions of abilities.
• It can be administered to individuals as early as two years of age.
• There are ten subsets included in this revision including both verbal and nonverbal
domains.
• Five factors are also incorporated as;
Fluid Knowledge Quantitative Visual-Spatial Working
Reasoning Reasoning Processing Memory
Early reasoning Vocabulary Non-verbal Form board and Delayed
quantitative form patterns response (non-
reasoning(non- (non-verbal) verbal)
verbal)
Verbal Procedural Verbal Position and Block span (non-
absurdities knowledge (non- quantitative direction verbal)
verbal) reasoning
Verbal analogies Picture Memory for
absurdities (non- sentences
verbal)
Object series Last word
matrices(non-
verbal)
• Depending on age and ability, administration can range from 15 minutes to 1 hour 15
minutes.
• It incorporated a new scoring system, which can provide a wide range of information
such as four intelligence score composites, five factor indices, and ten subtest scores.
• Additional scoring information includes percentile ranks, age equivalents. Extended IQ
scores and gifted composite scores are available.
• In order to reduce errors and increase diagnostic precision, scores are obtained
electronically through the use of computers now.
IQ Range IQ Classification
145-160 Very gifted or highly advanced
130-144 Gifted or very advanced
120-129 Superior
110-119 High average
90-109 Average
80-89 Low average
70-79 Borderlined impaired or delayed
55-69 Midly impaired or delayed
40-54 Moderately impaired or delayed
RELIABILITY
Several reliability tests performed on it are split-half reliability, standard error of measurement
and test-retest stability. On average, the IQ scores for this scale have been found to be quite
stable across time (Janzen, Obrzut, & Marusiak, 2003).
Internal consistency was tested by split-half reliability and was reported to be substantial and
comparable to other cognitive batteries (Bain & Allin, 2005).
The test has also been found to have great precision at advanced levels of performance meaning
that the test is especially useful in testing children for giftedness (Bain & Allin, 2005).
Re-administration can occur in a six-month interval rather than one year due to the small mean
differences in reliability (Bain & Allin, 2005).
VALIDITY
Content validity has been found based on the professional judgments Roid received concerning
fairness of items and item content as well as items concerning the assessment of giftedness (Bain
& Allin, 2005).
CRITICISM:
• The test is not being able to compare people of different age categories, since each
category gets a different set of tests.
• Furthermore, very young children tend to do poorly on the test due as they lack the ability
to concentrate long enough to finish it.
PRESENT USE:
• Clinical and neuropsychological assessment
• Educational placement,
• Career assessment,
• Adult neuropsychological treatment,
• Forensics and research on aptitude.
WECHSLER SCALES
• David Wechsler (1896 - 1981)
• American psychologist
• Developed prominent intelligence scales:
Wechsler Adult Intelligence Scale (WAIS)
Wechsler Intelligence Scale for Children (WISC)
Wechsler’s Criticism on Binet Scale
Wechsler was an influential advocate for concept of non-intellective factors
(non-intellective factors are variables that contribute to the overall score in intelligence, but are
not made up of intelligence-related items. These include lack of confidence, fear of failure,
attitudes etc).
• He felt that the 1937 Binet scale did not do a good job of incorporating these factors into
the scale
• Argued that the Binet scale items focused on use with children rather than adults.
• The "Binet scale's emphasis on speed, with timed tasks scattered throughout the scale,
tended to unduly handicap older adults."
• He believed that "mental age norms clearly did not apply to adults."
• He criticized Binet scale because "it did not consider that intellectual performance could
deteriorate as a person grew older. “
These criticisms of the 1937 Binet test helped produce the Wechsler–Bellevue scale, released in
1939.
Many of the original concepts Wechsler argued for, have become standards in psychological
testing, including the point-scale concept and the performance-scale concept.
AGE SCALE (BINET)
• In the Binet scales (prior to the 1986 version) items were grouped according to age level.
• Each of these age levels was composed of a group of tasks that could be passed by two-
thirds to three-quarters of the individuals in that level. This meant that items were not
arranged according to content.
• Additionally, an individual taking a Binet test would only receive credit if a certain
number of the tasks were completed. This meant that falling short just one task required
for the credit, resulted in no credit at all
For example, if passing three out of four tasks was required to receive credit, then passing two
yielded no credit)
POINT SCALE (WECHSLER)
• The point scale concept assigned credits or points to each item of the test.
• This had two large effects. First, this allowed items to be grouped according to content.
• Second, participants were able to receive a set number of points or credits for each item
passed.
• The result was a test that could be made up of different content areas (or subtests) with
both an overall score and a score for each content area. In turn, this allowed for an
analysis to be made of an individual's ability in a variety of content areas.
NON-VERBAL PERFORMANCE SCALE
• Essentially, this scale required a subject to do something (such as copying symbols or
point to a missing detail) rather than just answer questions. This was an important
development as it attempted to overcome biases that were caused by "language, culture,
and education."
• Further, this scale also provided an opportunity to observe a different type of behavior
because something physical was required. Clinicians were able to observe how a
participant reacted to the "longer interval of sustained effort, concentration, and attention"
that the performance tasks required.
• IQ test designed to measure intelligence and cognitive ability in adults and older
adolescents.
• The WAIS was initially created as a revision of the Wechsler–Bellevue Intelligence Scale
(WBIS), which was a battery of tests published by Wechsler in 1939.
• First released in February 1955 by David Wechsler
• The WBIS was composed of subtests that consisted of six verbal and five performance
subtests. The verbal subtests were: Information, Comprehension, Arithmetic, Digit
Span, Similarities, and Vocabulary. The Performance subtests were: Picture
Arrangement, Picture Completion, Block Design, Object Assembly, and Digit Symbol.
• A verbal IQ, performance IQ and full scale IQ were obtained.
WAIS-R
• A revised form was released in 1981 and this edition did not provide new validity data,
but used the data from the original WAIS.
WAIS-III
• It was released in 1997. In this version four secondary indices were introduced (Verbal
Comprehension, Working Memory, Perceptual Organization, and Processing Speed)
WAIS-IV
• Was released in 2008
• It is appropriate for use with individuals aged 16–90 years.
• Composed of 10 core subtests and 5 supplemental subtests, with the 10 core subtests
comprising the Full Scale IQ.
• The verbal/performance subscales were replaced by the index scores, General Ability
Index (GAI), The GAI is clinically useful because it can be used as a measure of
cognitive abilities that are less vulnerable to impairments of processing and working
memory.
Index scores and scales
• Verbal Comprehension Index (VCI)
• Perceptual Reasoning Index (PRI)
• Working Memory Index (WMI)
• Processing Speed Index (PSI)
Two broad scores can be used to summarize general intellectual abilities, can also be derived:
• Full Scale IQ (FSIQ), based on the total combined performance of the VCI, PRI, WMI,
and PSI
• General Ability Index (GAI), based only on the six subtests that the VCI and PRI
• Has three validity scales: the L scale (the Lie scale), the F scale (the Frequency scale),
and the K scale (Correction scale).
• The L scale will call into question the examinee’s honesty.
• The F scale contains items that are infrequently endorsed by members of nonpsychiatric
populations and do not fall into any known pattern of deviance, which can help determine
if the examinee responded to questions randomly and if he/she is a malingerer.
• The K score is associated with defensiveness and social desirability.
• The MMPI has a fourth scale, the Cannot Say scale (denoted with?), which functions as a
frequency count of the number of items to which the examinee responded cannot say or
failed to mark any response.
• The validity of an answer sheet with a cannot say count of 30 or higher is called into
question.
• Harris-Lingoes subscales are groupings of items into subscales (with labels such as
Brooding) that were designed for internal consistency.
Following publication it was found that the MMPI could not be scored into neat
diagnostic categories. Hathaway and McKinley (1943) suggested a configural interpretation
of scores, that is, interpretation based on a pattern or profile. Paul Meehl (1951) proposed a
2-point code derived from the numbers of clinical scales on which the test taker achieved the
highest scores. Welsh codes were another popular approach to scoring and interpretation.
The MMPI-2 is quite similar to its predecessor, though some important differences exist,
• Items that addressed topics such as drug abuse, suicidality, marital adjustment,
attitudes toward work, and Type A behavior patterns were added.
• Three additional validity scales were added: Back-Page Infrequency (Fb), True
Response Inconsistency (TRIN), and Variable Response Inconsistency (VRIN).
The MMPI-2 RF was devised in response to two basic problems with the MMPI-2.
• Overlapping item: There was an average of more than six overlapping items per
pair of clinical scales in MMPI-2.
• What a person can do (hasn’t done • What a person has already done
yet) • What a person has generally learned
• How well a person might perform in prior to being tested
school or employment situation • Measures recent learning in specific
• Examine broader range of knowledge subjects
and experience • Measures acquired skill or knowledge
• Measures natural talents abilities • Do not measure higher order thinking
• Higher order skills skills, problem solving and teamwork
• Measure certain special abilities that a • Assesses specific subject or area of
person may possess knowledge
• Assess persons overall performance
over a broad area of mental
capabilities
• Provides general profile of persons
strengths and weaknesses
Theories Of Measurement
• Lead to descriptions of examinees' abilities, independent of particular choice of test items
and sample.
• They provide framework for considering issues and addressing technical problems (e.g.
issue of measurement error)
• Theories: General framework of linking observable variables to unobservable variables
such as true scores or ability scores
• Models specify a detailed relationship of the variables in light of theoretical concepts
• Models provide incomplete representation of the test data to which they fit
• So, all models are wrong, but some are useful.