Chapter 3
Chapter 3
Key Terms
clarity of the learning target
appropriateness of assessment tools
validity
reliability
fairness
objectivity
comprehensiveness
ease in scoring and administering
practicality and efficiency
table of specification
matching type of test
multiple-choice test
true or false test
completion test or short answer test
objective test
subjective test
stem
distracters
key option
performance-based assessment
portfolio assessment
oral questioning
self-report
Learning Outcomes
Ebel and Frisbie (1999) as cited by Garcia (2008) listed five basic principle
that should guide teachers in assessing the learning progress of the students and in
developing their own assessment tools. These principles are discussed below.
Test construction believed that every assessment tool should possess good
qualities. Most literatures consider the most common technical concepts in
assessment are the validity and reliability. For any type of assessment, whether
traditional or authentic, it should be carefully developed so that it may serve
whatever purpose it is intended for and the test results must be consistent with the
type of assessment that will be utilized.
In this section, we shall discuss the different terms such as clarity of the learning
target, appropriateness of an assessment tool, fairness, objectivity, comprehensiveness,
and ease of scoring and administering. Once these qualities of a good test are taken into
consideration in developing an assessment tool, the teacher will have accurate
information about the performance of each individual pupils or student.
When a teacher plans for his classroom instruction, the learning target should
be clearly stated and must be focused on student learning objectives rather than
teacher activity. The learning outcomes must be Specific, Measurable, Attainable, Realistic
and Time-bound (SMART) as discussed in the previous chapter. The performance task of the
students should also be clearly presented so that they can accurately demonstrate
what they are supposed to do and how the final product should be done. The teacher
should also discuss clearly with the students the evaluation procedures, the criteria to
be used and the skills to be assessed in the task.
The type of test used should always match the instructional objectives or
learning outcomes of the subject matter posed during the delivery of the instruction.
Teachers should be skilled in choosing and developing assessment methods appropriate
for instructional decisions. The kinds of assessment tools commonly used to assess
the learning progress of the students will be discussed in details in this chapter and in
the succeeding chapter.
1. Objective Test. It is a type of test that requires students to select the correct
response from several alternatives or to supply a word or short phrase to
answer a question or complete a statement. It includes true-false, matching
type, and multiple-choice questions. The word objective refers to the scoring,
it indicates that there is only one correct answer.
2. Subjective Test. It is a type of test that permits the student to organize and
present an original answer. It includes either short answer questions or long
general questions. This type of test has no specific answer. Hence, it is usually
scored on an opinion basis, although there will be certain facts and
understanding expected in the answer.
3. Performance Assessment. (Mueller, 2010) is an assessment in which
students are asked to perform real-world tasks that demonstrate meaningful
application of essential knowledge and skills. It is can appropriately measure
learning objectives which focus on the ability of the students to demonstrate
skills or knowledge in real-life situations.
4. Portfolio Assessment. It is an assessment that is based on the systematic,
longitudinal collection of student work created in response to specific known
instructional objectives and evaluated in relation to the same criteria (Ferenz, K.,
2001). Portfolio is a purposeful collection of student’s work that exhibits that
student’s efforts, progress and achievements in one or more areas over a
period of time. It measures the growth and development of students.
5. Oral Questioning. This method is used to collect assessment data by asking oral
questions. The most commonly used of all forms of assessment in class, assuming
that the learner hears and shares the use of common language with the teacher
during instruction. The ability of the students to communicate orally is very
relevant to this type of assessment. This is also a form of formative assessment.
6. Observation Technique. Another method of collecting assessment data is
through observation. The teacher will observe how students carry out certain
activities either observing the process of product. There are two types of
observation techniques: formal and informal observations. Formal observation
are planned in advance like when the teacher assess oral report or presentation
in class while informal observation is done spontaneously, during instruction like
observing the working behavior of students while performing a laboratory
experiment in a biology class and the like. The behavior of students involved in
his performance during instruction is systematically monitored, described,
classified, and analyzed.
7. Self-report. The response of the students may be used to evaluate both
performance and attitude. Assessment tools could include sentence
completion, likert scales, checklists, or holistic scales.
Let us discuss in details the different steps needed in developing good assessment
tools. Following the different steps is very important so that the test items developed
will measure the different learning outcomes appropriately. In this case, the teacher
can measure what is supposed to measure. Consider the following discussions in
each step.
Examine the instructional Objectives of the Topic Previously Discussed
Table of Specification (TOS) is a chart or table that details the content and level
of cognitive level assessed on a test as well as the types and emphases of test items
(Gareis and Grant, 2008). Table of specification is very important in addressing the
validity and reliability of the test items. The validity of the test means that the
assessment can be used to draw appropriate result from the assessment because the
assessment guarded against any systematic error.
Table of specification provides the test constructor a way to ensure that the
assessment is based from the intended learning outcomes. It is also a way of ensuring
that the number of questions on the test is adequate to ensure dependable results that
are not likely caused by chance. It is also a useful guide in constructing a test and in
determining the type of test items that you need to construct.
Below are the suggested steps in preparing a table of specification used by the
test constructor. Consider these steps in making a two-way chart table of
specification. See also format 1 of the Table of Specification for the other steps.
d. Make the two-way chart s shown in the format 2 and format 3 of a Table of
Specification.
e. Construct the test items. A classroom teacher should always follow the
general principle of constructing test items. The test item should always correspond
with the learning outcome so that it serves whatever purpose it may have.
If properly prepared, a table of specification will help you limit the coverage of test
and identify the necessary skills or cognitive level required to answer the test item
correctly.
The first format of a table of specification is composed of the specific objectives, the
cognitive level, type of test used, the item number and the total points needed in each
item. Below is the template of the said format.
Type of Test Item identifies the type or kind of test a test item belongs to. Examples
of entries in the column could be “multiple-choice, true or false, or even essay.”
Item Number simply identifies the question number as it appears in the test.
Note:
The number if item for each level will depend on the skills the teacher wants to
develop in his students. In the case of tertiary level, the teacher must develop more
higher-order thinking skills (HOTS) questions.
For elementary and secondary levels, the guidelines in constructing test will be as
stipulated in the DepEd Order 33, Series 2004 must be followed. That is, factual
information 60%, moderately difficult or more advanced questions 30% and higher
order thinking skills 10% for distinguishing honor students.
In this section, we shall discuss the different format of objective type of test
items, the steps in developing objective and subjective test, the advantages and its
limitations. The different guidelines of constructing different types of objective and
subjective test items will also be discussed in this section.
Kubiszyn and Borich (2007) suggested some general guidelines for writing
test items, to help classroom teachers improve the quality of test items to write.
1. Begin writing items far enough or in advance so that you will have time to revise
them.
2. Match items to intended outcomes at appropriate level of difficulty to provide
valid measure of instructional objectives. Limit the question to the skill being
assessed.
3. Be sure each item deals with an important aspect of the content area and not
with trivia.
4. Be sure the problem posed is clear and unambiguous.
5. Be sure that the item is independent with all other items. The answer to one
item should not be required as a condition in answering the next item. A hint
to one answer should not be embedded to another item.
6. Be sure the item has one or best answer on which expert would agree.
7. Prevent unintended clues to an answer in the statement or question.
Grammatical inconsistencies such a or an give clues to the correct answer to
those students who are not well prepared for the test.
8. Avoid replication of the textbook in writing test items; do not quote directly from
the textual materials. You are usually not interested in how well students
memorize the text. Besides, taken out of context, direct quotes from the text
are often ambiguous.
9. Avoid trick or catch questions in an achievement test. Do not waste time
testing how well the students can interpret your intentions.
10. Try to write items that require higher-order thinking skills.
Consider the following average time in constructing the number of test items.
The length of time and the type of item used are also factors to be considered in
determining the number of items to be constructed in an achievement test. These
guidelines will be very important in determining appropriate assessment for college
students.
Assessment Format Average Time to Answer
True-false 30 seconds
Multiple-choice 60 seconds
Multiple-choice of higher level
90 seconds
learning objectives
Short Answer 120 seconds
Completion 60 seconds
Matching 30 seconds per response
Short Essay 10-15 minutes
Extended Essay 30 minutes
Visual Image 30 seconds
The number of items included in a given assessment will also depend on the
length of the class period and the type of items utilized. The following guidelines
will assist you in determining an assessment appropriate for college-level students
aside from the previous formula discussed.
Yes No
The item is appropriate to measure a learning objective.
The item format is the most effective means of measuring the desired
knowledge.
The item is clearly worded and can be easily understood by the target
student population.
The items of the same format are grouped together.
There are various item types include in the assessment.
The students have enough time to answer all test items.
The test instructions are specific and clear,.
The number of questions targeting each objective matches the weight
of importance of that objective.
The scoring guidelines are discussed clearly and available to students.
After constructing the test items following the different principles of constructing
test item, the next step to consider is to assemble the test items. There are two steps
in assembling the test: (1) packaging the test; and (2) reproducing the test,.
a. Group all test items with similar format. All items in similar format must be
grouped so that the students will not be confused.
b. Arrange test items from easy to difficult. The test items must be arranged
from easy to difficult so that students will answer the first few items correctly
and build confidence at the start of the test.
c. Space the test items for easy reading.
d. Keep items and option in the same page.
e. Place the illustrations near the description.
f. Check the answer key.
g. Decide where to record the answer.
Write Directions
Check the test directions for each item format to be sure that it is clear for the
students to understand. The test direction should contain the numbers of items to
which they apply; how to record their answers; the basis of which they select answer;
and the criteria for scoring or the scoring system.
Before reproducing the test, it is very important to proofread first the test items
for typographical and grammatical errors and make necessary corrections if any. If
possible, let others examine the test to validate its content. This can save time during
the examination and avoid destruction of the students.
Be sure to check your answer key so that the correct answers follow a fairly
random sequence. Avoid answers such as TFTFTF, etc., or TTFFF for a true or false
type, and A B C D A B C D patterns for multiple-choice type. The number of true
answers must be equally the same with dales answers and also among the multiple-
choice options.
Analyzing and improving the test should be done after checking, scoring and
recording the test. The details of this part will be discussed in the succeeding chapter.
There are two general types of test item to use in achievement test using paper
and pencil test. It is classified as selection-type items and supply type items.
Selection type items require students to select the correct response from
several options. This is also known as objective test item. Selection type items can be
classified as: Multiple-choice; matching type; true or false; or interpretative
exercises.
Objective test item requires only one correct answer in each item.
Kinds of Objective Type Test
In this section, we shall discuss the different format of objectives types of test
items and the general guidelines in constructing multiple-choice type of test,
guidelines in constructing the stem, options and distracters, advantages and
disadvantages of multiple-choice, guidelines in constructing matching type of test,
advantages and disadvantages of matching type of test, guidelines in constructing
true or false and comprehension types of test, advantages and disadvantages of true
or false and interpretative exercises.
a. Multiple-choice Test
Multiple-choice item consists of three parts: the stem, the keyed option and the
incorrect options or alternatives. The stem represents the problem or question usually
expressed in completion form or question form. The keyed option is the correct answer.
The incorrect options or alternatives also called distracters or foil.
1. Knowledge Level
The most stable measure(s) of central tendency is the .
A. Mean
B. Mean and median
C. Median
D. Mode
This kind of question is a knowledge level type because the students are required
only to recall the properties of the mean. The correct answer is option A.
2. Comprehension Level
Which most of the following statements describe normal distribution?
A. The mean is greater than the median.
B. The mean median and mode are equal.
C. The scores are more concentrated at the other part of the distribution.
This kind of question is a comprehension level type because the students are
required to describe the scores that are normally distributed. The correct
answer is option B.
3. Application Level
What is the standard deviation of the following scores of 10 students in
mathematics quiz, 10, 13, 16, 16, 17, 19, 20, 20, 20, 25?
A. 3.90
B. 3.95
C. 4.20
D. 4.25
This kind of question is an application level because the students are asked to
apply the formula and solve for the variance. The correct answer is option C.
4. Analysis Level
What is the statistical test used when you test the mean difference between
pre- test?
A. Analysis of variance
B. t-test
C. Correlation
D. Regression analysis
This kind of question is an example of analysis level type because students are
required to distinguish which type of test is used. The correct answer is option B.
b. Matching type
Matching type item consist of two columns. Column A contains the
description and must be place at the left side while column B contains the
options and placed at the right side. The examinees are asked to match the
option that are associated with the description(s).
Guideline in Constructing Matching types of Test
Direction: Match the function of the part of computer in Column A with its name
in Column B. Write the letter of your choice before the number.
Column A Column B
_____ 1. Stores information waiting to be used A. Central Processing Unit
_____ 2. Consider as the brain of the computer B. Hard Drive
_____ 3. Hand-held device used to move the cursor C. Hardware
_____ 4. An example of an output device D. Mass Storage Device
_____ 5. Stores permanent information in the computer E. Mouse
_____ 6. Physical aspect of the computer F. Monitor
_____ 7. Used to display the output G. Processor
_____ 8. The instruction fed into the computer H. Printer
_____ 9. Pre-loaded data I. Random Access Memory
______ 10. Permits a computer to store large amount of data J. Read Only Memory
K. Software
L. Universal Serial Bus
Another format of an objective type of test is the true or false type of test items.
In this type of test, the examinees determine whether the statement presented true or
false. True or false test item is an example of a “force-choice test” because there are only
two possible choices in this type of test. The students are required to0 choose the
answer true or false in recognition to a correct statement or incorrect statement.
True or False type of test is appropriate in assessing the behavioral objectives such
as “identify” “select,” or “recognize”. It is also suited to assess the knowledge and
comprehension level in cognitive domain. This type of test is appropriate when there
are only two plausible alternatives or distracters.
Direction: Write your answer before the number in each item. Write T if the
statement is true and F if the statement if false.
food.
1. Limited only to low level of thinking skills such as knowledge and comprehension,
or recognition or recall information.
2. High probability of guessing the correct answer (5%) compared to multiple-
choice which consist of four option (25%).
Supply type items require students to create and supply their own answer or
perform a certain task to show mastery of knowledge or skills. It is also known as
constructed response test. Supply type items or constructed response test are
classified as:
Subjective test item requires the students to organize and present an original
answer (essay test) and perform task to show mastery of learning (performance-
based assessment and portfolio assessment) or supply a word or phrase to answer a
certain question (completion or short answer type of test).
Essay test is a form of subjective type of test. Essay test measures complex cognitive
skills or processes. This type of test has no one specific answer per students. It is usually
scored on an option basis, although there will be certain facts and understanding
expected in the answer. There are two kinds of essay items: extended response essay
and restricted response essays.
The teacher must present and discuss the criteria used in assessing the answer of
the students in advance to help them to prepare from the test.
Essay Item 1. Which supply type Essay Item 1. Supply type item used to
item is used to measure the ability to measure the ability too organize and
organize and integrated material? integrate material is called .
1. It is only appropriate for questions that can be answered with short responses.
2. There is a difficult in scoring when the questions are not prepared properly
and clearly. The question should be clearly stated so that the answer of the
student is clear.
3. It can assess only knowledge, comprehension and application levels in
Bloom’s taxonomy of cognitive domain.
4. It is not adaptable in measuring complex learning outcomes.
5. Scoring is tedious and time consuming.
b. Essay Items
It is appropriate when assessing students’ ability to organize and present their
original ideas. It consists of a few number of questions wherein the examinee is
expected to demonstrate the ability to recall factual knowledge; organize his
knowledge; and present his knowledge in logical and integrated answer.
There are two types of essay item: extended response and restricted response
essay.
An essay test that allows the students to determine the length and complexity of
the response is called extended response essay item (Kubiszyn and Borich, 2007). It is
very useful in assessing the synthesis and evaluation skills of the students. When
the objective is to determine whether the students can organize ideas, integrated
and express ideas, evaluate information in knowledge, it is best to use extended
response essay test.
Using extended response essay item has advantages and disadvantages. Advantages
are: demo9nstrate learning outcomes at the synthesis and evaluation levels; evaluate
the answers with sufficient reliability to provide useful measures of learning; provides
more freedom to give responses to the question and provide creative integration of
ideas. Disadvantages are: more difficult to construct extended response essay
questions; scoring is time consuming than restricted response essay.
1. Present and describe the modern theory of evolution and discuss how it is
supported by evidence from the areas of (a) comparative anatomy, (b)
population genetic.
2. From the statement, “Wealthy politicians cannot offer fair representation to
all the people.” What do you think is the reasoning of the statement?
Explain your answer.
An essay item that places strict limits on both content and the response given by
the students is called restricted response essay item. In this type of essay, the
content is, usually restricted by the scope of the topic to be discussed and the
limitations on the form of the response is indicated in the question.
When there is a restriction on the form and scope of the answer of the
students in an essay test, there can be advantages and disadvantages. The advantages
are: it is easier to prepare questions; it is easier to score; and it is more directly
related to the specific learning outcomes. The disadvantages are: it provides little
opportunity for the students to demonstrate their abilities to organize ideas, to
integrate materials, and to develop new patterns of answers; it measures learning
outcomes at comprehension, application and analysis levels only.
1. List the major facts and opinions in the first state of the nation address
(SONA) of Pres. Benigno Cojuangcon Aquino, Jr. Limit your answer to one
page only. The score will depend on the content, organization and accuracy of
your answer.
2. Point out the strength =s and weaknesses of a multiple-choice type of test.
Limit your answer to five strengths and five weaknesses. Explain each
answer in not more than two sentences.
1. Choose a leader you admire most and explain why you admire him or her.
2. Pick a controversial issue in the Aquino administration. Discuss the issue and
suggest a solution.
3. If you were the principal of a certain school, describe how would you
demonstrate your leadership ability inside and outside of the school.
4. Describe the difference between Norm-referenced assessment and
Criterion- referenced assessment.
5. Do you agree or disagree with the statement, “Education comes not from
books but from practical experience. “Support your position.
1. It is easier to prepare and less time consuming compared to other paper and
pencil tests.
2. It measures higher-order thinking skills (analysis, synthesis and evaluation).
3. It allows students’ freedom to express individuality in answering the given
question.
4. The students have a chance to express their own ideas in order to plan their
own answer.
5. It reduces guessing answer compared to any of the objective type of test.
6. It presents more realistic task to the students.
7. It emphasizes on the integration and application of ideas.
Yes No
The test item is appropriate for measuring the intended learning outcomes.
The test item task matches with the learning task to be measured.
The questions constructed measure complex learning outcomes.
It is states in the questions what is being measured and how the answer
are to be evaluated.
The terminology used clarified and limits the task.
All students are required to answer the same question.
There is an establish time limit to answer each question.
Provisions for scoring answers are given (criteria for evaluating answer).
Chapter Exercises
10. What are the advantages of a subjective or essay type of test over objective type of test?
11. Differentiate extended response essay test items from restricted response essay test items.
12. Discuss the limitations of essay test items in terms of scoring the quality of the responses from the
students.
13. When is it appropriate to give an essay test?
14. Identify the different guidelines in rating essay test items.
15. Construct an essay test using Krathwolh's 2001 revised cognitive domain.
a. Remembering
b. Understanding
c. Applying
d. Analyzing
e. Evaluating
f. Creating