Part 7 Evaluation
Part 7 Evaluation
HCI 2019
Introduction
Evaluations assess design and implementation
Formative evaluation during development (cook tastes
the soup)
Summative evaluation at completion of project (guests
taste the soup)
Ideally, evaluation should be considered at all stages in
the design life cycle
◦ That's practically difficult but formal methods (such as expert-
based methods – coming on later slides) can and should be
used
◦ Later, we’ll talk about how to plan and conduct actual user
evaluations
Introduction
Since evaluation should be considered at all life cycle stages, there is a link
between:
◦ evaluation & design techniques
◦ evaluation & prototyping techniques
◦ evaluation & implementation techniques
◦ Etc.
Goals of Evaluation
Assess extent of system performance and functionality (the tasks that users are
interested in)
Assess effect of the user interface on the user
◦ (the user's experience of the interaction e.g., easy to learn, easy to use, satisfaction
etc., and the usability attributes e.g., learnability, speed of operation, robustness,
recoverability, adaptability)
❖The enlisted experts are provided with the proper roles (and sometimes scenarios to
use) to support them when interacting with the system/prototype under evaluation
❖They assess the user interface as a whole and also the individual user interface
elements
❖When all the experts are through with the assessment, they come together and
compare and appropriately aggregate their findings
Cognitive Walkthrough
❖Cognitive Walkthrough (CW) was proposed by Polson et
al.
❖CW evaluates design on how well the design supports
user in learning the task to be performed [primarily
through exploration i.e. hands on]
❖CW is usually performed by expert in cognitive
psychology
❖The expert ‘walks through’ the design [i.e. steps through
each step of some known/representative task] to identify
potential problems
Cognitive Walkthrough - requirements
Four requirements in order to perform the
CW:
❖ specification or prototype of the system
❖ description of the task the user is to
perform
❖ complete, written list of actions
constituting the task
❖ description of the user (including the
level of experience and knowledge)
Cognitive Walkthrough – the process
With the foregoing information, the evaluator steps through
each of the actions trying to answer the following 4 questions:
◦ is the effect of the action the same as the user's goal at that
point? [what the action will do/action's effect should be what
the user intends/user's goal.]
◦ will users see that the action is available [when they want it] -
visibility at that time?
◦ once users have found the correct action [as in the foregoing],
will they know/recognize it is the one they need? [effective
representation of the action, clear representation.]
◦ after the action is taken, will users understand the feedback
they get? [effective confirmation that the action has been
taken.]
Cognitive Walkthrough – the process
Forms are used to guide analysis e.g.
◦cover form [for the four requirements above, date,
time, evaluators of the CW],
◦answer form [for answering the four questions
above],
◦usability problem report [for describing any
negative answers/problems, severity of the
problem e.g. frequency of occurrence and
seriousness of the problem, date, time, evaluators]
Reading Assignment: Model-Based Evaluation
Disadvantages:
◦ Subjective [really depends on the user]
◦ Selective [out of many things, the user may choose what to describe]
◦ Act of describing may alter task performance
Observational Methods
Cooperative Evaluation
Reading Assignment:
◦Describe this evaluation method
Observational Methods
Protocol Analysis
Paper and pencil: cheap, limited to writing speed
Audio: good for think aloud, difficult to record sufficient information to identify exact
actions in later analysis, difficult to match with other protocols ('synchronization')
Video: accurate and realistic, needs special equipment, obtrusive
Computer logging: automatic and unobtrusive, large amounts of data difficult to
analyze
User notebooks: coarse and subjective, useful insights, good for longitudinal studies
Note:
Mixed use in practice
Audio/video transcription difficult and requires skill
Some automatic support tools available e.g., EVA (Experimental Video Annotator),
Observer Pro (from Noldus), Workplace project (Xerox PARC), etc.
Observational Methods
Post-task Walkthrough
Disadvantages:
◦ Lack of freshness
◦ May be post-hoc interpretation of events
Query Techniques
Query techniques
◦Questionnaires
◦ interviews
Physiological Monitoring Methods
Physiological Monitoring Methods such as eye tracking,
measuring skin conductance, measuring heart rate etc.
Example: Eye-tracking
Head or desk mounted equipment tracks the position of the eye
Eye movement reflects the amount of cognitive processing a
display requires
Measurements include: fixations, scan paths, etc. For instance:
◦ number of fixations
◦ duration of fixation
◦ scan paths: moving straight to a target with a short fixation at the target is
optimal
Physiological Monitoring Methods
Physiological Measurements
Emotional response linked to physical changes
These may help determine a user’s reaction to a user interface
Measurements include: heart, sweat, muscle, brain. For instance:
◦ heart activity: e.g. blood pressure, volume and pulse.
◦ activity of sweat glands: Galvanic Skin Response (GSR)
◦ electrical activity in muscle: electromyogram (EMG)
◦ electrical activity in brain: electroencephalogram (EEG)