Kmklo
Kmklo
evaluation techniques
Evaluation?
• Evaluation
– tests usability and functionality of system
– occurs in laboratory, field and/or in collaboration
with users
– evaluates both design and implementation
– should be considered at all stages in the design life
cycle
Background
• Model-based evaluation
• Model-based evaluation
– cognitive and design models provide a
means of combining design specification
and evaluation
– GOMS (goals, operators, methods and
selection) model
– predicts user performance with a particular
interface and can be used to filter particular
design options.
Review-based evaluation
• keystroke-level model
– lower-level modeling techniques provide
predictions of the time users will take to
perform low-level physical tasks
Review-based evaluation
• Advantages:
– specialist equipment available
– uninterrupted environment
• Disadvantages:
– lack of context
– difficult to observe several users cooperating
• Appropriate
– if system location is dangerous or impractical for
constrained single user systems to allow controlled
manipulation of use
Field Studies
• Advantages:
– natural environment
– context retained (though observation may alter it)
– longitudinal studies possible
• Disadvantages:
– distractions
– Noise
– movement
• Appropriate
– where context is crucial for longitudinal studies
Evaluating Implementations
Requires an artefact:
simulation, prototype,
full implementation
Experimental evaluation
• Subjects
– who – representative, sufficient sample
• Variables
– things to modify and measure
• Hypothesis
– what you’d like to show
• Experimental design
– how you are going to do it
Subjects or participants
• prediction of outcome
– framed in terms of IV and DV
• null hypothesis:
– states no difference between conditions
– aim is to disprove this
• parametric
– assume normal distribution
– robust
– powerful
• non-parametric
– do not assume normal distribution
– less powerful
– more reliable
Analysis of data (cont.)
Problems with:
– subject groups
– choice of task
– data gathering
– analysis
Subject groups
difficult to timetable
options:
– creative task
– decision games
– control task
– Decision Games (desert survival task)
Data gathering
problems:
– synchronisation
– sheer volume!
one solution:
– record from each perspective
Analysis
solutions:
– within groups experiments
Contrast:
psychology – controlled experiment
sociology and anthropology – open study and rich data
Observational Methods
Think Aloud
Cooperative evaluation
Protocol analysis
Automated analysis
Post-task walkthroughs
Think Aloud
• Advantages
– simplicity - requires little expertise
– can provide useful insight
– can show how system is actually use
• Disadvantages
– subjective
– selective
– act of describing may alter task performance
Cooperative evaluation
• Additional advantages
– less constrained and easier to use
– user is encouraged to criticize system
– clarification possible
Protocol analysis
• paper and pencil – cheap, limited to writing speed
• audio – good for think aloud, difficult to match with other
protocols
• video – accurate and realistic, needs special equipment,
obtrusive
• computer logging – automatic and unobtrusive, large amounts
of data difficult to analyze
• user notebooks – coarse and subjective, useful insights, good
for longitudinal studies
• Workplace project
• Post task walkthrough
– user reacts on action after the event
– used to fill in intention
• Advantages
– analyst has time to focus on relevant incidents
– avoid excessive interruption of task
• Disadvantages
– lack of freshness
– may be post-hoc interpretation of events
post-task walkthroughs
Interviews
Questionnaires
Interviews
• Advantages
– can be varied to suit context
– issues can be explored more fully
– can elicit user views and identify unanticipated
problems
• Disadvantages
– very subjective
– time consuming
Questionnaires
• Advantages
– quick and reaches large user group
– can be analyzed more rigorously
• Disadvantages
– less flexible
– less probing
Questionnaires (ctd)
• Styles of question
– general
– open-ended
– scalar
– multi-choice
– ranked
Physiological methods
Eye tracking
Physiological measurement
eye tracking