Part 7 Evaluation

The document discusses the importance of evaluation in the design life cycle, highlighting formative and summative evaluations. It categorizes evaluation techniques into expert-based, model-based, and user-based methods, detailing specific techniques such as Heuristic Evaluation and Cognitive Walkthrough. Additionally, it outlines various user-based evaluation methods, including experimental, observational, and physiological monitoring techniques, emphasizing the need to choose appropriate methods based on context and resources.

Uploaded by

reginamakena05

Part 7 – Evaluation

HCI 2019
Introduction
Evaluations assess design and implementation
Formative evaluation during development (cook tastes
the soup)
Summative evaluation at completion of project (guests
taste the soup)
Ideally, evaluation should be considered at all stages in
the design life cycle
◦ In practice that is difficult, but formal methods (such as the
expert-based methods covered on later slides) can and should be
used
◦ Later, we'll talk about how to plan and conduct actual user
evaluations
Introduction
Since evaluation should be considered at all life cycle stages, there is a link
between:
◦ evaluation & design techniques
◦ evaluation & prototyping techniques
◦ evaluation & implementation techniques
◦ Etc.
Goals of Evaluation
Assess extent of system performance and functionality (the tasks that users are
interested in)
Assess effect of the user interface on the user
◦ (the user's experience of the interaction e.g., easy to learn, easy to use, satisfaction
etc., and the usability attributes e.g., learnability, speed of operation, robustness,
recoverability, adaptability)

Identify specific problems (e.g., errors, confusion, unexpected results)

[the above goals are of course interrelated…]

Evaluation Techniques
Evaluation techniques can be categorized as
follows:
◦ Expert-based
◦ Model-based
◦ User-based (user-centered evaluation)
Expert-based Evaluation Techniques
Expert-based evaluation techniques are also referred to as expert
analysis techniques
Evaluation through expert analysis is done because:
◦ It can be expensive to regularly carry out user tests at all life cycle
stages
◦ Moreover, it can be difficult to get an accurate assessment based on
incomplete designs and prototypes
Expert-based Evaluation Techniques
Expert analysis characteristics:
◦ Designer or HCI expert assesses a design based on known/standard
cognitive principles, design principles or empirical results
◦ Expert analysis methods can be used at any stage in the life cycle
◦ Expert analysis methods are relatively cheap
◦ Expert analysis methods, however, do not assess the actual use of
the system

Examples of expert analysis methods:

◦ Heuristic Evaluation (HE)
◦ Cognitive Walkthrough (CW)
Heuristic Evaluation
❑Heuristic Evaluation (HE) was proposed by Nielsen and
Molich
❑In HE, experts scrutinize the interface and its elements
against established usability heuristics [covered in a previous
lesson]
❑The experts should have some background knowledge or
experience in HCI design and usability evaluation
Heuristic Evaluation – the process
❖3 to 5 experts are considered to be sufficient to detect most of the usability problems

❖The enlisted experts are provided with the proper roles (and sometimes scenarios to
use) to support them when interacting with the system/prototype under evaluation

❖They then evaluate the system/prototype individually

❖This is to ensure an independent and unbiased evaluation by each expert

❖They assess the user interface as a whole and also the individual user interface
elements

❖The assessment is performed with reference to a set of established usability
principles

❖When all the experts are through with the assessment, they come together and
compare and appropriately aggregate their findings
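
The final aggregation step can be sketched in code. This is only an illustration: the findings, the evaluator labels E1 to E3, and the 0-4 severity scale are assumptions made for the example and do not come from these slides. It groups the individual reports by problem and ranks the merged list by mean severity.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical individual findings (not real data):
# (evaluator, problem description, heuristic violated, severity 0-4).
findings = [
    ("E1", "No undo after delete", "User control and freedom", 4),
    ("E2", "No undo after delete", "User control and freedom", 3),
    ("E3", "Jargon in error dialog", "Match between system and the real world", 2),
    ("E1", "Jargon in error dialog", "Match between system and the real world", 3),
    ("E2", "Inconsistent button labels", "Consistency and standards", 2),
]


def aggregate(findings):
    """Merge the experts' separate reports into one list, most severe first."""
    grouped = defaultdict(list)
    for evaluator, problem, heuristic, severity in findings:
        grouped[(problem, heuristic)].append((evaluator, severity))

    merged = []
    for (problem, heuristic), reports in grouped.items():
        merged.append({
            "problem": problem,
            "heuristic": heuristic,
            "found_by": sorted(e for e, _ in reports),
            "mean_severity": mean(s for _, s in reports),
        })
    # Rank by mean severity so the most serious problems come first.
    merged.sort(key=lambda row: row["mean_severity"], reverse=True)
    return merged


report = aggregate(findings)
for row in report:
    print(row)
```

Averaging severity is one possible aggregation rule; a team might equally discuss each problem and agree on a rating.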
Cognitive Walkthrough
❖Cognitive Walkthrough (CW) was proposed by Polson et
al.
❖CW evaluates how well a design supports the user in
learning the task to be performed [primarily through
exploration, i.e. hands-on]
❖CW is usually performed by an expert in cognitive
psychology
❖The expert ‘walks through’ the design [i.e. steps through
each action of some known/representative task] to identify
potential problems
Cognitive Walkthrough - requirements
Four things are required in order to perform
a CW:
❖ a specification or prototype of the system
❖ a description of the task the user is to
perform
❖ a complete, written list of the actions
constituting the task
❖ a description of the user (including the
level of experience and knowledge)
Cognitive Walkthrough – the process
With the foregoing information, the evaluator steps through
each of the actions trying to answer the following 4 questions:
◦ Is the effect of the action the same as the user's goal at that
point? [what the action does should be what the user
intends]
◦ Will users see that the action is available when they want it?
[visibility at that time]
◦ Once users have found the correct action, will they recognize
it as the one they need? [clear, effective representation of the
action]
◦ After the action is taken, will users understand the feedback
they get? [effective confirmation that the action has been
taken]
Cognitive Walkthrough – the process
Forms are used to guide analysis e.g.
◦cover form [records the four requirements above, plus
date, time, and evaluators of the CW],
◦answer form [for answering the four questions
above],
◦usability problem report [describes any negative
answers/problems and their severity, e.g.
frequency of occurrence and seriousness, plus
date, time, and evaluators]
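
The answer form and problem report can be sketched as a small data structure. This is an illustrative sketch only: the StepRecord class, the calendar task, and the notes are hypothetical and not part of the CW method as published. Each action gets four yes/no answers, and every "no" becomes an entry in the problem report.

```python
from dataclasses import dataclass

# The four walkthrough questions, in the order used on the slide.
QUESTIONS = (
    "Is the effect of the action the same as the user's goal at that point?",
    "Will users see that the action is available?",
    "Will users recognize the action as the one they need?",
    "Will users understand the feedback after the action is taken?",
)


@dataclass
class StepRecord:
    """One row of a (hypothetical) answer form."""
    action: str
    answers: tuple  # four booleans, one per question, in order
    notes: str = ""


def problem_report(steps):
    """Return (action, question, notes) for every negative answer."""
    problems = []
    for step in steps:
        for question, ok in zip(QUESTIONS, step.answers):
            if not ok:
                problems.append((step.action, question, step.notes))
    return problems


# Hypothetical task: schedule a meeting in a calendar application.
walkthrough = [
    StepRecord("Click 'New event'", (True, True, True, True)),
    StepRecord("Enter the start time", (True, False, True, True),
               "Time field is hidden behind an 'Advanced' tab"),
    StepRecord("Click 'Save'", (True, True, True, False),
               "No confirmation is shown after saving"),
]

problems = problem_report(walkthrough)
for action, question, notes in problems:
    print(f"{action}: {question} -> {notes}")
```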
Reading Assignment: Model-Based Evaluation
For instance: dialog models (such as State Transition
Networks) can be used to evaluate dialog problems in a
user interface, e.g. unreachable states, circular dialogs, etc.
Note: Model-based evaluation is sometimes classified under
expert-based evaluation techniques
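
As a rough illustration of checking a dialog model, the sketch below represents a State Transition Network as a dictionary and uses a breadth-first search to find unreachable states. The dialog itself (Start, Menu, PrintDialog, Printing, Help) is a made-up example, and the function names are hypothetical.

```python
from collections import deque

# Hypothetical dialog model: state -> {user action: next state}.
stn = {
    "Start": {"open menu": "Menu"},
    "Menu": {"choose print": "PrintDialog", "cancel": "Start"},
    "PrintDialog": {"ok": "Printing", "cancel": "Menu"},
    "Printing": {"done": "Start"},
    "Help": {"close": "Menu"},  # no transition leads into this state
}


def reachable_states(stn, start):
    """Breadth-first search over the transitions, beginning at `start`."""
    seen = {start}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for next_state in stn.get(state, {}).values():
            if next_state not in seen:
                seen.add(next_state)
                queue.append(next_state)
    return seen


def unreachable_states(stn, start):
    """States defined in the model that no sequence of actions can reach."""
    return set(stn) - reachable_states(stn, start)


print(unreachable_states(stn, "Start"))
```

Here the Help state is reported because the model defines it but gives the user no way to get there.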
User-Based Evaluation
User-based evaluation is evaluation through user
participation, i.e. evaluation that involves the users:
the people for whom the system is intended
User-based evaluation techniques include:
◦ experimental methods
◦ observational methods
◦ query techniques (e.g., questionnaires and interviews)
◦ physiological monitoring methods (e.g., eye tracking, measuring skin
conductance, measuring heart rate)

User-based methods can be conducted in the
laboratory and/or in the field
Usability Laboratory
User-Based Evaluation
Laboratory
Advantages:
◦ Specialist equipment available
◦ Uninterrupted environment
Disadvantages:
◦ Lack of context
◦ Difficult to observe several users cooperating
Appropriate:
◦ If system usage location is dangerous, remote or impractical
◦ For very constrained single-user tasks [to allow controlled
manipulation of use]
◦ E.g., military systems, airplane systems
User-Based Evaluation
Field or Working Environment
Advantages:
◦ Natural environment
◦ Context retained (though observation may alter it)
◦ Longitudinal studies possible
Disadvantages:
◦ Field challenges e.g., distractions, interruptions, movements,
danger, noise
Appropriate:
◦ Where context is crucial [especially for longitudinal studies]
◦ E.g., a customer management system used to manage customer preferences
User-Based Evaluation Techniques
User-based evaluation techniques
include:
◦experimental methods
◦observational methods
◦query techniques
◦physiological monitoring methods
Experimental Methods
Experimental methods are also called controlled
experiments
Controlled experiments are:
◦ considered to be the most rigorous of empirical
methods
◦ capable of providing empirical evidence to support a
particular claim or hypothesis
Note:
◦ Empirical research is a way of gaining knowledge by means of direct and indirect
observation or experience.
◦ Empirical evidence, also known as sensory experience, is the knowledge received
by means of the senses, particularly by observation and experimentation
Experimental Methods
What makes up the Experiment?
Participants
◦ Should match the expected users as closely as possible e.g.,
age, education, general computing experience, domain
knowledge, etc.
◦ Sample size of the participants should be large enough to be
considered representative of the population

Variables (two types: independent and
dependent)
Hypothesis
Experimental Methods
Example:
We want to find out whether users perform faster when using a
graphical user interface than when using a command-line interface
Independent variable (IV): interaction mode (with two levels: graphical
vs. command-line)
Dependent variable (DV): task completion time
Hypothesis: Users perform faster (DV) when using a graphical user
interface (IV level one) than when using a command-line interface (IV
level two)
Null hypothesis: There is no difference in performance (DV) when using
a graphical user interface (IV level one) or when using a command-line
interface (IV level two)
There are two conditions: Graphical (G) and Command-line (C)
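
A sketch of how the two conditions might be compared once data is collected. The completion times below are invented for illustration, and the sketch assumes a between-subjects design (each participant uses only one interface), which the slide does not specify. It computes Welch's t statistic and its degrees of freedom using only the standard library; the resulting t would then be compared against a critical value from a t-table (or passed to a statistics package) to decide whether to reject the null hypothesis.

```python
from math import sqrt
from statistics import mean, variance

# Invented task completion times in seconds (DV), one value per participant.
graphical = [41.0, 38.5, 45.2, 39.8, 43.1, 40.4]     # condition G
command_line = [47.3, 52.1, 44.9, 50.6, 49.0, 53.2]  # condition C


def welch_t(a, b):
    """Welch's t statistic and degrees of freedom for two independent samples."""
    var_a = variance(a) / len(a)  # squared standard error of mean(a)
    var_b = variance(b) / len(b)  # squared standard error of mean(b)
    t = (mean(a) - mean(b)) / sqrt(var_a + var_b)
    df = (var_a + var_b) ** 2 / (
        var_a ** 2 / (len(a) - 1) + var_b ** 2 / (len(b) - 1)
    )
    return t, df


t, df = welch_t(graphical, command_line)
print(f"mean G = {mean(graphical):.1f} s, mean C = {mean(command_line):.1f} s")
print(f"t = {t:.2f}, df = {df:.1f}")
```

A negative t here means the graphical group was faster on average; whether that difference is significant depends on the chosen significance level.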
Observational Methods
Observational methods include:
◦think aloud
◦cooperative evaluation
◦protocol analysis
◦post-task walkthroughs
Observational Methods
Think aloud
User is observed performing task
User is asked to describe what s/he is doing and why, what s/he
thinks is happening, etc.
Advantages:
◦ Simplicity - requires little expertise
◦ Can provide useful insight
◦ Can show how system is actually used

Disadvantages:
◦ Subjective [really depends on the user]
◦ Selective [out of many things, the user may choose what to describe]
◦ Act of describing may alter task performance
Observational Methods
Cooperative Evaluation
Reading Assignment:
◦Describe this evaluation method
Observational Methods
Protocol Analysis
Paper and pencil: cheap, limited to writing speed
Audio: good for think aloud, difficult to record sufficient information to identify exact
actions in later analysis, difficult to match with other protocols ('synchronization')
Video: accurate and realistic, needs special equipment, obtrusive
Computer logging: automatic and unobtrusive, large amounts of data difficult to
analyze
User notebooks: coarse and subjective, useful insights, good for longitudinal studies
Note:
Mixed use in practice
Audio/video transcription difficult and requires skill
Some automatic support tools available e.g., EVA (Experimental Video Annotator),
Observer Pro (from Noldus), Workplace project (Xerox PARC), etc.
Observational Methods
Post-task Walkthrough

Transcript played back to participant for comment, i.e. the user
reacts to actions after the event
Used to fill in intention, i.e. reasons for actions performed
and alternatives considered
It is also necessary where think aloud is not possible
Advantages:
◦ Analyst has time to focus on relevant incidents
◦ Avoids excessive interruption of task

Disadvantages:
◦ Lack of freshness
◦ May be post-hoc interpretation of events
Query Techniques
Query techniques
◦Questionnaires
◦Interviews
Physiological Monitoring Methods
Physiological Monitoring Methods such as eye tracking,
measuring skin conductance, measuring heart rate etc.

Example: Eye-tracking
Head or desk mounted equipment tracks the position of the eye
Eye movement reflects the amount of cognitive processing a
display requires
Measurements include: fixations, scan paths, etc. For instance:
◦ number of fixations
◦ duration of fixation
◦ scan paths: moving straight to a target with a short fixation at the target is
optimal
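
The measurements above can be computed from a list of fixations. The sketch below is illustrative only: the fixation records (screen coordinates in pixels, durations in milliseconds) are invented, and real eye trackers export data in their own formats.

```python
from math import hypot

# Invented fixation records: (x in pixels, y in pixels, duration in ms).
fixations = [
    (100, 100, 180),
    (400, 100, 220),
    (400, 500, 300),
]


def fixation_count(fixations):
    """Number of fixations recorded."""
    return len(fixations)


def mean_fixation_duration(fixations):
    """Average fixation duration in milliseconds."""
    return sum(duration for _, _, duration in fixations) / len(fixations)


def scan_path_length(fixations):
    """Total straight-line distance between consecutive fixations, in pixels."""
    return sum(
        hypot(x2 - x1, y2 - y1)
        for (x1, y1, _), (x2, y2, _) in zip(fixations, fixations[1:])
    )


print(fixation_count(fixations))
print(round(mean_fixation_duration(fixations), 1))
print(scan_path_length(fixations))
```

A shorter scan path with fewer fixations before reaching the target would suggest a more direct search, in line with the "moving straight to a target" point above.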
Physiological Monitoring Methods
Physiological Measurements
Emotional response linked to physical changes
These may help determine a user’s reaction to a user interface
Measurements include: heart, sweat, muscle, brain. For instance:
◦ heart activity: e.g. blood pressure, volume and pulse.
◦ activity of sweat glands: Galvanic Skin Response (GSR)
◦ electrical activity in muscle: electromyogram (EMG)
◦ electrical activity in brain: electroencephalogram (EEG)

There is some difficulty in interpreting these physiological
responses; more research is needed
Choosing an Evaluation Method
Factors that can influence the choice:
◦ when in process: design vs. implementation
◦ style of evaluation: laboratory vs. field
◦ how objective: subjective vs. objective
◦ type of measures: qualitative vs. quantitative
◦ level of information: high level vs. low level
◦ level of interference: obtrusive vs. unobtrusive
◦ resources available: time, subjects, equipment, expertise
Reading Assignment
In user-centered design, what are mental models?
◦ What is a designer mental model and a user mental model?
In HCI, what is a Loop of Interaction?
◦ What are the main aspects in loop of interaction?
Why is it crucial to include error messages during the
design of user interfaces?
Why is the correct use of color important in user
interface design?
What are the characteristics of a poorly designed
website?
