Tree Testing
Tree Testing
• Page Laubheimer
on August 6, 2023
Topics:
• Information Architecture,
• Navigation,
• Research Methods
On This Page:
Select a Tool
You could conduct a tree test using a paper prototype (or any clickable
prototyping tool), but a service designed specifically for tree testing will
vastly expedite the data analysis and is well worth
it. Userzoom and Treejack are both good options for conducting tree
testing.
Prepare your tree in a spreadsheet, where you can easily visualize and edit
it, then copy and paste the entire hierarchy into your tree-testing tool. The
spreadsheet should be formatted with your homepage in the top cell of
Column A, then lower levels listed out in columns from left to right. Make
sure to list only one category on each row, so that your levels will be
correctly parsed when you import the hierarchy.
This spreadsheet illustrates the tree, or menu hierarchy, for the New
Mexico state- government website. Each category appears on a separate
row, and subcategories are placed in columns to the right of the parent
category which contains them.
Once you have pasted your hierarchy into the testing tool, the categories
are parsed and used to automatically create a clickable menu hierarchy in
which each category can be expanded to show the corresponding
subcategories.
Tree-Testing Tasks
The tasks you ask users to complete are just as important as the tree itself.
Here are some typical task types:
• Resource-finding tasks to key business goals, where users must
locate important products or services) or important information that
may not have a dedicated page in the navigation
• Sleight-of-hand tasks, where users must find a resource that isn’t
all that important or commonly needed, but that helps assess
the information scent of a parent category. For example, on
an intranet tree-test project, I asked participants to find information on
their company’s charitable-donation match program –– not because
this was a common resource, but because I wanted to assess
potential category names for the HR department (People, HR,
Policies and Procedures, etc.).
• Potential problem areas, such as new categories proposed by
stakeholders or participants in a card sort, or even categories where
there was a lot of disagreement between participants in a card-
sorting study
• Label or location comparisons: alternate labels or locations for the
same category
• Warmup task: an easy task at the beginning meant to warm up your
participants to the test procedure and which can be used in
unmoderated tests to quickly screen out cheaters. If participants get
this task wrong, they probably were not paying attention.
For each task you write, you should also define the correct answer(s),
corresponding to where the information is located within the tree. This
information will allow the testing tool to automatically calculate success
rates for each task.
This screen from the Userzoom tree-testing system is used to indicate
which category is the correct answer for a particular task.
Each task should test a category label by asking the user to find something
contained within that category. As with usability-testing tasks, tree-
testing task instructions should avoid using terms that give away the
answers. Preventing priming can sometimes be accomplished by
describing a scenario and motivation. However, keep in mind that users
may not read the instructions carefully and may easily miss important
details if they are buried in a lengthy story.
Here are a few different possible phrasings for evaluating the Starting a
Business category on the New Mexico state-government tree (depicted
above):
2. You are moving to Santa Fe next year, and once you arrive you would like to supplement your income by
opening a side business providing lawn-care services. Find out what regulations you will need to follow.
tha
3. You are considering opening a lawn-care service. See if there are any resources on this site that can help
Multiple Trees
Sometimes a tree test may involve several trees rather than just one. For
example, if you are considering different labels (i.e., word choices) for
the same category, you may want to test two different trees to compare
how the terms perform.
Tree testing can be used to rapidly test and iterate on ideas for
information architecture at the beginning of a project, as it doesn’t
require designing layouts, content, or interactions. Much like usability
testing, qualitative tree testing requires only a few participants to gather
insights for how to improve the navigation schema. It is often done as a
moderated study, to benefit from the ability to have a trained facilitator
probe when participants do interesting things. However, qualitative tree
testing is not appropriate for gathering statistics (such as success rate or
time on task).
• To pilot your study design and ensure that your tasks are
understandable and do not bias participants.
• To obtain insights about why users chose the answers they did,
which category labels are confusing, and ideas for how to address
these issues
Card sorting and tree testing are two key research methods that are
specific to information architecture. Many newcomers to information
architecture often misunderstand the purpose of card sorting, and use it
inappropriately, instead of a tree test.
Card sorting does not usually produce the exact categorization scheme you
should follow. For example, participants in a card sort often create a
generic category to hold a few items which don’t seem to fit anywhere else;
this is understandable, but if you were to include an “other stuff” category in
your menu, the same users would avoid it like the plague. (Website visitors
are notoriously reluctant to click on vague labels because they quite rightly
suspect they’ll have to do a lot of work to sift through the content.)
Tree testing can fit quite well in a few different stages of the product-
development process.
However, this format does not capture the full context of user
behavior (such as comments made while performing the task) and you
can’t ask personalized followup questions.
You can also partially compensate for the inability to ask followup questions
in unmoderated tree tests by including a short survey after the tree test.
Rather than asking users to recall any labels they found confusing, provide
them with a list of labels and ask them to check which were difficult to
understand. This question can be followed up with an open-ended
question inviting users to share any further comments and feedback, to
elicit unexpected assumptions or misunderstandings that may not be
apparent from the click history.