ITIL Problem Management
ITIL Problem Management
In their ITIL 4 framework, Axelos Ltd define the practice of problem man-
agement as being distinct from the incident management practice. Reactive
problem management involves responding to incidents which have already
occurred in order to understand the underlying causes and address these.
Proactive problem management is about identifying risks and responding
to those risks before they manifest themselves in incidents.
PROBLEM CONTROL
ITIL 4 recommends that a key aspect of problem management is the
process developed for controlling and managing problems. Each prob-
lem which is identified (either through reactive or proactive problem
management) should be recorded in a problem record within an ITSM
tool or similar system. Problem records should be linked to related
resources. In reactive problem management, the related incidents
should be linked to the problem record. Configuration Items (CIs) such
as desktops, servers, printers and software assets should also be linked
to the problem record as required. The problem record is a way to:
• collate the information
• prioritise the effort
• coordinate who is involved
52
Problem Control
53
ITIL Problem Management
staff effort and if the impact of this problem had been less, this
might not have been considered cost effective.
• Resolving the root cause: Having evaluated the optimum means of
fixing the root cause, this needs to be added to the work queue
for the relevant teams, appropriately prioritised alongside their
other work. Adequate testing of any changes to the system need
to be done before the fix is implemented and normal change
enablement processes followed. Once the fix is in place, the
result on users who have been affected needs to be evaluated.
Sometimes the fix at the server end will not resolve the issue
for the end users, who may also need to make a change on
their desktops (e.g. clearing the cache). If users are still using a
workaround, they need to be notified that the permanent fix is
now in place. The Known Error may be removed from the Service
Desk list of current problems once this has been completed.
• Long-term monitoring: Unlike incidents which should be marked
as resolved as soon after resolution as possible, a problem record
will typically be left in a semi-open state for a period of time in
order to assess whether the fix which has been applied has been
effective. Not all fixes address all issues. If the incidents reoccur,
then the problem record should be re-activated and moved back to
the identification stage. However, it should be noted that it is often
the case that the incidents for two related problems will all be
linked to the first problem record. If there is evidence that the first
problem has been successfully fixed, but that a second problem
exists with a different root cause, then a new problem record
should be created and the relevant incidents moved across. As a
general rule of thumb, an incident should not be linked to two
problem records as there should not be two independent problems
causing it (as distinct from one problem with multiple root causes).
• Closed: a problem record which has been monitored for a reasonable
length of time, with no recurrences may be marked as closed.
KNOWLEDGE MANAGEMENT
One key aspect of both proactive problem management and reactive
problem management is knowing how data is meant to flow between
55
ITIL Problem Management
SUMMARY
A formal practice and process for problem management, such as
the ITIL 4 practice, is a good way of methodically keeping track of
problems.
Note
1 https://support.hpe.com/hpesc/public/docDisplay?docId=emr_
na-a00092491en_us
56