Capacity and performance management_ Practice Guide
Capacity and performance management_ Practice Guide
This document provides practical guidance for the capacity and performance
management practice.
Table of Contents
1. About this document
2. General information
7. Important reminder
8. Acknowledgements
the practice’s processes and activities and their roles in the service value chain
2. General information
Key message
This practice ensures that the requirements for the capacity and performance of services
and resources are understood and fulfilled efficiently, in line with the organization’s
strategy and commitments. To achieve this, the practice is applied throughout the
organization’s product and service lifecycle, from ideation to operations. This practice is
extremely important when products and services are planned and designed; decisions
made at these stages will affect performance-level and other constraints, as well as the
organization’s ability to monitor and manage these aspects.
Definition: Performance
Service performance is usually associated with the rate of service transactions and the
time needed to fulfil service transactions at a given level of demand. Service performance
depends on service capacity; the maximum throughput that a configuration item (CI) or
service can deliver. The specific metrics that are used will depend on the technology and
business nature of the service or CI.
When services are provided to thousands or millions of people, there is usually no single
generic agreement on the service performance with customers. However, overall service
performance is critical for the service provider.
2.3 Scope
The capacity and performance management practice ensures that services deliver
agreed levels of performance to meet the needs of customers and users in a cost-
effective way. To achieve this, the capacity and performance management practice
includes the definition, measurement, analysis, and improvement of the capacity and
performance of services, products, and components. It is a centre of expertise for
capacity-related matters and supports other service management practices.
The scope of the capacity and performance management is very broad. Many practices
directly or indirectly contribute to service performance. Table 2.1 lists activities which are
closely related to capacity and performance management. It is important to remember
that ITIL practices are merely collections of tools to use in the context of value streams
and should be combined as necessary depending on the specific organizational, service,
and customer contexts.
Testing the capacity and performance controls during Service validation and
service transition testing
A complex functional component of a practice that is required for the practice to fulfil
its purpose.
A practice success factor (PSF) is more than a task or activity, as it includes components
from all four dimensions of service management. The nature of the activities and
resources of PSFs within a practice may differ, but together they ensure that the practice
is effective.
The capacity and performance management practice includes the following PSFs:
Determining performance and capacity criteriaThe line between high and low
performance should be clearly defined. The following factors should be considered
when determining service performance criteria:
scale factor: service performance degradation generally means that delays are
experienced by significant numbers of users, not individuals.
Choosing the right set of capacity and performance metrics Metrics should reflect
how service degradation may affect the service provider and customers.
Two of the most important objectives of the capacity and performance management
practice are to ensure sufficient capacity and performance monitoring and translate
monitoring data into service performance information.
Incident records can be sources of service disruption data. However, it is often difficult to
obtain reliable performance and capacity data based on these, especially for user-
reported incidents, and to align it with the agreed service performance metrics.
More reliable sources of performance and capacity data are infrastructure monitoring
tools. However, although these can work well for measuring resource-provision type
services, it is almost impossible to correctly measure the performance of service
transactions based solely on the infrastructure monitoring data. Tools such as real user
monitoring and business transaction monitoring can help to overcome this issue.
The capacity and performance management practice ensures that risks will be treated
effectively by:
assessing the impacts of products’ and services’ capacity and performance on the
user and customer experience
monitoring and controlling capacity and performance risks on an ongoing basis and
optimizing risk management activities within the scope of the practice.
Key metrics for the capacity and performance management practice are mapped to its
PSFs. They can be used as KPIs in the context of value streams in order to assess the
contribution of the practice to the effectiveness and efficiency of those value streams.
Some examples of key metrics are given in Table 2.2.
The correct aggregation of metrics into complex indicators will make it easier to use the
data for the ongoing management of value streams, and for the periodic assessment and
continual improvement of the capacity and performance management practice. There is
no single best solution. Metrics will be based on the overall service strategy and priorities
of an organization, as well as on the goals of the value streams to which the practice
contributes.
improve
obtain/build
plan.
The contribution of the capacity and performance management practice to the service
value chain is shown in Figure 3.1.
Figure 3.1 Heat map of the contribution of the capacity and performance
management practice to value chain activities
3.2 Processes
Each practice may include one or more processes and activities that may be necessary to
fulfil the purpose of that practice.
Definition: Process
A set of interrelated or interacting activities that transform inputs into outputs. A
process takes one or more defined inputs and turns them into defined outputs.
Processes define the sequence of actions and their dependencies.
The capacity and performance management practice activities form two processes:
SLAs
Figure 3.2 Workflow of the establishing capacity and performance control process
This process may vary, depending on the type of services and service components to
which it is applied. Table 3.2 demonstrates how the activities may vary for a modern
cloud-enabled services and for the first tier of service support staff.
It is important to differentiate
between the above
requirements and short-term
service demand spikes (such as
an increased user flow to a
website following a marketing
campaign) that in a cloud
environment can be detected
and satisfied automatically via
specialized capacity extension
tools, and do not require
thorough analysis.
This process may vary, depending on the type of services and service components to
which it is applied. Table 3.4 demonstrates how the activities may vary for modern cloud-
enabled services and for a first tier of technical support staff.
Planning and It can be tempting to use the Other practices can request the
designing virtually unlimited scalability of capacity and performance
service computing power in the cloud management practice help with
capacity and to tackle the volatile and specific calculations upon staff
performance growing demand for services. numbers and capabilities, and
However, it may be more with planning for the automation
prudent to alter the underlying of manual support tasks. The
application, middleware, and outputs of these efforts are
load balancing architecture improvement initiatives. For
when the demand hits a certain example, practitioners can
threshold (for example, altering suggest automated diagnostic
the network design to cater to data being harvested from end
users on a newly acquired user devices in order to save time
geographical market). spent on user questionnaires.
Capacity practitioners possess
the necessary expertise to
suggest these optimizations to
avoid the excessive service costs
that are associated with linear
scaling.
4 O i i d l
4. Organizations and people
Roles are described in the context of processes and activities. Each role is characterized
with a competency profile based on the model shown in Table 4.1.
Establishing
capacity and
performance
control
Service capacity MT
and performance
Capacity and Excellent analytical skills
analysis
performance
manager Knowledge of methods
and techniques, such as
Service owner Fault-Tree Analysis,
Component Failure
Technical Impact Analysis, and so
expert on
Good understanding of
the possible business
impacts of service
outages
Reporting on CA
service capacity
Service owner Knowledge of
and performance
agreements and
Relationship expectations
manager
Understanding of the
Customer consumer context
Communication and
negotiation
Planning and TM
designing service
Capacity and Good understanding of
capacity and
performance resilience options
performance
manager
Awareness of existing
Service controls
designer
Awareness of technology
Technical available on the market
expert
Good understanding the
Architecture possible business
manager impacts of service
outages
Identifying CTA
service capacity
Service or Business analysis
and performance
product owner
requirements
Good knowledge of
Relationship business activity
manager patterns, throughputs,
Service and markets that
designer generate demand
Communication and
coordination
Agreeing service CA
capacity and
Service owner Communication and
performance
negotiation, and ability
requirements
Relationship to advocate for
manager improvements
Determining TM
capacity and
Capacity and Good understanding of
performance
performance monitoring tools and
measurement
manager techniques
requirements
Service
designer
Technical
expert
Designing CM
capacity and
Capacity and Communication and
performance
performance negotiation
metrics and
manager
reports
Report and dashboard
Service owner design skills
Relationship
manager
IT quality
manager
Where service providers are responsible for a limited number of services and
components (such as a service integrator function), there can be a capacity and
performance manager. This role is accountable for coordinating practices, functions, and
organizations to ensure cost-efficient service capacity and sufficient levels of service
performance.
Business and technical knowledge is pivotal to the success of this practice, as well as the
service provider’s staff’s ability to plan, monitor, and report on the performance of
services and components.
5 I f i d h l
5. Information and technology
component-based reports
service-based reports
This information may take various forms. The key inputs and outputs of the practice are
listed in section 3.
In most cases, the capacity and performance management practice can significantly
benefit from automation. Where this is possible and effective, it may involve the solutions
outlined in Table 5.1.
Establishing
capacity and
performance
control
Very few services are delivered using only an organization’s own resources. Most, if not all,
depend on other services. These are often provided by third parties (see section 2.4 of ITIL
Foundation: ITIL 4 Edition for a model of a service relationship). Relationships and
dependencies introduced by supporting services are described in the practice guides for
service design, supplier management, and SLM.
As the service integration model becomes common within modern corporate service
consumer environments, the importance of orchestrating the service performance
becomes apparent. Where multiple external service providers are responsible for
different service components, or even for entire service offerings, the end-user
experience is at risk of being overlooked (especially when it comes to such less-tangible
impressions as ‘waiting for the system to unfreeze’). A service integration and
management body should be responsible for maintaining the end-user focus of all
efforts relating to service capacity and performance by multiple service providers.
Where organizations aim to ensure fast and effective capacity and performance
management, they usually try to agree to close cooperation with their partners and
suppliers, removing formal bureaucratic barriers in communication, collaboration, and
decision-making. All parties in such relationships should aim for mutual transparency
and visibility of changes that may affect the other parties (see the supplier management
practice guide for more information).
7. Important reminder
Most of the content of the practice guides should be taken as a suggestion of areas that
an organization might consider when establishing and nurturing their own practices.
The practice guides are catalogues of topics that organizations might think about, not a
list of answers. When using the practice guides, organizations should always follow the
ITIL guiding principles:
focus on value
More information on the guiding principles and their application can be found in section
4.3 of ITIL® Foundation: ITIL 4 Edition.
8. Acknowledgements
AXELOS Ltd is grateful to everyone who has contributed to the development of this
guidance. These practice guides incorporate an unprecedented level of enthusiasm and
feedback from across the ITIL community. In particular, AXELOS would like to thank the
following people.
8.1 Authors
Konstantin Naryzhny
8.2 Reviewers
Roman Jouravlev