Yes You Can Measure Software Developer Productivity
Yes You Can Measure Software Developer Productivity
August 2023
Compared with other critical business functions can require significant, long-term investment.
such as sales or customer operations, software Furthermore, the landscape of software development
development is perennially undermeasured. The is changing quickly as generative AI tools such
long-held belief by many in tech is that it’s not as Copilot X and ChatGPT have the potential
possible to do it correctly—and that, in any case, to enable developers to complete tasks up to two
only trained engineers are knowledgeable enough times faster.
to assess the performance of their peers. Yet
that status quo is no longer sustainable. Now that To help overcome these challenges and make
most companies are becoming (to one degree this critical task more feasible, we developed an
or another) software companies, regardless of approach to measuring software developer
industry, leaders need to know they are deploying productivity that is easier to deploy with surveys
their most valuable talent as successfully or existing data (such as in backlog management
as possible. tools). In so doing, we built on the foundation
of existing productivity metrics that industry
There is no denying that measuring developer leaders have developed over the years, with an
productivity is difficult. Other functions can eye toward revealing opportunities for
be measured reasonably well, some even with just performance improvements.
a single metric; whereas in software development,
the link between inputs and outputs is considerably This new approach has been implemented at nearly
less clear. Software development is also highly 20 tech, finance, and pharmaceutical companies,
collaborative, complex, and creative work and and the initial results are promising. They include
requires different metrics for different levels (such the following improvements:
as systems, teams, and individuals). What’s more,
even if there is genuine commitment to track — 20 to 30 percent reduction in customer-
productivity properly, traditional metrics can require reported product defects
systems and software that are set up to allow
more nuanced and comprehensive measurement. — 20 percent improvement in employee
For some standard metrics, entire tech stacks experience scores
and development pipelines need to be reconfigured
to enable tracking, and putting in place the necessary — 60-percentage-point improvement in customer
instruments and tools to yield meaningful insights satisfaction ratings
— What are the impediments to the engineers Another critical dimension to recognize is what
working at their best level? the various metrics do and do not tell you. For
example, measuring deployment frequency or lead
— How much does culture and organization affect time for changes can give you a clear view of certain
their ability to produce their best work? outcomes, but not of whether an engineering
organization is optimized. And while metrics such
— How do we know if we’re using their time on as story points completed or interruptions can
activities that truly drive value? help determine optimization, they require more
investigation to identify improvements that might
— How can we know if we have all the software be beneficial.
engineering talent we need?
In building our set of metrics, we looked to expand
on the two sets of metrics already developed by
Understanding the foundations the software industry. The first is DORA metrics,
named for Google’s DevOps research and
To use a sufficiently nuanced system of measuring
assessment team. These are the closest the tech
developer productivity, it’s essential to understand
sector has to a standard, and they are great at
the three types of metrics that need to be tracked:
measuring outcomes. When a DORA metric returns
those at the system level, the team level, and the
a subpar outcome, it is a signal to investigate what
individual level. Unlike a function such as sales, where
has gone wrong, which can often involve protracted
a system-level metric of dollars earned or deals
sleuthing. For example, if a metric such as deployment
closed could be used to measure the work of both
frequency increases or decreases, there can be
teams and individuals, software development
multiple causes. Determining what they are and how
is collaborative in a distinctive way that requires
to resolve them is often not straightforward.
On top of these already powerful metrics, our Inner/outer loop time spent. To identify specific
approach seeks to identify what can be done to areas for improvement, it’s helpful to think of the
Web 2023
MeasuringDeveloperProductivity
Exhibit 1 of 2
Exhibit 1
Team Lead time for changes Story points completed Quality of documentation
level Change failure rate Handoffs Developer Velocity Index
Time to restore service benchmark4
Code-review velocity Contribution analysis
1
Google’s DevOps research and assessment team, which developed these outcome metrics.
2
Satisfaction and well-being, performance, activity, communication and collaboration, and efficiency and flow; GitHub and Microsoft Research developed these
metrics, which aim to look at developer well-being as a measurement at the individual level.
3
Nonexhaustive.
4
Benchmarks an organization’s technology, working practices, and organizational enablement; see Shivam Srivastava, Kartik Trehan, Dilip Wagle, and Jane
Wang, “Developer Velocity: How software excellence fuels business performance,” McKinsey, Apr 20, 2020.
Web 2023
MeasuringDeveloperProductivity
Exhibit
Exhibit 2 of22
Software development can be broadly divided into two sets, or loops, of tasks;
the less time spent on less fulfilling, outer-loop activities, the better.
Deploy
at scale
Build
Test
Meetings Integrate
1
Activities listed are nonexhaustive.
1
To read more about McKinsey’s DVI survey, see Shivam Srivastava, Kartik Trehan, Dilip Wagle, and Jane Wang, “Developer velocity: How
software excellence fuels business performance,” McKinsey, April 20, 2020; and Chandra Gnanasambandam, Neha Jindal, Shivam Srivastava,
and Dilip Wagle, “Developer velocity at work: Key lessons from industry leaders,” McKinsey, February 22, 2021.
2
Klemens Hjartar, Peter Jacobs, Eric Lamarre, and Lars Vinter, “It’s time to reset the IT talent model,” MIT Sloan Management Review,
March 5, 2020.
The increase in remote work and its popularity Assess your systems. Because developer
among developers is one overriding factor. productivity has not typically been measured at
Scan • Download • Personalize
Developers have long worked in agile teams, the level needed to identify improvement
collaborating in the same physical space, and some opportunities, most companies’ tech stacks will
technology leaders believe that kind of in-person require potentially extensive reconfiguration.
teamwork is essential to the job. However, the For example, to measure test coverage (the extent
digital tools that are so central to their work made it to which areas of code have been adequately
easy to switch to remote work during the pandemic tested), a development team needs to equip their
lockdowns, and as in most sectors, this shift is hard codebase with a tool that can track code executed
to undo. As remote and hybrid working increasingly during a test run.
becomes the norm, organizations will need to rely on
broad, objective measurements to maintain Build a plan. As with most analytics initiatives,
confidence in these new working arrangements and getting lost in mountains of data is a risk. It’s
ensure they are steadily improving the function important to start with one area that you know will
that could easily determine their future success or result in a clear path to improvement, such as
failure. The fact that the markets are now putting identifying friction points and bottlenecks. Be
greater emphasis on efficient growth and ROI only explicit about the scope of such a plan, as even the
makes it more important than ever to know how best approaches, no matter how comprehensive,
they can optimize the performance of their highly will not be a silver bullet.
valued engineering talent.
Remember that measuring productivity is
Another key driver of this need for greater visibility is contextual. The point is to look at an entire system
the rapid advances in AI-enabled tooling, especially and understand how it can work better by improving
large-language models such as generative AI. These the development environment at the system, team,
are already rapidly changing the way work is done, or individual level.
which means that measuring software developers’
productivity is only a first step to understanding how No matter the specific approach, measuring
these valuable resources are deployed. productivity should ideally create transparency and
insights into key improvement areas. Only then
But as critical as developer productivity is can organizations build specific initiatives to drive
becoming, companies shouldn’t feel they have to impact for both developer productivity and
embark on a massive, dramatic overhaul almost experience—impact that will benefit both those
individuals and the company as a whole.
Chandra Gnanasambandam and Martin Harrysson are senior partners in McKinsey’s Bay Area office, where Alharith
Hussin and Shivam Srivastava are partners; and Jason Keovichit is an associate partner in the New York office.
The authors wish to thank Pedro Garcia, Diana Rodriguez, and Jeremy Schneider for their contributions to this article.