IBV - How To Create Business Value With AI
IBV - How To Create Business Value With AI
IBV - How To Create Business Value With AI
How to create
business value
with AI
12 stories from the field
How IBM
can help
B
Executive
summary
1
2
Introduction
It’s a matter of perception versus reality.
While headlines herald artificial intelligence (AI) as a panacea for mounting economic
malaise, executives are still left wondering: what are companies actually doing with
AI? What results are they achieving and how?
The IBM Institute for Business Value partnered with the MIT-IBM Watson AI Lab, inter-
viewing more than 35 organizations to help answer these and other questions. What we
learned is how business and technology experts involved in deep learning projects are
applying artificial intelligence in the real world of business to drive real value.
FIGURE 1
2016 2018 2020 2022
(estimated)
AI adoption*
2016–2022 25%
Piloting/implementing 22% 5%
15%
3
The pandemic narrowed, then accelerated, organiza- To get to that story and the challenges AI is helping to
tions’ adoption of AI. The number of companies that solve, we interviewed individuals involved in deep
were piloting AI use cases in the midst of the pandemic learning projects around the world. From April through
more than doubled from 2018—and recent data August 2021, we talked with business and technology
indicates their ranks continue to grow.2 experts in more than a dozen industries about their AI
goals, challenges, and learnings (see Figure 2).
While these numbers show an upward trend, they don’t
tell the full story that many business and technology
leaders need in order to benchmark their own organiza-
tions’ use of AI.
13 12 11
of our interviews
Respondents
12
Countries
13
Industries
12
Functional domains
4
Perspective
Myth 3
hype cycle that has become endemic to so many
emerging technologies. Unfortunately, these miscon-
ceptions often deter and distract organizations from
Cost reduction is
engaging with the more pragmatic realities of AI.
AI’s sweet spot
In the pages that follow, we debunk five of the most
prevalent AI myths by highlighting relevant insights
and practical examples from our interviews. The
observations and anecdotes that follow—curated Myth 4
from our discussions with more than 55 professionals
at more than 35 organizations on the front lines of Shortcuts don’t
AI—can help disentangle fact from fiction. They allow work in AI
a peek behind the curtain—a virtual check-in with
peers—as organizations seek to increase AI’s impact
and value. (Readers who want additional insight will
find 12 detailed case studies in the appendix).
Myth 5
AI only delivers
value for the
problem at hand
55
Myth 1 Myth 1
AI is a one-size-fits-
all proposition
Myth 2
If it isn’t deep learning,
AI is a one-size-fits-all
it isn’t real AI
Myth 3
proposition
Cost reduction is AI’s
sweet spot
Myth 4
Shortcuts don’t work Reality
in AI
Myth 5 Fit-for-purpose matters. AI-driven business
AI only delivers value
for the problem at hand improvements stem from many techniques.
Appendix
For example, one of those many AI techniques, deep learning, is often best suited
to problems related to underlying (often large) data sets in vision, language, and
other predictive models. From virtual assistants to fraud detection, deep learning
is changing the way we work and play. In these situations, traditional machine-
learning techniques may be less effective.
But not every business challenge or desired outcome is a fit for AI, despite the
hype that might make it appear so. Organizations first need to determine whether a
broader strategic initiative or a particular business problem is a candidate for AI
enablement, a topic we address more fully in “Rethinking your approach to AI.”
Companies can start with an assessment of their overall “data wealth,” as well as
examine discrete business problems.
6
6
Perspective
AI, machine
learning, and
deep learning
defined
Like stacked Matryoshka dolls, deep learning is a subset of machine learning, which is
AI a subset of artificial intelligence. These techniques are often complemented by
robotics, sensors and actuators from the Internet of Things, virtual interfaces, and
other adjacent technologies.
Machine
learning What is AI and how did it come about?
Deep AI allows computers to perform tasks that previously could have been done only by
learning humans. But where human capacity begins to plateau in terms of accuracy, speed, and
processing power, AI really begins to gain traction.
While AI hype is very much grounded in the 21st century, AI was actually born
decades ago in the mid-20th century. In 1955, two mathematics professors (from
Dartmouth and Harvard) and two research scientists (from Bell Labs and IBM)
suggested that a “2-month…study of artificial intelligence be carried out during the
[following] summer.” According to the proposal’s abstract, “an attempt will be made to
find how to make machines use language, form abstractions and concepts, solve kinds
of problems now reserved for humans, and improve themselves.”3
Thus, the first formal definition of artificial intelligence was created, with academia
and the enterprise working to build increasingly better AI ever since.
Today, we use machine learning systems for a variety of ends, to select the most relevant
results in response to a keyword search, to parse visual images, and more. Increasingly,
these AI applications make use of a class of techniques called deep learning.5
For complex problems where enough data is available, deep learning often delivers
greater performance than other machine-learning methods.
77
European clothing retailer Marketing platform boosts
uses AI to increase efficiency response rate with machine
and sustainability learning techniques
Demand prediction and selling efficiency have A marketing and advertising agency used machine-
consistently been central to the consumer goods and learning models to predict consumer receptivity to
retail industries; even incremental improvements can client campaigns. They included this capability as
impact the business dramatically. part of a data and analytics platform that served all
clients. A tiny improvement in accuracy at scale can
BESTSELLER, a clothing retailer, sought to increase be worth millions (USD) in additional sales, so the
the accuracy of its demand forecasts and predictions stakes are high.
to help ensure it sold as much of its clothing as
possible. At the time, it already sold 78% of the The agency discovered it could raise response rates
products it made—a relatively high performance in by 20%-30%, but doing so would also increase
the volatile world of fashion. But if BESTSELLER could compute costs to store, train, and process additional
increase the granularity of product attributes used in data and model parameters. Fortunately, a move to
its forecast algorithms, it could continue to improve the cloud enabled more visibility into their costs—
efficiency. providing greater insight into how to manage them.
As a result, the team was able to preserve the
When teams determined that traditional analytic response-rate lift while increasing the efficiency of
techniques had reached their limits, BESTSELLER their compute utilization and cutting processing costs
trained a convolutional neural network (CNN) from by about two-thirds.
images of its clothing. (A CNN is a class of artificial
neural network commonly applied to analyze visual
imagery.) Doing so allowed BESTSELLER to classify AI helps call centers,
its products based on additional features not food science, and more
otherwise included in its structured data sets.
Crédit Mutuel, a French cooperative banking group,
Feeding these outputs into its core forecasting engine
used deep learning extensively to assist human
increased selling efficiency to 82% and reduced the
call-center agents, saving tens of thousands of hours
design samples needed by 15%—welcome improve-
each month.
ments during the broader pandemic-related sales
downturn. The change also had a positive impact on In the same vein but a completely different industry,
sustainability as the company reduced its discounted, McCormick, an American food flavoring company,
donated, or dumped apparel. used AI to supplement the experience of junior food
scientists to help them perform equivalently to a
senior scientist with 20 years of expertise.
McCormick, an American food Other examples from our interviews highlight how
tailored uses of AI can solve distinct business
flavoring company, used AI to problems—across geographies, industries, and even
supplement the experience of functions. Often, the right approach is clearer after
junior food scientists to help them the right data set to solve the problem is chosen—as
highlighted by the BESTSELLER and marketing
perform equivalently to a senior agency examples.
scientist with 20 years of expertise.
8
Myth 1 Myth 2
AI is a one-size-fits-
all proposition
Myth 2
If it isn’t deep learning,
If it isn’t deep learning,
it isn’t real AI
Myth 3
it isn’t real AI
Cost reduction is AI’s
sweet spot
Myth 4
Shortcuts don’t work Reality
in AI
Myth 5 Large enterprises are solving discrete business
AI only delivers value for
the problem at hand problems and attaining meaningful business value
Appendix with a mix of data science, traditional machine
learning, deep learning, and preprocessing techniques.
Many of the advancements in AI research over the past decade have occurred in
the field of deep learning. The explosive growth of social media, search, retail,
streaming, and other B2C platforms with deep learning embedded throughout their
business models has given rise to the fallacy that if it isn’t deep learning, it isn’t AI.
In reality, deep learning is just one tool among many in an enterprise analytics
toolbox that enables AI (see Figure 3 on page 10).
Conceptually, concerns about the costs of deep learning may pose significant
challenges to the future direction and nature of AI research (see, “Will cost be deep
learning’s demise?” on page 11). Pragmatically, determining where deep learning
may fall short often occurs by comparing results in practice—usually in a proof-of-
concept or pilot.
9
FIGURE 3
Source: 2021 IBV AI Capability Survey, unpublished data. Q16A. What Machine Learning (ML) techniques
does your organization employ? Choose all that apply.
KPMG uses deep learning and The best approaches, however, were rule-based
machine learning, with accuracy exceeding 85%.
other analytics to help clients Automating these processes translated into a more
save millions cost-effective way to save millions (USD) in tax
expenses for a given client each year. One client was
The global tax, audit, and consulting firm, KPMG, able to secure an additional 40% tax credit to its R&D
initiated an internal hackathon to determine the best spend as a direct result of this approach.
approach to reducing the manual effort involved in
documenting clients’ R&D projects, investments, and
tax credits. Doing so could yield tangible business
value by helping reduce clients’ tax bills each year.
The firm found that accuracy ranged from 55% using
out-of-the box document discovery software (about
as effective as a manual keyword search) to greater
than 70% for deep learning.
10
Perspective
Will cost be
deep learning’s
demise?
While artificial neural networks have been around since the 1950s—surviving
famines of investment and focus during AI winters6—deep learning has been
basking in the summer sun since the late 2000s.
For example, OpenAI’s GPT-3 cost $3 million to develop and train, and Alphabet
subsidiary DeepMind’s AlphaGo is reported to have cost $35 million just to train.
With costs this significant (and rising quickly), the conundrum grows—how to
balance the need for bigger models, more data and training, as well as more
compute power, with the inherent business realities of budgets and efficiency.
Researchers will have to address this conundrum, or progress may languish.8
An IEEE Spectrum article summed it up: “While deep learning’s rise may have been
meteoric, its future may be bumpy,” as those research efforts unfold.9
In the meantime, organizations need to keep a careful eye on their own trade-offs
between cost and performance in using deep learning—especially relative to other
AI tools.
11
11
Using the CNN, the team extracted 50 topographical
Zzapp Malaria: Using AI for world
and other features from the images, using these
good, not just good business features in a traditional linear regression-based
approach to determine the likelihood of standing
Malaria caused an estimated 627,000 deaths in 2020, water. The performance was equivalent to earlier
with Africa accounting for 96% of all deaths.10 Zzapp results but provided much greater transparency
Malaria, 2021’s winner of the XPRIZE AI, creates about which factors were driving the prediction.
AI-powered approaches to eliminating malaria and This meant the results were more explainable to the
delivers them directly to the field through a dedicated team—and therefore, more transferable to places
mobile application. where terrain differed significantly. They used this
AI-driven success to inform adjustments to their
In a pilot, Zzapp Malaria’s convolutional neural
approach as they extended their reach to help reduce
network could analyze visual imagery to detect small
the incidence of malaria in other locales.
bodies of water—potential malaria-carrying mosquito
breeding grounds—not readily apparent from existing
satellite imagery. It achieved about 75% accuracy but
with limited visibility into the drivers underpinning the
predictions. While these were good results, they were
not good enough to scale to other locations.
12
13
Myth 1 Myth 3
AI is a one-size-fits-
all proposition
Myth 2
If it isn’t deep learning,
Cost reduction is
it isn’t real AI
Myth 3
AI’s sweet spot
Cost reduction is AI’s
sweet spot
Myth 4
Shortcuts don’t work Reality
in AI
Myth 5 Applying AI to business problem-solving can indeed
AI only delivers value
for the problem at reduce cost, but that’s not all it can do. Leading
hand
organizations are actively (and strategically) seeking
Appendix
competitive differentiation with AI, achieving top-line-
oriented process efficiencies, growth, and business
model innovation.
Cost matters, but growth, innovation, and societal good matter more. IBV research
shows organizations have consistently ranked customer-centered growth as the top
area where they see the greatest business impact from AI (see Figure 4 on page 15).
We wondered whether this top-line emphasis was more wish than reality, but as we
talked with executives during our research, we identified several companies who
are walking the talk.
14
14
FIGURE 4
AI value drivers
2016–2020
2016 2018 2020 AI value drivers
Companies are focused on Customer satisfaction enhancement
top-line, customer-centered
growth. Customer retention improvement
Headcount redeployment
Headcount reduction
15
Myth 1: Myth 4
AI is a on-size-fits-
all proposition
Myth 2:
If it isn’t deep learning,
Shortcuts don’t
it isn’t real AI
Myth 3:
work in AI
Cost reduction is AI’s
sweet spot
Myth 4:
Shortcuts don’t Reality
work in AI
Myth 5: While use cases for AI models vary by industry and
AI only delivers value
for the problem at function, a growing set of “off-the-shelf” foundation
hand
and pretrained models can provide a more cost-
Appendix
effective starting point for enterprise data scientists.
The IBM Institute for Business Value has been taking a systematic approach to
quantifying various trends in enterprise AI since 2016. One of the surprises in 2020
was the reemergence of “availability of technology” as a barrier to AI adoption after
it ceded top billing to “skills and other factors” in 2018 (see Figure 5 on page 17).
We asked ourselves: why is this barrier rearing its head again?
Our interpretation continues to be that organizations are finally realizing what they
thought it took to make AI technology work—workers with the right data skills—is
necessary but insufficient on its own. The many data scientists companies hired to
train various data sets dutifully did as they were asked. But, each business
problem—often approached with a different AI model than the last—seemed like
starting over. There was no easy way to take advantage of what had gone before.
16
16
FIGURE 5
AI barriers
2016–2020 2016 2018 2020 AI barriers
Regulatory constraints
Legal/security/privacy concerns
about use of data and information
Sources: “Shifting toward Enterprise-grade AI: Confronting skills and data challenges to realize value.” IBM Institute for Business Value. September
2018. https://www.ibm.com/thought-leadership/institutebusiness-value/report/enterpriseai. Figure 1, Barriers in implementing AI: 2016 versus
2018, The business value of AI, unpublished data. Q9. What are the top barriers your organization faces in implementing artificial intelligence?
Select top 5.
More recently, AI shortcuts are starting to help Alphabet’s BERT, OpenAI’s GPT-3) that can be used
organizations gain leverage from their AI-driven to address many challenges. Now even these may
solutions. What “off-the-shelf” is to software, have been surpassed by China’s Wu Dao 2.0, the
pretrained and foundation models are to AI. They can world’s first trillion-parameter exemplar.
be a more effective starting point for new AI projects.
Foundation models can deliver value in a few key
How? By helping organizations get up the curve ways:
without generating completely new data sets, instead – Improved economics: Amortizing costs across
leveraging AI knowledge gathered from solving one multiple use cases
problem to help solve related problems. Key to this
– Improved results: Greater accuracy from larger,
approach is transfer learning: repurposing a model
more robust data sets
originally trained for one task and applying it to a
different one—for example, a model for recognizing – New capabilities: Ability to bring together
cars can be applied to recognizing trucks. multimodal data more effectively.
Many different types of pretrained models have been But this isn’t always true. Adapting pretrained models
designed to solve one or a few specific business sometimes results in too large a drop in performance
problems, and there are a growing number of on new data. That is precisely the problem Boston
generalized, gargantuan models (for example, Scientific, a US-based medical device manufacturer,
faced—and solved.
17
Boston Scientific spends $50,000 The solution? First, they scaled down the problem by
focusing on smaller and narrower tasks. Then, they
to save $5 million leveraged existing off-the-shelf open source AI
models to address the redefined challenge. Finally,
Boston Scientific wanted to automate its stent- they used a smaller data set to fine-tune this system.
inspection process to improve accuracy when
searching for defects such as broken links or surface The result? The company reaped $5 million in direct
imperfections. Accurate inspections are critical for savings based on a modest budget of roughly
successful clinical outcomes. The US Food and $50,000, as well as accuracy that exceeded existing
Drug Administration regulates “escape rates” (the performance.
proportion of defective parts that might slip through
Business leaders who want to save time and money
the cracks) based on risk to patients.
with foundation and pretrained models need to
Eric Wespi, a data science manager at Boston remember that they work well for savings—but may
Scientific, explained, “Human visual inspection is not be the best fit when differentiation is the primary
often slow, expensive, and can present unwanted goal. With these models available to all—some of
quality risks.” The company has approximately 3,000 them open source—organizations must be careful
experts doing inspections, costing several million to select business problems where differentiation
dollars each year. matters less. Or they can refocus their efforts
on customizing by including additional—often
Boston Scientific had already implemented an proprietary or proprietarily integrated—data to
automated rules-based system that used achieve greater competitive advantage.
dimensional measurements and other means to
capture common issues. They had tuned the system
to be conservative, with a negligible false-negative
rate. However, the false-positive rate of 5%-10% was
still too high. Too many in-spec parts were being
flagged as defective.
18
Myth 1 Myth 5
AI is a one-size-fits-
all proposition
Myth 2
If it isn’t deep learning, it
AI only delivers value
isn’t real AI
Myth 3
for the problem at hand
Cost reduction is AI’s
sweet spot
Myth 4
Shortcuts don’t work Reality
in AI
Myth 5 Emergent intra- and inter-company AI network effects
AI only delivers value
for the problem at hand are driving real business value across the enterprise.
Appendix
Proliferating data sources and the increased ability to tap into them provide organi-
zations with a rising wealth of data. Used strategically to fuel thoughtful, ethical AI,
companies are reaping not only financial rewards but an uptick in open innovation.
This, in turn, continues to deliver other economic benefits of scale—especially
among more advanced AI adopters. As we noted in 2020 in our report on the
business value of AI:
19
For example, rotating AI talent from one department Depending on their needs and on their existing
or project to another allows cross-pollination of capabilities, participating carmakers can either lease
expertise—and continued organizational learning for autonomous vehicle hardware to train their own
people—across a company. This approach helps grow models based on a larger data set, or use pretrained
overall AI acumen versus allowing it to stagnate. models from NVIDIA. In either case, instead of
making significant capital investments in hardware
Network effects and other synergies between AI and and AI development capability, carmakers can book
other digital transformation technologies—like cloud, the technology as operating expenses, and benefit
the Internet of Things, security, and data management— from improvements as the hardware and software
add to the value that can be realized.13 improves.
As with many emerging technologies, we see that What we see manifesting across institutions with AI
what AI was already catalyzing within institutions has ultimately points to forces that could impact whole
also begun to manifest across institutions. economies.
20
What stands in the way? Further reading
Knowledge diffusion—the distribution of knowledge Since late 2020, the IBV has been developing a series on
and talent—does not always line up neatly with building a world-class AI capability. It takes a holistic,
“absorptive capacity”—the ability of organizations enterprise-wide point of view on AI and weaves together
to adapt and integrate those insights and skills.14 many of the relevant themes necessary to realizing
Institutional barriers can get in the way, as can rigid financial and economic value from adopting AI.
management that resists the discomfort change
Each of these pieces has a set of concrete recommenda-
can bring.
tions relevant to a specific theme—also tailored to
The transformative value of AI—through its financial, enterprises that are more or less mature in their
economic, and societal impacts—can only become adoption of AI business practices.
reality if leaders of more traditional enterprises set
We suggest consulting these tangible action guides—
aside nostalgic notions of how things have worked in
synthesized from dozens of AI projects and the
the past. They must fully grasp the opportunities for
expertise of hundreds of AI practitioners and other
innovation strategically, thoughtfully, and concretely.
experts—found in each of the following:
A critical starting point is to separate perception from – Strategy and vision: Rethinking your approach to AI
the emerging reality of AI.
– Data and technology: Dealing with the AI data
dilemma
21
About
the authors
Subhro Das, PhD Subhro Das is a Research Staff Member at the MIT-
Research Staff Member, IBM Watson AI Lab in IBM Research. As a Principal
MIT-IBM Watson AI Lab Investigator at the lab, he works on developing novel
[email protected] AI algorithms in collaboration with MIT. His research
linkedin.com/in/subhrodas/ interests are broadly in the areas of optimization
methods for machine learning, reinforcement learning,
trustworthy machine learning, and human-centric
AI algorithms. He received MS and PhD degrees in
Electrical and Computer Engineering from Carnegie
Mellon University.
222
About
the authors
Contributors Acknowledgements
Adam Bogue The authors and contributors would like to thank the MIT-
Business Development Lead, IBM Research IBM Watson AI Lab, and its codirectors Aude Oliva and
David Cox, for funding this project—as well as Seth Dobrin,
Alex Gorman
Glenn Finch, and Sriram Raghavan for their support.
Program Director, Client Advocacy, IBM Software
Cathy Reese
Senior Partner, Practice Leader, IBM Consulting
Shannon Todd-Olson
Senior Partner, IBM Consulting
23
24
Appendix
Scope and scale of
detailed case studies
Company interviews
25
Appendix
BESTSELLER
BESTSELLER
Boston Scientific
Using AI to unlock the value in your company’s data
Crédit Mutuel
Global Bank
Summary
IFFCO-Tokio
KPMG Demand prediction relies on making product characteristics available to algorithms.
The more information is available, the more variability in historical demand patterns
Marketing Platform
can be captured, and the better the future predictions are likely to be.
McCormick
However, assessing more granular attributes of products can be difficult and time
Navtech
consuming. Deep learning provides a solution by rapidly and accurately classifying
NVIDIA products with minimal manual intervention, thus increasing the features available for
Suncor the prediction algorithms. Fashion company BESTSELLER illustrates how this works.
Zzapp
Opportunity: Reducing waste and
improving turnaround time
In the fashion industry, around 80% of merchandise is sold across two seasons each
year, and everything else is highly discounted, ultimately donated, or dumped. This
over-production means sub-optimal profits but presents an enormous sustainability
issue as well.
BESTSELLER designs, makes, and sells clothes for the Indian market. For each of its
four brands, its team designs and mocks up 3,500 samples, but selects only 1,100 for
production. These successful candidates are stratified into 5,000-6,000 SKUs by
color, size, and more; 1.5 million pieces are produced. Of these, BESTSELLER can sell
approximately 78%, which is relatively good performance in the fashion industry.
An opportunity exists, though, to increase this percentage even further by better
matching production to customer preferences. With plans to more than double its
portfolio of brands from four to nine by the end of the year, improving the sell-through
rate can have an outsized impact on profitability.
26
BESTSELLER (continued)
Clothes can be described in terms of shape, cut, fabric, styles, and various design
elements. In fact, BESTSELLER was using a taxonomy of more than 7,000 design
patterns and 4,000 colors. While many of these features are discernable simply by
observing images of the product, very little of this information was tagged in the
product master data. BESTSELLER needed a way to extract this information quickly
and effectively.
27
Appendix
BESTSELLER
Boston Scientific
Boston Scientific
Avoiding the pitfalls of transfer learning
Crédit Mutuel
Global Bank
Summary
IFFCO-Tokio
KPMG Transfer learning involves repurposing a model originally trained for one task to use it for
a different task. In this sense, the knowledge gained while solving one problem can be
Marketing Platform
applied to a related problem--for example, a model for recognizing cars can be applied to
McCormick recognizing trucks.
Navtech
Transfer learning can save work and help reduce training costs, but it can also come with
NVIDIA an outsized (up to 45%) drop in performance, so there are limited applications where it
Suncor makes sense. Boston Scientific’s experience, however, shows an organization can still
achieve high performance with transfer learning by “stepping down” the problem,
Zzapp
enabling its model to achieve performance of over 99% and labor savings of over
$5 million.
Traditionally, human experts have done much of the inspection, but this is not optimal. As
Eric Wespi, a Boston Scientific data science manager, explained: “Human visual
inspection is often slow, expensive, and can present unwanted quality risks.” This makes
intuitive sense; people typically don’t perform well on tasks that require focused
attention for a long period of time where the probability of an event is infrequent.
Moreover, judgement can vary from person to person. Also, experts’ time is expensive;
Boston Scientific has approximately 3,000 experts performing inspections at a cost of
several million dollars each year.
28
Boston Scientific (continued)
Convolutional neural networks (CNNs) are particularly well-suited to image classification, but such
models require an enormous amount of data to train. In many cases (particularly for newer and
rarer defects), the team did not have enough data to train these models from scratch. Collecting or
generating this data would be impractical and the cost prohibitive.
1. Scale down the problem: For each defect, inspection could be segmented into smaller and
narrower tasks such as, “Does this portion of the image contain a link?” and “Is this link broken
or not?”.
2. Customize existing models: Several open source CNNs were used (for example, VGG16,
EfficientNet [B0 through B7], Mask R-CNN, YOLOv3, ResNet-50, and Inception-v3). In each
case, the team started with the open-source model’s pretrained weights, customized the last
couple of network layers, and then retrained the models using their own data.
3. Test data requirements: The team found that they needed less data than expected (for example,
100-1,000 examples of each defect and 50,000-60,000 examples of non-defective stents ) to
exceed human performance.
To improve the robustness of the models, they also augmented the training data by generating
additional examples through perturbation (these could be simple adjustments that should not
impact classification, such as brightness adjustments or the addition of noise).
All the work was completed within a relatively modest budget of $50,000. Model training was
quick and inexpensive, taking 1-2 seconds per image across nine models, 2-10 hours to train each
model on a single GPU, and a small team of approximately three people.
This level of performance is not what is typically expected with transfer learning. Performance
usually drops significantly, requiring more data to offset the deficit. In this case, applying the
existing models to a simpler problem appears to have eliminated that need.
Deploying the nine models enabled an equivalent of $5 million in direct labor savings from the
reduction in parts being flagged for human inspection, and the opportunity to reassign several
experts to other high-value projects.
Boston Scientific’s experience suggests that transfer learning works well with the right conditions:
– A generic model exists to be leveraged. In the case of image processing tasks, the early layers
of such networks seem to be highly transferable even when the task is notably different.
– The usual drop in performance from transfer learning can be eliminated by using the system
on a simpler problem and doing finetuning on the network.
29
Appendix Crédit Mutuel
BESTSELLER
Boston Scientific
Using AI to get the right information
Crédit Mutuel
to customer advisors
Global Bank
IFFCO-Tokio
KPMG
Summary
Marketing Platform
The drive for efficiency in customer service is often at odds with a desire to deepen the
McCormick customer relationship. Specialists are better able to address questions related to specific
Navtech products or services but can lack the context necessary to serve customers whose
relationship with the company is broader than one domain alone. Credit Mutuel employed
NVIDIA
AI to scale dedicated points-of-contact for customers across many products by providing
Suncor curated information to its customer representatives.
Zzapp
Opportunity: Enhance the service
provided by human advisors
At Crédit Mutuel, each customer has a dedicated advisor. The advisor acts as a first point of
contact, helping customers navigate their relationship with Crédit Mutuel across various
products in areas like checking, savings, mortgages, and investments. The quicker and
easier it is for advisors to access relevant information, the faster they can respond to
customer requests (and the more time available to serve other customers). With approxi-
mately 3 million incoming calls and 7 million emails received per month, improvements in
resolution time can have a significant impact.
30
Crédit Mutuel (continued)
This case exemplifies AI being used not to provide a specific answer, but as an integral
part of a human-led workflow, generating a smaller and more targeted set of proposed
outputs where humans can apply their subjective judgement.
31
Appendix
BESTSELLER
Global Bank
Boston Scientific
Crédit Mutuel AI can increase capacity by complementing existing
Global Bank processes with minimal disruption
IFFCO-Tokio
KPMG Summary
Marketing Platform
In highly regulated industries such as banking, it’s critically important to maintain
McCormick process documentation. This enables banks to execute more consistently and repeatably,
Navtech and to demonstrate compliance during external audits. To help ensure accuracy,
completeness, and comprehension, banks conduct internal audits of documentation
NVIDIA
where their teams review and attempt to replicate process controls. However, the
Suncor write-ups are often complex and unstructured (free text), and manual audits take time. A
Zzapp global bank shows how deep learning can help scale that manual effort, while comple-
menting, not replacing, existing people and processes.
Banks mitigate these risks through a rigorous system of controls. Some are automated
but most are manual. To help ensure that these controls are designed appropriately and
continue to be applied effectively, a bank’s Internal Audit (IA) department conducts tests
by sampling controls and checking to see if they worked. If the banks find an issue (for
example, opening an account that should not have been opened), then they implement a
corrective action plan (CAP) (such as increasing the frequency of refreshing the list of
blocked entities). The more controls that can be checked—and the more often—the
greater the assurance provided to the business.
32
Global Bank (continued)
The audit process represents a significant manual effort as is. Global Bank conducts approxi-
mately 1,000 audits per year, each of which looks at around 10 controls and takes three hours on
average to complete. Global Bank continues to increase its capacity (the number of auditors will
grow by 30%), and already has one of the largest internal audit departments in the world, so it’s
important to optimize the productivity of those resources.
The solution was two-pronged. First, Global Bank built a new model that was easier to retrain.
It partnered with IBM to build this model, bootstrapped from a prior engagement, and put in an
on-prem implementation using IBM Watson® Studio. Second, Global Bank set out to increase
the data available for this new model. Previously, it had built a tagging system in Python directly
connected to their internal audit platform, and the system enabled auditors to make new
annotations as they worked on audits. The company augmented this system with the original BERT
model, which provided auditors instant feedback, and made this data available for the new IBM
model.18
Within just four months, rapid adoption of the system had already yielded results. Fifty active
users had collectively input 12,000 entries (custom annotations) across 5,000+ controls and were
adding an additional couple hundred entries each week. The increased efficiency of the review
process enabled Global Bank to release an estimated 30,000 hours of effort and deploy it towards
additional assurance that would not otherwise have been possible.
33
Appendix
BESTSELLER
IFFCO-Tokio
Boston Scientific
AI process improvements generate
Crédit Mutuel
better customer incentives
Global Bank
IFFCO-Tokio
Summary
KPMG
Marketing Platform
Like many industries, insurance companies face a challenge in providing the right
McCormick incentives. In general, insured people are not motivated to seek the best price for services
Navtech if an insurance company is footing the bill. But sometimes AI can make possible the
process changes that enable them to unlock this value.
NVIDIA
Suncor Opportunity: Pay customers directly
Zzapp
IFFCO-Tokio settles approximately 500,000 motor-vehicle damage claims in India each
year. Traditionally, customers sought repairs at privately owned repair workshops, which
provided quotations for IFFCO-Tokio to approve, but there were several issues with this
process. Workshops are incentivized to inflate their estimated costs, and customers do
not have incentives to seek out the best prices. This led to disagreements about repair
charges and delays. Customers, many of whom depended on their vehicle for their
livelihood, were severely disadvantaged by delays of up to 20-30 days per settlement.
Some could not afford to wait, forgoing repairs and driving vehicles that were unsafe.
IFFCO-Tokio resolved to pay customers directly for the cost of repairs. This would provide
customers better incentives and empower them to take charge of the process. But there
were two challenges: providing fast turnaround on quotations and getting estimates
correct without a quote from the repair shops.
The new approach proved to be immensely popular with customers, but it was time
consuming and imprecise. The turnaround time for this process was up to 5 hours per
claim, much of which was assessor time to inspect parts and populate forms with “repair/
replace” decisions, as well as cost estimates for each part. Their efforts were also
complicated by poor-quality images. Previously, pictures of the damage had been taken in
the controlled environment of a professional garage, but customers frequently submitted
images with incorrect angles, poor lighting, or glare.
34
IFFCO-Tokio (continued)
35
Appendix
BESTSELLER
KPMG
Boston Scientific
Crédit Mutuel Task complexity, not data availability, drives
Global Bank the choice of machine learning method
IFFCO-Tokio
KPMG Summary
Marketing Platform
Natural language processing (NLP) and text-mining approaches often rely on deep learning
McCormick for named-entity recognition (NER). These approaches enable meaning to be extracted (for
Navtech example, from sentences and passages) and are often able to perform well on complex
tasks. However, KPMG’s experience with document search and classification illustrates
NVIDIA
that when a particular sub-task is simple enough, traditional machine learning may be the
Suncor best approach.
Zzapp
Opportunity: Better substantiation for R&D tax credit claims
Tax incentives for research and development (R&D) in the US can be significant. They can
be up to 11%-15.8% of eligible incremental R&D expenditures—and the benefit can further
increase with many states also offering a state R&D credit. For smaller research organiza-
tions in particular, these tax credits can be decisive in making an R&D project commercially
viable, or in securing investor funding to initiate projects in the first place. KPMG is hired by
businesses of all sizes to document the R&D they’ve done and to help ensure that the
business gets the maximum tax credit for that R&D.
The evidence is typically collected from an organization’s documentation and can take
many forms. It could include presentations, emails, meeting minutes, lab reports, test
records, and engineering drawings. Content is often unstructured, of intractable volume,
and/or stored in various repositories. In some cases it may be very limited, such as in agile
or continuous improvement environments. In any case, regulations do not specify what
qualifies as “sufficient” evidence, so it’s important to review as much information as is
available and present as much of it as possible that is relevant and high quality.
36
KPMG (continued)
KPMG supports clients in their audit readiness and is experienced in managing the discovery
process. Traditionally this has involved a top-down approach, starting with a list of projects,
trawling through document repositories related to those projects, manually searching through
the documents using keywords, and reading and tagging specific document sections that
satisfy each of the four tests. This is a significant manual effort and necessitates some prioriti-
zation— which risks excluding valuable evidence. It also takes valuable time from client
scientists and engineers to support the effort. KPMG wondered if machine learning could help
them do better.
Documents were chunked into sections by tokenizing words and sentences. Teams tried a range
of approaches, including statistical learning, such as regular expressions, support vector
machines, decision trees, and random forest; deep learning for named entity recognition (NER),
and rules-based approaches. They found that accuracy ranged from 55% using out-of-the box
document discovery software (about as effective as a manual keyword search) to above 70%
for deep learning. However, the best approaches were rules-based with accuracy exceeding
85%. This was likely due to a relatively high degree of standardization between document
formats, making the document chunking task relatively straightforward.
It is worth reflecting on the relative performance of deep learning to other methods. The results
reinforce the assertion that even if there is a large enough data set, deep learning tends to be
superior only when the data or the problems are extremely complex. In this case, simple rules
and keywords were sufficient to identify the relevant information for each test, while also
providing greater explainability.
37
Appendix
BESTSELLER
Marketing Platform
Boston Scientific
Crédit Mutuel
Controlling AI cost through better visibility
Global Bank
IFFCO-Tokio
Summary
KPMG
Marketing Platform With on-prem infrastructure, the cost of training and running machine learning models is
usually hidden from the business. For example, there can be a disconnect between the
McCormick
value generated from the model and what it takes in terms of engineering and compute
Navtech resources to get there.
NVIDIA
This case study with Marketing Platform shows that bringing transparency to the link
Suncor between computational demand and cost can help in creating incentives and in reducing
Zzapp costs in a meaningful way.
The data sets available for these models are enormous. Marketing Platform runs a data
cooperative with thousands of members, with 25%-40% of these members regularly
contributing data on things like transactions, donations, or subscriptions. It combines
this data with compiled third-party data on everything from demographics, to census
data, to household incomes. After feature engineering, the data comprises 12,000
variables and covers almost the entire US population.
With such a rich data set, Marketing Platform was at the limits of what its existing
on-prem infrastructure could handle. The team was able to train models only on internal
data, and even then, only a small portion of it (for example, a 50,000-100,000 sample) at
a time. Marketing Platform knew that if it could use more data, there was a huge
opportunity to generate additional value.
38
Marketing Platform (continued)
Initially, the scaling of compute resources (along with the initial switching cost and learning
curve) meant that total costs increased. In a fully on-prem world, the cost of infrastructure had
been independent of utilization, so data scientists were able to run whatever they wanted,
constrained only by compute availability. With effectively unlimited scalability, experiments
would have to have to be designed more thoughtfully.
The team was also able to increase the efficiency of their compute utilization by improving
cluster allocation, data flow, and the overall modeling pipeline. For the machine learning model
itself, they ran tests to optimize the approximately 100 required model parameters and fixed
several of them based on what worked well to minimize the number that would need tweaking
each time.
The platform’s success in this area is now fueling a transformation of its data science capabil-
ities, bringing the number of practitioners in the US up from 40 to 4,000 within the year.
– Visibility of training costs at a per-model run level of granularity can help make the cost of
moving to new machine-learning techniques less expensive than would be expected.
– Volume matters: even small gains in accuracy can have an enormous impact to an
organization when it can be applied at scale.
39
Appendix
BESTSELLER
McCormick
Boston Scientific
Crédit Mutuel Augmenting creativity in R&D through
Global Bank AI-directed exploration
IFFCO-Tokio
KPMG Summary
Marketing Platform
AI is often used to make recommendations based on what has worked well in the past,
McCormick but this can result in solutions that are simply more of the same. McCormick’s experience
Navtech shows that AI can also be used to explore a solution space and result in new, creative
combinations that may not otherwise have been attempted. In this way, AI can help
NVIDIA
augment and accelerate a process of creative experimentation.
Suncor
Zzapp Opportunity: Use AI to accelerate
development of new flavor profiles
McCormick creates a variety of products including seasonings, sauces, and flavors (some
of which may also be sold B2B to clients for incorporation into third-party products). A
formula is a combination of specific ingredients in precise proportions, standardized for
consistency, that delivers a flavor profile which describes the eating experience. Flavor
creation for B2B products can be a competitive process, with several companies
producing formulas in response to client requests. To improve the success of new
flavors, McCormick looked at two important factors:
– Scaling the experience of food scientists. A junior flavorist typically apprentices for
seven years, during which they build experience and glean valuable insights. It is this
cumulative experience that enables creativity, through an understanding of what
works, what doesn’t, and what are the degrees of freedom in between.
McCormick wondered if AI could help. If flavorists can intuit insights from empirical
experience, then there was probably more that could be extracted, more rapidly, through
analytics. If these insights could be captured and systematized, then it could help
flavorists better explore the flavor space in two ways. The first is finding optimal flavors
within an area around what is known well. The second is by finding promising new areas
in the flavor space that have not been explored. Ultimately, this would result in faster
development and better quality.
40
Challenge: Credit assignment
and large search space
The company leveraged data on approximately 350,000 formulas created over more than a
decade, covering product attributes such as category (baked goods, salty snacks), format
(seasoning, condiment, wet sauce, dried), amounts, type, and success metrics such as
product tasting scores. It also captured functional attributes like shelf stability and flow rate,
as well as non-functional attributes like grain size, sodium content, and FEMA18 numbers
(attributes on 40,000 raw materials). With such a high dimensionality in the data set, the
team needed a way to condense the problem to keep it manageable.
To enable this effort, the team needed a few tricks. First, they reduced the dimensionality of
the data by aggregating 40,000 distinct raw materials into 3,000 groupings. Second, they
sampled 3,000 formulas for training from the 350,000 total, each labeled with a “success”
rating. Last, they formulated the model as a graph problem, defining distance metrics
between materials where each formula was represented as a vector.
41
Appendix
BESTSELLER
Navtech
Boston Scientific
Crédit Mutuel The new platforms being created by deep learning
Global Bank
IFFCO-Tokio Summary
KPMG
Some innovations can only be economically feasible when a centralized provider can
Marketing Platform make the up-front investments required and spread the costs over a wide enough
McCormick customer base. This is increasingly true for machine learning , particularly where the
tasks being automated are perception based (for example, image recognition), because
Navtech
these can require significant data and compute resources to develop and maintain. For
NVIDIA many individuals and businesses, this technology is simply out of reach.
Suncor
Navtech identified an opportunity to bring advanced computer vision to individual
Zzapp diamond retailers across the globe by creating a model and delivering it as a service. This
approach can be a win-win and is an excellent example of where AI can not only deliver
improved efficiency and performance, but also unlock new capabilities and business
models in the process.
Visual catalogs are a key part of the sales process. Each retailer maintains one for its own
inventory and supplements it with images of other jewelry as inspiration for customers
looking for bespoke pieces. Traditionally, these have been physical booklets or
magazines, but these formats can only present a limited number of items and can’t be
refreshed often.
Digital catalogs remove many of these constraints but introduce other challenges. Staff
compile images from various sources like inventory photographs, the web, and manufac-
turers’ catalogs, and categorize them manually into folders. The process is slow (30-60
seconds per image, up to a million images), prone to error (there are lots of duplicate
images and it’s difficult to remember what you’ve seen already), and results in only very
high-level categorization (for example, rings versus necklaces). Ideally there would be a
way to automatically create catalogs and enable customers to search based on
additional criteria.
42
Navtech (continued)
Navtech selected three attributes for its pilot and created a deep learning model for each:
– For product category (for example, rings, bracelets) and style, it used a VGG1619 network to
classify images. The open source ImageNet-trained backbone was fine-tuned by custom
training the head and the first and second layers of the network.
– For diamond cut (for example, round, square), it instead used Mask RCNN for object
detection and classification (having achieved only 55-56% accuracy using VGG16). The
training data for this model was labelled by an intern, who manually drew a polygonal mask
around the shape of each diamond.
This experience illustrates the cost-benefit tradeoffs of deep learning, where some use cases
can only be enabled by larger, centralized players with the ability to serve wider audiences.
Such systems, because they are delivered at scale, must be delivered as part of a wider
product and service architecture, underpinned by traditional software development, but are
much more cost-effective.
43
Appendix
BESTSELLER
NVIDIA
Boston Scientific
Crédit Mutuel Building a compute and data
Global Bank platform for self-driving cars
IFFCO-Tokio
KPMG Summary
Marketing Platform
To build safe and reliable AI models for autonomous vehicles (AVs), enormous amounts of
McCormick compute power and training data are needed, along with the skills, resources, and expertise
Navtech required at that scale. Under such conditions, we are likely to see an emergence of larger
platforms that can pool data from multiple participants, aggregate sufficient demand to
NVIDIA
justify the large investments required, and ultimately enable a new business model where
Suncor AV software can be offered as a service to carmakers and fleet operators.
Zzapp
Opportunity: Solving complex problems
to drive mass adoption of AV
Driver-support systems are becoming more widespread and can now perform parking,
emergency braking, lane changing, and other functions. Once vehicles are fully self-driving,
the applications will be numerous, from freight and mass transit, to on-demand transpor-
tation like robo-taxis. The global autonomous vehicle system market was $82 billion in
2021 and is forecast to grow to $770 billion by 2030—a compound annual growth rate
(CAGR) of 39.1% (from 2022 to 2030).20
Considering that each of these scenes will entail data from numerous sensors, the compute
challenge for AVs is enormous. To create the perception models needed for a full AV stack,
NVIDIA estimates that a productive development team could require in the order of 5,000
dedicated GPUs.22 A single model can take three to six days to run on 32 GPUs, and there
can be 25-50 deep learning experiments for each task. Individual car companies typically
don’t have the resources in terms of skills, experience, hardware, and data to develop these
systems on their own.
44
NVIDIA (continued)
– Extending a common data platform across multiple customers: Pooling data between
several carmakers increases the data available for training and can enable greater model
performance, particularly with edge cases. The quality of the data can be enforced through a
reference architecture that specifies the standards for sensor specifications and placement.
– Simulation for training and testing: Hundreds of millions of driving scenarios can be simulated
to supplement real-world data and help bootstrap models for silent on-street testing and
iteration—running AI in the vehicles to compare what it would have done relative to the driver’s
actual behavior.
– Common processing of visual tasks: NVIDIA was able to minimize the compute required by
jointly training multiple tasks on a single ResNet-based model architecture. Once the full
model is trained, the heads (later layers) of the model can be optimized for each given task,
without the need to retrain the trunk (earlier layers) of the model. Noting that the compute was
not much greater than that required for a single task alone suggests that there is a lot of
common processing possible, which makes intuitive sense for the domain of computer vision.
It also represents the start of a new market dynamic. On one side are vertically integrated
carmakers such as Tesla), that can codesign their software and hardware for more seamless
experiences. On the other are increasingly modularized carmakers who compete on the quality
of their hardware and buy their software from centralized players like NVIDIA (which greatly
reduces cost of entry to the AV market and is likely to stimulate greater competition as a result).
The success of either of these two paradigms depends on how important the quality of AV
software is in terms of the overall experience.
45
Appendix
BESTSELLER
Suncor
Boston Scientific
Crédit Mutuel Performance and explainability
Global Bank aren’t always a tradeoff
IFFCO-Tokio
KPMG Summary
Marketing Platform
Deep learning often performs well at predicting processes that are nonlinear (small
McCormick changes can have an outsized impact) and highly coupled (there is a lot of dependency
Navtech between factors). Often there is sufficient performance improvement from adopting
deep learning that firms are willing to have their models be less explainable (that is,
NVIDIA
black box). But as Suncor’s experience shows, when the stakes are high enough,
Suncor explainability is at a premium.
Zzapp
Opportunity: Better prediction of
issues to manage output quality
Suncor Energy specializes in the production of synthetic crude from oil sands. For
diesel in particular, this involves removing sulfur and nitrogen by hydrotreating, mixing
the straight-run diesel with hydrogen (and a solid metal catalyst such as cobalt) at high
temperature and pressure. The process is complex, with several variables (pressure,
temperature, flow rates) that interact to affect the resulting quality of the output.
These factors must be tightly monitored and controlled to minimize “upsets”, where
product quality deviates outside acceptable rates and cannot be sold. With diesel
production averaging 43,000 barrels per day, there is a strong commercial incentive to
avoid upsets as much as possible.
46
Suncor (continued)
However, when the team compared the performance to simpler traditional statistical
techniques, they observed performance that was much better than expected—for
example, principal component analysis (PCA) performed at only a 10% performance
deficit relative to XGBoost, while being much easier to interpret.
47
Appendix
BESTSELLER
Zzapp Malaria
Boston Scientific
Crédit Mutuel Learning from satellite images to fight malaria
Global Bank
IFFCO-Tokio Summary
KPMG
With satellite imagery, computer vision is commonly used to identify visible objects,
Marketing Platform with convolutional neural networks (CNNs) most often being the default choice. Even
McCormick when the objects themselves are not clearly visible, predictive models using CNNs can
sometimes infer the presence of objects based on other characteristics (for example, the
Navtech
region surrounding the object in question). However, as the experience of ZzApp Malaria
NVIDIA shows, this is not always the case, and in such situations, traditional techniques such as
Suncor linear regression may be good enough.
Zzapp
Opportunity: Preventing malaria through
treatment of standing water
Malaria caused an estimated 627,000 deaths in 2020, with Africa accounting for 96%
of all deaths. Vector control is the main way to prevent transmission through bites from
malaria-carrying Anopheles mosquitos. The primary focus thus far has been on
commodities like bed nets or indoor spraying of insecticides, but these are only partially
effective and don’t work outdoors. An alternative approach is to treat standing water
within the community (where mosquitos breed and multiply) directly, but such programs
have not been systematic or comprehensive enough to be effective at scale.
48
Zzapp (continued)
49
About Research Insights IBM Institute for
Research Insights are fact-based strategic
Business Value
insights for business executives on critical
For two decades, the IBM Institute for Business
public- and private-sector issues. They are
Value has served as the thought leadership think
based on findings from analysis of our own
tank for IBM. What inspires us is producing
primary research studies. For more information,
research-backed, technology-informed strategic
contact the IBM Institute for Business Value at
insights that help leaders make smarter business
[email protected].
decisions.
50
Notes and
sources
1 Sources: “Fast Start in cognitive innovation: Top 3 “McCarthy, J; M.L. Minsky; N. Rochester; C.E. Shannon.
performers share how they are moving quickly.” IBM “A proposal for the Dartmouth summer research
Institute for Business Value. January 2017. https://www. project on artificial intelligence.” Accessed on July 13,
ibm.com/blogs/internet-of-things/fast-start-cognitive/ 2022. http://www-formal.stanford.edu/jmc/history/
Unpublished data. C&A8. In general, where is your organi- dartmouth/dartmouth.html
zation in its adoption of cognitive computing? Select the
most advanced level for your organization; “Shifting 4 Goodfellow, Ian; Yoshua Bengio, Aaron Corville.
toward Enterprise-grade AI: Confronting skills and data “Deep Learning.” The MIT Press. 2016. https://www.
challenges to realize value.” IBM Institute for Business deeplearningbook.org
Value. September 2018. https:// www.ibm.com/
5 LeCun, Yann; Yoshua Bengio; Jeffrey Hinton. “Deep
thought-leadership/institutebusiness-value/report/
Learning.” Nature. May 28, 2015. https://www.nature.
enterpriseai Unpublished data. AI1. In general, where is
com/articles/nature14539.pdf
your organization in its adoption of artificial intelligence?
Select the most advanced level for your organization; “The
6 Burns, Ed. “Timeline of AI winters casts a shadow over
business value of AI: Peak performance during the
today’s applications.” TechTarget. Accessed on July 13,
pandemic.” IBM Institute for Business Value. November
2022. https://www.techtarget.com/searchenter-
2020. https://www.ibm.com/thought-leadership/insti-
priseai/infographic/
tute-business-value/report/ai-value-pandemic#
Timeline-of-AI-winters-casts-a-shadow-over-todays-
Unpublished data. S6. In general, where is your organi-
applications
zation overall and your particular function in terms of
adoption of artificial intelligence? 2022 Omdia AI Market 7 Thompson, Neil C.; Kristjan Greenewald; Keeheon Lee;
Maturity survey https://omdia.tech.informa.com/ Gabriel F. Manso. “Deep Learning’s Diminishing
OM023919/AI-Market-Maturity-Survey--2022-Database Returns.” IEEE Spectrum. September 24, 2021. https://
Q1. What is the state of AI deployment in your company? spectrum.ieee.org/deep-learning-computational-cost
The rating scale in Omdia survey has been assumed to
equivalent to IBM IBV rating scale in the following way: 8 Ibid.
investigating technology and use cases = considering;
Identified at least one use case and developing pilot = 9 Ibid.
Evaluating; Currently piloting AI in at least one function or
business = Piloting; Live AI deployment in at least one 10 World Health Organization malaria fact sheet. April 6,
function or business unit = Implementing; Scaling AI 2022. https://www.who.int/news-room/fact-sheets/
deployment across multiple business functions or units = detail/malaria
Operating/optimizing.
51
11 “Fast Start in cognitive innovation: Top performers 16 XGBoost is a decision-tree-based ensemble ML
share how they are moving quickly.” IBM Institute for algorithm that uses a gradient boosting framework.
Business Value. January 2017. https://www.ibm.com/ https://towardsdatascience.com/
blogs/internet-of-things/fast-start-cognitive/ https-medium-com-vishalmorde-xgboost-algorithm-
Unpublished data. QC&A10 What are the important long-she-may-rein-edd9f99be63d
value drivers for cognitive computing? Select the top
5.“Shifting toward Enterprise-grade AI: Confronting 17 McCormick has observed that each 5-10 years of
skills and data challenges to realize value.” IBM experience halves the number of iterations.
Institute for Business Value. September 2018. https://
18 The Flavor Extract Manufacturer’s Association of the
www.ibm.com/thought-leadership/institutebusi-
United States. FEMA numbers refer to ingredients
ness-value/report/enterpriseai Unpublished data. AI2.
generally recognized as safe and allowed in the United
What are the important value drivers for artificial intelli-
States. https://www.femaflavor.org/
gence/cognitive computing? Select top 5. “The
business value of AI: Peak performance during the
19 VGG16 (also called OxfordNet) is a convolutional neural
pandemic.” IBM Institute for Business Value. November
network architecture named after the Visual Geometry
2020. https://www.ibm.com/thought-leadership/
Group from Oxford. https://blog.keras.io/how-convolu-
institute-business-value/report/ai-value-pandemic#
tional-neural-networks-see-the-world.html
Unpublished data Q8. What are the most important
value drivers for artificial intelligence? Select top 5. 20 Report Ocean press release. “Autonomous Vehicle
System Market |(CAGR) of 39.1%| by Product Type,
12 “The business value of AI: Peak performance during the
End-User, Application, Region – Global Forecast to
pandemic.” IBM Institute for Business Value. 2020.
2030.” July 14, 2022. https://www.marketwatch.com/
https://www.ibm.com/thought-leadership/insti-
press-release/autonomous-vehicle-systemmarket-
tute-business-value/report/ai-value-pandemic
cagr-of-391-by-product-type-end-user-application-re-
gion-global-forecast-to-2030-2022-07-
13 Payraudeau, Jean-Stéphane; Anthony Marshall;
14?mod=search_headline
Jacob Dencik. “Unlock the business value of hybrid
cloud: How the Virtual Enterprise drives revenue growth
21 Kalra, Nidhi and Susan M Paddock. “Driving to Safety:
and innovation.”IBM Institute for Business Value. 2021.
How Many Miles of Driving Would It Take to
https://ibm.co/hybrid-cloud-business-value.
Demonstrate Autonomous Vehicle Reliability?” Rand
Payraudeau, Jean-Stéphane; Anthony Marshall; Jacob
Corporation. 2016. https://www.rand.org/content/
Dencik. “Extending digital acceleration: Unleashing the
dam/rand/pubs/research_reports/RR1400/RR1478/
business value of technology investments.” IBM
RAND_RR1478.pdf
Institute for Business Value. 2021. https://ibm.co/
hybrid-cloud-business-value 22 GPUs are arranged into purpose-built deep learning
systems (for example, the NVIDIA DGX, which
14 Fleming, Martin. “Breakthrough: A Growth Revolution.”
comprises 8 GPUs per server).
Business Expert Press. 2022
52
© Copyright IBM Corporation 2022
IBM Corporation
New Orchard Road
Armonk, NY 10504
IBM, the IBM logo, ibm.com, IBM Cloud Pak for Data, IBM
Research, and IBM Watson are trademarks of International
Business Machines Corp International Business Machines Corp.,
registered in many jurisdictions worldwide. Other product and
service names might be trademarks of IBM or other companies.
A current list of IBM trademarks is available on the web at
“Copyright and trademark information” at: ibm.com/legal/
copytrade.shtml.
6PQKYZ12-USEN-01
xxxxxxUSEN-00 53