AI Infrastructure 101

This document discusses how to develop a successful and modern AI infrastructure. It describes how AI infrastructure has evolved from individual data scientists developing models locally to using cloud-based tools that support automated machine learning (AutoML) and machine learning as a service (MLaaS). These cloud tools from companies like Amazon, Google, Microsoft and IBM provide full capabilities for developing, managing and operationalizing machine learning models at scale for organizations. The document also discusses model as a service, where pre-built models can be accessed through APIs rather than building models yourself.

Uploaded by

nicolepetrescu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

620 views8 pages

AI Infrastructure 101

Uploaded by

nicolepetrescu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

AI Infrastructure

101
AI Infrastructure 101

In this E-Guide:
As companies race to execute AI projects, capital investment in tools and technology that
support AI capabilities has skyrocketed. But AI infrastructure isn’t as easy as a single
How to develop a
purchase.
successful, modern AI
infrastructure Continue reading to learn how to create a successful and modern-ready AI infrastructure
without making hasty CapEx purchases.

Page 1 of 7 SPONSORED BY
AI Infrastructure 101

How to develop a successful,

modern AI infrastructure
Ronald Schmelzer, Principal analyst
How to develop a
successful, modern AI Artificial intelligence, enabled by machine learning and cognitive technologies, has taken
infrastructure many industries by storm. Only in the past decade has AI as a concept entered the day-to-
day experience of the enterprise. As such, companies are now rushing to implement AI
projects of all shapes and sizes. Correspondingly, government, enterprise, and venture
capital investment in tools and technology that support this widespread adoption of AI has
also increased dramatically.

The shift from experimental research and academic approaches to acceptable, common use
systems is rapidly changing the environment of the tools landscape. In this area of
continuous development, it's difficult to keep track of the new tools on the market that
organizations of all types are using to develop their AI infrastructure.

Data-centric ML development tools

In the early days of AI development, all machine learning model creation and management
was done locally on machines owned and operated by data scientists. As such, all the
platforms that saw early traction are focused on the individual data scientist or their
immediate teams. Open source dominates in this space, especially by offerings in the
Python and R ecosystems. Libraries developed for these ecosystems include the vastly
popular scikit-learn, Keras, TensorFlow, and PyTorch open source toolkits as well as the

Page 2 of 7 SPONSORED BY
AI Infrastructure 101

popular Jupyter notebook, and Google's Colaboratory built on top of that, as well as a wide
range of open source tools and toolkits covered in a previous article on this topic.

However, open source is not the end-all for machine learning model development. The tools
alone lack specific requirements for the management of models and data that are needed by
serious machine learning-focused data scientists and developers. As a result, over the past
How to develop a decade, tools focused on the immediate needs to build and train machine learning models
successful, modern AI have emerged. These tools have a focus on algorithm selection, tuning and evaluation, with
infrastructure the final result being Python, R, Java or other objects that can then be directly used to
answer any specific ML-related queries or data science needs, or put into operation by
production teams to be used in more highly scalable manners.

These tools include those by the major platform vendors including Amazon, Microsoft,
Google and IBM, as well as by focused data science and ML vendors including H2O,
RapidMiner, DataRobot, Databricks, Anaconda, Dataiku, Domino, KNIME, Alteryx, Ayasdi,
SAS and Mathworks. Since data science and ML model development is so data-dependent
and data-centric, big data vendors have entered this space with offerings from vendors
including Cloudera and SAP. The tools all share a focus on data centricity, with many of the
tools having origins in big data or data analytics. As a result, the core features of these
systems are algorithm and model focused, and not as much operationalization or
consumption focused. However, model operationalization and "ML ops" is rapidly becoming
the forefront of evolution for these tools.

The biggest change in the machine learning development space has been the emergence of
autoML. Given the lack of data science skills and expertise, many ML modeling and
development tools have released capabilities to automatically handle aspects of ML model
development that used to require the time and expertise of the user. In particular, data

Page 3 of 7 SPONSORED BY
AI Infrastructure 101

scientists and ML developers would have to clean and process their data, selecting among
the wide array of algorithms, configuring and managing the training of that model, tuning the
model and selecting the right hyperparameters, handling model evaluation and a variety of
additional steps required to operationalize the resultant models. AutoML tools have emerged
to handle many, if not all, of those steps. As a result, organizations are finding much greater
ability to simply drag and drop their data set into a tool, click a few options, then watch and
How to develop a
wait as a suitable model is automatically selected, tuned, configured and set up for
successful, modern AI
operationalization. AutoML vendors include open source solutions such as Auto-sklearn,
infrastructure
Auto-WEKA, OptiML AutoML and TPOT, as well as commercial offerings from companies
such as Cloudera, DataRobot, Google, H2O.ai, RapidMiner and others.

ML as a service, cloud ML, and model as a service

Given machine learning algorithms' need for data, big cloud vendors have been some of the
biggest proponents and supporters of machine learning. Amazon, Google, Microsoft, IBM,
Oracle, SAP and others are building substantial portfolios for machine learning development
and management. In the world of developer-oriented tools, the cloud-based offerings of
Amazon, Google, Microsoft and IBM stand apart from the rest.

Known as machine learning as a service (MLaaS) or cloudML offerings, cloud-based

offerings provide the full range of development, management and operationalization tools
needed to put machine learning and AI to work for a wide range of organizations' AI
infrastructure. The Amazon Web Services (AWS) machine learning solution is offered
primarily through AWS SageMaker, but also includes a number of higher-level AI and ML
capabilities for computer vision, natural language processing, predictive analytics and other
AI application areas. IBM's Watson was one of the first to be commercially available for
developers to experiment with ML and put AI into real-world, enterprise settings. Google's

Page 4 of 7 SPONSORED BY
AI Infrastructure 101

Cloud ML Engine brings Google's hosted platform to the fore to enable developers and data
scientists to run and develop machine learning models and datasets. Microsoft Azure ML
has likewise proven to provide a wide range of tools and solutions for data scientists,
developers and administrators looking to put ML into production.

Separate from the MLaaS market is the concept of model as a service. Rather than
How to develop a providing the environment to build, run and manage your own models, model as a service
successful, modern AI gives you access to prebuilt and trained models specific to individual tasks. Clarifai,
infrastructure Gumgum, Modeldepot, Imagga and SightHound are major companies building and curating
ML models for use. As a developer, you can query these models that will provide results as
specified. For example, some models might identify specific things in images while others
might help you categorize text or process natural language. Many model-as-a-service
offerings focus on specific models targeted to image recognition or text analysis but there is
an emerging class of companies trying to gather a widely curated set of models applicable
to many different domains.

ML ops and the need to manage model usage

One of the newest movements in machine learning development comes with the realization
that for many organizations, their challenges start not with producing models, but with using
and consuming them. The need to manage the operationalization of machine learning
models, or ML "ops" is becoming increasingly urgent as the number of models in production
continues to grow exponentially. Not only are companies producing and consuming their
own models, but they're increasingly making use of vendor and third-party models.

Using models in production brings up a lot of concerns, including making sure that models
are providing reliable, secure and manageable results in an environment of continuous

Page 5 of 7 SPONSORED BY
AI Infrastructure 101

change. An emerging set of ML ops tools provide capabilities for machine learning model
governance, version control, security, model discovery, model transparency and model
monitoring and management. These tools, like ParallelM, make sure that only qualified
users are allowed to make use of certain models, help ensure that new versions of models
don't cause unpredictable results, help safeguard models from data poisoning and
cybersecurity attacks, and make sure that the models continue to provide results at the
How to develop a
required levels of accuracy and precision as needed by their usage constraints.
successful, modern AI
infrastructure
Fundamental skills still needed
Before an enterprise can get started with AI, there are a few key considerations that must be
made. While it is true that an increasingly diverse range of users are able to develop and
make use of models, it is still necessary for ML developers and users to have skill sets to
effectively use these systems. At the most fundamental levels, organizations still need data
scientists with mathematical knowledge and a solid understanding of algorithms in order to
build their own models. In order to not only yield effective results but analyze and
understand the results being given, it is crucial for a citizen data scientist on the team to be
well-versed with probability and statistics, an integral part of working with machine learning.

Since a fair amount of exploration and theorizing goes into determining how to utilize these
systems to yield the desired results, having an employee who can think outside the box and
is willing to truly explore the extents of these systems is important when it comes to getting
the most out of them. Since AI platforms and tools are constantly changing, there is also a
need for the ML team to be able to stay up to date on modern methods for ML model
creation, autoML capabilities, ML Ops and other rapidly changing technology ecosystem
considerations. The artificial intelligence landscape is constantly changing, making it

Page 6 of 7 SPONSORED BY
AI Infrastructure 101

important for those working within this area to understand that today's platform investment in
their AI infrastructure might have to change tomorrow.

How to develop a
successful, modern AI
infrastructure

Page 7 of 7 SPONSORED BY

Neo4j Manual PDF
No ratings yet
Neo4j Manual PDF
334 pages
Gartner Magic Quadrant & Critical Capabilities - Gartner
No ratings yet
Gartner Magic Quadrant & Critical Capabilities - Gartner
16 pages
Information and Communications Technology Book 2
No ratings yet
Information and Communications Technology Book 2
106 pages
The Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data
From Everand
The Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data
Pierre-yves Bonnefoy
No ratings yet
MGT300 - Ch05 Exercise
100% (1)
MGT300 - Ch05 Exercise
2 pages
Microsoft Power BI DIAD - IN
No ratings yet
Microsoft Power BI DIAD - IN
84 pages
Study Material DF
No ratings yet
Study Material DF
152 pages
DGX Basepod Deployment Guide DGX A100
No ratings yet
DGX Basepod Deployment Guide DGX A100
110 pages
Google AI Infrastructure Supremacy
No ratings yet
Google AI Infrastructure Supremacy
29 pages
6643690f56a51719abfa0901 - Gartner Market Guide For NDR
No ratings yet
6643690f56a51719abfa0901 - Gartner Market Guide For NDR
18 pages
IN 1040 DataDiscoveryGuide en PDF
No ratings yet
IN 1040 DataDiscoveryGuide en PDF
215 pages
Dod Cloud Strategy Osd016570-18 Res Final
100% (1)
Dod Cloud Strategy Osd016570-18 Res Final
19 pages
IDS Reference Architecture Model 3.0 2019
No ratings yet
IDS Reference Architecture Model 3.0 2019
118 pages
Watsonx Deck
No ratings yet
Watsonx Deck
30 pages
Accenture Ever Ready Infrastructure
No ratings yet
Accenture Ever Ready Infrastructure
29 pages
Advanced Technology Stacks and Business Use-Cases
100% (1)
Advanced Technology Stacks and Business Use-Cases
28 pages
WP The Data Center of The Future Reaching Sustainability en
No ratings yet
WP The Data Center of The Future Reaching Sustainability en
40 pages
Digital Architecture Management Study 2019 Web
No ratings yet
Digital Architecture Management Study 2019 Web
24 pages
Enabling Scalable OLAP Directly On A Data Lakehouse Architecture
No ratings yet
Enabling Scalable OLAP Directly On A Data Lakehouse Architecture
39 pages
Sans Threat Intelligence Driven Attack Surface Management
No ratings yet
Sans Threat Intelligence Driven Attack Surface Management
29 pages
Data Lakehouse
No ratings yet
Data Lakehouse
7 pages
Laura Paton - PMI Business Analysis Leading Organizations To Better Outcomes
No ratings yet
Laura Paton - PMI Business Analysis Leading Organizations To Better Outcomes
25 pages
Connected Mining 1.0 Design Guide
No ratings yet
Connected Mining 1.0 Design Guide
65 pages
Migrating To Cloud-Native Threat Detection and Response (2023) - CyberProof-24pg
No ratings yet
Migrating To Cloud-Native Threat Detection and Response (2023) - CyberProof-24pg
24 pages
Fraunhofer - ISST Report - Data Strategy Praxis Report
No ratings yet
Fraunhofer - ISST Report - Data Strategy Praxis Report
26 pages
Cloud Anywhere:: Azure For Hybrid and Multicloud Environments
No ratings yet
Cloud Anywhere:: Azure For Hybrid and Multicloud Environments
36 pages
Data Fabric Solutions
No ratings yet
Data Fabric Solutions
37 pages
"Defend Forward" and Sovereignty
No ratings yet
"Defend Forward" and Sovereignty
28 pages
Information Security CS 526: Topic 21: Data Privacy
No ratings yet
Information Security CS 526: Topic 21: Data Privacy
39 pages
Building The Unified Data Warehouse and Data Lake: Best Practices Report Q2
No ratings yet
Building The Unified Data Warehouse and Data Lake: Best Practices Report Q2
30 pages
IOT Architecture II
No ratings yet
IOT Architecture II
29 pages
2023 Data, Analytics, and Artificial Intelligence Adoption Strategy-F
No ratings yet
2023 Data, Analytics, and Artificial Intelligence Adoption Strategy-F
13 pages
DGX A100 System Architecture Whitepaper
No ratings yet
DGX A100 System Architecture Whitepaper
23 pages
The Future of Network Security Is in The Cloud
No ratings yet
The Future of Network Security Is in The Cloud
25 pages
2023 Data, Analytics, and Artificial Intelligence Adoption Strategy-C
No ratings yet
2023 Data, Analytics, and Artificial Intelligence Adoption Strategy-C
10 pages
Decentralized Web Platform - Public
No ratings yet
Decentralized Web Platform - Public
18 pages
Utility Enterprise Architecture Best Practices - Webcast
No ratings yet
Utility Enterprise Architecture Best Practices - Webcast
13 pages
Presentation On Internet of Things
No ratings yet
Presentation On Internet of Things
32 pages
How To Build A Self-Service Data Analytics Stack Final - Google Docs Pdxule
No ratings yet
How To Build A Self-Service Data Analytics Stack Final - Google Docs Pdxule
12 pages
Remote Work Policy Template
100% (2)
Remote Work Policy Template
5 pages
Data As A Service The Future of Data Management
No ratings yet
Data As A Service The Future of Data Management
7 pages
Flexera One SaaS Management Solution Brief
No ratings yet
Flexera One SaaS Management Solution Brief
4 pages
2020 Data Center Roadmap Survey PDF
No ratings yet
2020 Data Center Roadmap Survey PDF
16 pages
Introduction To Wireless Communication Systems
No ratings yet
Introduction To Wireless Communication Systems
27 pages
Lecture 5. Industry 4.0 Technologies and Enterprise Architecture
No ratings yet
Lecture 5. Industry 4.0 Technologies and Enterprise Architecture
13 pages
Data Platforms Market Map 2019
No ratings yet
Data Platforms Market Map 2019
31 pages
Kubernetes ATC Kube Ebook-Final Feb 2020 PDF
No ratings yet
Kubernetes ATC Kube Ebook-Final Feb 2020 PDF
11 pages
Securing Generative Ai
No ratings yet
Securing Generative Ai
5 pages
Metadata Management On A Hadoop Eco-System: Whitepaper by
No ratings yet
Metadata Management On A Hadoop Eco-System: Whitepaper by
12 pages
Patron Client Politics and Its Implications For Good Governance in Bangladesh
No ratings yet
Patron Client Politics and Its Implications For Good Governance in Bangladesh
26 pages
5 Steps To Build A Business Case For Continuous Data Quality Assurance
100% (1)
5 Steps To Build A Business Case For Continuous Data Quality Assurance
11 pages
PLAYBOOK - Data & AI - Migrate and Modernize Data Estate
No ratings yet
PLAYBOOK - Data & AI - Migrate and Modernize Data Estate
5 pages
2023 Data, Analytics, and Artificial Intelligence Adoption Strategy-9
No ratings yet
2023 Data, Analytics, and Artificial Intelligence Adoption Strategy-9
5 pages
Data Ingestion Architecture For Telecom
No ratings yet
Data Ingestion Architecture For Telecom
10 pages
FRM Big Bazar
No ratings yet
FRM Big Bazar
31 pages
Data Governance
No ratings yet
Data Governance
7 pages
Smarter IT: Optimize IT Delivery, Accelerate Innovation: Inside
No ratings yet
Smarter IT: Optimize IT Delivery, Accelerate Innovation: Inside
15 pages
Full Stack Observability Aag
No ratings yet
Full Stack Observability Aag
3 pages
Palo Alto Network
No ratings yet
Palo Alto Network
4 pages
Introduction To Cloud Databases: Lecturer: Dr. Pavle Mogin
No ratings yet
Introduction To Cloud Databases: Lecturer: Dr. Pavle Mogin
23 pages
Beagle Research Moving OnPremise Contact Center Cloud
No ratings yet
Beagle Research Moving OnPremise Contact Center Cloud
11 pages
A COBIT 5 Overview
No ratings yet
A COBIT 5 Overview
19 pages
John Rennie Short-Global Dimensions - Space, Place and The Contemporary World (2004)
No ratings yet
John Rennie Short-Global Dimensions - Space, Place and The Contemporary World (2004)
192 pages
Blog 4 - Why Data Projects Fail - No Data Strategy 5.25.17
No ratings yet
Blog 4 - Why Data Projects Fail - No Data Strategy 5.25.17
4 pages
Understanding Kubernetes
100% (2)
Understanding Kubernetes
21 pages
Container Security Going Beyond Image Scanning
No ratings yet
Container Security Going Beyond Image Scanning
18 pages
(T-GCPAZURE-B) Module 3 - Virtual Machines in The Cloud
No ratings yet
(T-GCPAZURE-B) Module 3 - Virtual Machines in The Cloud
58 pages
European Vehicle Market Statistics
No ratings yet
European Vehicle Market Statistics
56 pages
Top 10 Guidelines For Deploying Modern Data Architecture For The Data Driven Enterprise
No ratings yet
Top 10 Guidelines For Deploying Modern Data Architecture For The Data Driven Enterprise
6 pages
(T-GCPAZURE-B) Module 2 - Getting Started With Google Cloud Platform
No ratings yet
(T-GCPAZURE-B) Module 2 - Getting Started With Google Cloud Platform
57 pages
Rubrik and Pure Storage FlashBlad
No ratings yet
Rubrik and Pure Storage FlashBlad
2 pages
HPC Summit Digital 2020: Gpu Experts Panel: Ampere Explained
No ratings yet
HPC Summit Digital 2020: Gpu Experts Panel: Ampere Explained
29 pages
Nvidia DGX Pod Data Center Reference Design
No ratings yet
Nvidia DGX Pod Data Center Reference Design
19 pages
MS 2012 - Features of Server 2012
No ratings yet
MS 2012 - Features of Server 2012
14 pages
Enterprise Information Security Architecture
No ratings yet
Enterprise Information Security Architecture
2 pages
Iso 50001 Self Assessment Questionnaire Web
No ratings yet
Iso 50001 Self Assessment Questionnaire Web
3 pages
Toefl
No ratings yet
Toefl
1 page
Bachelor of Arts With Education (Baed)
No ratings yet
Bachelor of Arts With Education (Baed)
8 pages
VMware Cloud On AWS Cheat Sheet
No ratings yet
VMware Cloud On AWS Cheat Sheet
1 page
Configuring Linux and Macs To Use Active Directory For Users-Groups-Kerberos Authentication and Even Group Policy
No ratings yet
Configuring Linux and Macs To Use Active Directory For Users-Groups-Kerberos Authentication and Even Group Policy
15 pages
Business Culture of Usa, China, Japan &india
100% (1)
Business Culture of Usa, China, Japan &india
17 pages
Accelerating Biological Research With Multicloud Kubernetes
No ratings yet
Accelerating Biological Research With Multicloud Kubernetes
3 pages
Cheesy Stuffed Eggplant Two Different Ways
No ratings yet
Cheesy Stuffed Eggplant Two Different Ways
6 pages
Sanjeev Taras: Key Result Area
No ratings yet
Sanjeev Taras: Key Result Area
3 pages
Iaf Ucav Rfi
No ratings yet
Iaf Ucav Rfi
4 pages
Citizenship Training
No ratings yet
Citizenship Training
4 pages
Strategic Management First Activity
No ratings yet
Strategic Management First Activity
3 pages
Vicente D. Millora For de Guzman. Jacinto Callanta For Private Respondent
No ratings yet
Vicente D. Millora For de Guzman. Jacinto Callanta For Private Respondent
3 pages
Ebensburg Plane Crash Report
No ratings yet
Ebensburg Plane Crash Report
3 pages
Competition Issues in Real Estate
No ratings yet
Competition Issues in Real Estate
17 pages
Brexit
No ratings yet
Brexit
11 pages
The Advent of Canine Performance Science-Offering A Sustainable Future For Working Dogs
No ratings yet
The Advent of Canine Performance Science-Offering A Sustainable Future For Working Dogs
9 pages
Stabilized Approach and Landing
No ratings yet
Stabilized Approach and Landing
2 pages
Fybba p1
No ratings yet
Fybba p1
1 page
Office of The SK Panlalawigang Pederasyon President: Activity Design
No ratings yet
Office of The SK Panlalawigang Pederasyon President: Activity Design
3 pages
People'S Pavilion: Eindhoven, The Netherlands
No ratings yet
People'S Pavilion: Eindhoven, The Netherlands
6 pages
Peer Evaluation of An Oral Presentation: Very Good 3 Satisfactory 2 Poor 1
No ratings yet
Peer Evaluation of An Oral Presentation: Very Good 3 Satisfactory 2 Poor 1
8 pages
ART Theoretical Framework On The Effectiveness of Training & Development
No ratings yet
ART Theoretical Framework On The Effectiveness of Training & Development
13 pages
Article-Swearing at Work
No ratings yet
Article-Swearing at Work
3 pages
KIPP Infinity - 2010-11 Student Lottery Application
No ratings yet
KIPP Infinity - 2010-11 Student Lottery Application
2 pages
Business Recognition of Human Rights: Global Patterns, Regional and Sectoral Variations
No ratings yet
Business Recognition of Human Rights: Global Patterns, Regional and Sectoral Variations
7 pages
Rhetorical Analysis
No ratings yet
Rhetorical Analysis
2 pages
The Challenges of Negotiation in English in An Romanian
No ratings yet
The Challenges of Negotiation in English in An Romanian
4 pages
Ehris Action Plan
No ratings yet
Ehris Action Plan
2 pages
Eshfaque Alam Dastagir
No ratings yet
Eshfaque Alam Dastagir
3 pages
Special Issue On Modelling Passenger Flows in Mult
No ratings yet
Special Issue On Modelling Passenger Flows in Mult
1 page