Whitepaper: Generative AI

Scaling Generative AI Requires Specific Platform Engineering Considerations

VULTR.COM
© Vultr 2024
Introduction

To better address the growing complexity around developing, deploying, and scaling traditional web applications at the edge, enterprises of all sizes are turning to platform engineering. Gartner predicts that by 2026, 80% of software engineering organizations will establish 'platform teams' as internal providers of components and tools for application delivery.

Concurrent with the rise of platform engineering is rapidly growing investment in GenAI. Deloitte predicts that 2024 enterprise spending on GenAI will increase by 30% from an estimated US $16 billion in 2023.

These two trends are on intersecting courses. Among enterprises pursuing both GenAI and platform engineering, platform teams should ensure they account for the unique requirements of scaling GenAI initiatives across the enterprise. In the next manifestation of digital transformation, these innovators have a head start, placing large language models (LLMs) at the core of business operations.
Requirements for supporting large
language model operations at scale
As large language model operations (LLMOps) mature across the business landscape,
it’s becoming increasingly common for enterprises to develop and deploy multiple
modestly-sized LLMs, each specialized for specific business processes, rather than
deploying a single LLM trained to manage all processes (like the massive models that
support ChatGPT and other popular GenAI applications). As such, enterprises need to
provide machine learning engineers (the LLM developers) with the infrastructure,
tools, services, and applications they need to optimize a complex LLMOps ecosystem.
- Specialize models: focus on developing multiple, smaller models to address specific business use cases.
- Ensure responsible AI practices by building observability into every phase of AIOps and LLMOps.
Must-have platform engineering
capabilities for MLOps and LLMOps
The emerging best practices around LLMOps demand a platform engineering
approach that can automate the provisioning and configuration of all the
resources ML engineers need to build, train, deploy, and optimize LLMs and
GenAI applications. Providing self-serve access to these resources frees
ML engineers to focus on the high-value development work they were hired
for, builds efficiencies into the workflows that support a multi-model GenAI
strategy, accelerates the LLMOps development cycle, and reduces the overall
time to value for the enterprise’s GenAI investments.
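To make the self-serve model concrete, the sketch below shows what a resource request from an ML engineer to an internal platform portal might look like. This is an illustration only: the request fields, the `GpuWorkspaceRequest` type, and the resulting spec format are hypothetical, not a real platform or Vultr API.

```python
from dataclasses import dataclass, asdict

# Hypothetical self-service request an ML engineer might submit through an
# internal platform portal; all field names here are illustrative assumptions.
@dataclass
class GpuWorkspaceRequest:
    project: str
    region: str      # deploy close to where the organization does business
    gpu_count: int   # GPUs for fine-tuning a modestly sized LLM
    framework: str   # preinstalled tooling, e.g. "pytorch"

def to_provisioning_spec(req: GpuWorkspaceRequest) -> dict:
    """Translate the engineer's request into the spec an automated
    provisioning pipeline would consume, with basic validation."""
    if req.gpu_count < 1:
        raise ValueError("at least one GPU is required for training workloads")
    return {"kind": "gpu-workspace", **asdict(req)}

spec = to_provisioning_spec(
    GpuWorkspaceRequest(
        project="support-bot", region="eu-west", gpu_count=2, framework="pytorch"
    )
)
print(spec["kind"])  # gpu-workspace
```

The point of the pattern is that the engineer declares intent (project, region, GPU budget) and the platform team's automation handles provisioning and configuration behind the scenes.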
Infrastructure optimization
Provide developers and ML engineers with easy access to edge infrastructure
components optimized for GenAI workloads, allowing for tightly integrated
CPU and GPU operations close to the geographic regions where the
organization is doing business.
Model observability
Build observability into every phase of LLMOps to ensure responsible AI
practices. This involves integrating monitoring and observability tools into
the platform engineering solution to track the performance of GenAI
models and ensure that they adhere to ethical and operational standards.
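One minimal way to build observability into each LLMOps phase is to wrap every model call so that latency, phase, and outcome are recorded for later review. The sketch below assumes an in-memory metrics sink and illustrative field names; a production setup would export these records to a monitoring tool instead.

```python
import time
from contextlib import contextmanager

# Illustrative in-memory sink; real systems would export to a monitoring tool.
METRICS: list[dict] = []

@contextmanager
def observe_llm_call(model_name: str, phase: str):
    """Record latency and outcome for one LLM call in a given LLMOps phase,
    giving responsible-AI reviews a basic audit trail."""
    start = time.perf_counter()
    record = {"model": model_name, "phase": phase, "status": "ok"}
    try:
        yield record  # caller can attach extra fields, e.g. token counts
    except Exception:
        record["status"] = "error"
        raise
    finally:
        record["latency_s"] = time.perf_counter() - start
        METRICS.append(record)

with observe_llm_call("support-bot-llm", phase="inference") as rec:
    rec["output_tokens"] = 42  # stand-in for a real model response

print(METRICS[0]["status"])  # ok
```

Because every phase (training, evaluation, inference) funnels through the same wrapper, performance drift and error rates become visible per model and per phase.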
SPOTLIGHT
What “good” looks like in enterprises
that have adopted platform engineering
While platform engineering for GenAI does not yet have a long history, specific
attributes already define platform engineering excellence.
Looking ahead
A final word
The most direct path to success in effectively scaling GenAI across the enterprise lies with a tailored platform engineering approach. Organizations that prioritize this will put themselves in the best position to future-proof their AI operations and establish a framework for sustainable innovation.