0% found this document useful (0 votes)
45 views24 pages

Microsoft Governance

Uploaded by

Szabolcs Németh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
45 views24 pages

Microsoft Governance

Uploaded by

Szabolcs Németh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

White paper

Modern
Analytics, AI
and Governance
at Scale
Learn how a strategic framework for data
is the foundation for AI innovation
Bring on the era of AI with Microsoft 2

03 / 09 /
Executive summary Microsoft Fabric powers MA²G
11 Enterprise Data Governance
14 Microsoft Purview provides a unified data
governance solution

04 / 15
19
Data Management Foundation
Domains and Data Products
When your data is siloed, your
organisation is siloed
22 /
Copilots reduce the heavy lifting

06 /
What will it take to make your
business AI-ready?
23 /
LLM capabilities power your
generative AI applications

24 /
Get ready for the era of AI
Bring on the era of AI with Microsoft 3

Executive summary Data has long been the linchpin for


making digital transformations possible.
Now, after many years into this digital
future, we know simply having data isn’t
enough. Organisations need a system to
unlock the value of their data, to power
analytics and artificial intelligence (AI) that
help sharpen their competitive edge. They
also need to adapt their culture, people,
and processes so the organisation as
a whole is maximising data.

Based on hundreds of engagements


and conversations with customers,
Microsoft has developed a framework
for organisations to adopt a unified
analytics and AI ecosystem. Known
as Modern Analytics, AI and Governance
at Scale (MA²G), this framework addresses
hurdles like siloed data, poor governance
and manual data management. Built on
Microsoft Fabric, MA²G helps organisations
make sure their data is AI-ready and
allows business units to find relevant data
assets, without compromising enterprise
requirements.

Keep reading to learn more about MA²G


and what it can do for your organisation.

¹ ‘Footnote example, Title and release date aligned to the bottom in


segoe regular
Bring on the era of AI with Microsoft 4

When your data is siloed, your


organisation is siloed
In most cases, organisations struggle to implement an end-to-end analytics and AI ecosystem,
not because of the technology, but because they do not plan for, nor address, the role culture,
people and processes play in bringing about digital transformation. In fact, the MIT Technology
Review says nearly every time – 92% – an organisation struggles with data it links back to an issue
with the company’s data strategy, data governance and/or data management.1

This is because many organisations approach a data challenge with a technology solution – but
that only solves a fraction of the problem. Organisations need a wider and deeper focus on
solutions that drive cultural changes and align people and processes with technology.

Throughout hundreds of engagements with organisations worldwide to help them become data
driven, Microsoft has seen the following top challenges over and over. Here are three common
problems associated with culture, people and processes that impede a unified analytics and
AI ecosystem.

Lack of data strategy leads to a siloed ecosystem


After years of implementing different analytics projects in the cloud, organisations continue to
build out their ecosystem in a reactive, piecemeal way. Without a well-defined data strategy,
solutions become siloed and technical debt increases, which stands in the way of bigger data
and analytics innovation. The lack of an analytics foundation for all data also inhibits a thriving
ecosystem.

For example, imagine a data warehouse migration that lands data in proprietary data formats.
At the same time, Internet of Things (IoT) data is streamed into a data lake store. Data from these
two separate projects lands in separate data stores, causing siloed data. To become data driven,
the entire organisation must be able to build meaningful insights from both sources, regardless
of the boundaries between business unit, so they can access data of all attributes and types.

1
Building a high-performance data and AI organisation, MIT Technology Review, April 2021
Bring on the era of AI with Microsoft 5

Poor governance prevents democratised data


Data governance – which includes a clear set of policies, processes and controls – is critical for
organisations to find, manage and consume data across the business. A lack of data governance
and an incomplete understanding of the data can stymie analytics projects. Failure points can run
the gamut. Some projects can’t access data fast enough, others can’t integrate data due to the
lack of data relationships and others require additional data engineering work to be completed
before being able to train machine learning models.

All these problems can be avoided if the organisations have enterprise data governance that
provides the inventory and context of all data, automated processes to streamline workflows and
policies that automatically manage data access. The goal is to implement robust data governance
and data management that enables different analytics projects for different business units.

Manual data management slows time to insights


For data to be consumable, there’s a ton of ingestion and data engineering work that must
happen. It must be cleaned with approved data quality, integrated to reveal new business insights
and aggregated to become a proper data product. Most organisations tend to build manual data
engineering workflows on a case-by-case basis aligned to specific projects.

However, this creates fragmented data engineering solutions that become harder to maintain
as they grow to include thousands on top of thousands of pipelines. The worst part is, most data
engineering tasks are manual. Many organisations use their best people to perform these manual
tasks when it should be a process change instead. By implementing proper data management
processes with automation, organisations can reassign data engineers to more meaningful work
of business data modelling, data aggregations and calculations.
Bring on the era of AI with Microsoft 6

What will it take to make your business


AI-ready?
From chatbots supporting service interactions to generative AI creating sales content, AI is
everywhere. And demand for these capabilities is soaring. Just one year after ChatGPT launched,
54% of companies had already implemented generative AI in some area of their business.2

However, a paradox still exists when it comes to AI. While 78% of executives agree that AI is
a top business priority,3 they also recognise that data problems will likely stand in their way of
achieving their AI goals. This is because many organisations see AI and analytics as initiatives that
can be adopted by investing in technology solutions – but that’s just a small piece of the puzzle.
Organisations that successfully deploy AI and analytics also change processes, adapt their culture
and support their people to use these new capabilities in effective ways.

AI priorities for executives

Unifying our data


Scaling AI/ML use cases
platform for analytics
to create business value
78%
agree
is a top priority 68% agree
and AI is crucial to our
enterprise data strategy
Disagree 8%
Disagree 12%
Neutral 14%
Neutral 20%

We favour a multi-cloud Data problems are the


approach as a flexible most likely factor to
72%
agree
foundation for AI/ML 72% agree
jeopardise our AI/ML goals
Disagree 12% Disagree 9%
Neutral 16% Neutral 19%

2
2024 AI Business Predictions, PWC
3
CIO perspectives on generative AI, MIT Technology Review, July 2023
Bring on the era of AI with Microsoft 7

Modern Analytics, AI and Governance at Scale


The question every leader is asking themselves right now is: How can my organisation seize
the full potential of AI, while safeguarding my business, data and employees?

After hundreds of engagements with organisations worldwide, Microsoft has developed


a framework to not only adopt AI and analytics, but also use these technologies to their fullest
extent. It’s known as Modern Analytics, AI and Governance at Scale (MA²G) and it outlines the
foundational elements organisations need to ensure their data is AI-ready.

Get started with MA²G


MA²G was created to give organisations a framework for understanding what it takes to deploy
a successful end-to-end analytics and AI platform. It’s built on the following three pillars, which
describe the people, process and culture considerations organisations need to keep in mind for
AI and analytics to take root.

Enterprise Data Governance includes the set of policies and practices used to discover,
describe and manage data to accelerate responsible data democratisation. Data
governance ties together the data and analytics stack and automates data operations,
such as cataloguing, classification, creating lineage and applying security through
policies. Without it, organisations limit their ability to innovate and unlock new insights.
• Data Management Services • Data Order Service
• Governance • Rapid Access to Data
• Quality • On Premises or Azure
• Policy
• Lineage
• Classification
• Catalogue
Bring on the era of AI with Microsoft 8

Data Management Foundation involves the practices and processes that help you
create efficiencies with ingesting, storing, protecting and ultimately serving data to
different domains in the organisation.
• Self Service – Automation • Automated Data Operations
• Domain Provisioning • Data Virtualisation: Shortcuts and Mirroring
• Workspace Provisioning • Ingestion at Scale Solution
• Data Onboarding • Data Engineering Acceleration
• Automation
• Lakehouse and open data formats

Domains and Data Products describes the environments and services that enable
your business units to fully use their data. Allowing departments to self-serve data and
analytics enables non-technical users to access, analyse and build data insights or data
products on their own.
• Federation • AI Copilots
• Autonomous Business Units • Empowering Data Practitioners
• Domains, Workspaces, Capacities • Accelerating Report Creation
• Data Sharing / Collaboration • Enable Data Exploration
• AI Guided Insights
• Data Integration

Together, these three solution pillars combine to help organisations achieve MA²G. By first setting
business priorities as the guiding North Star, then implementing aspects from each solution pillar,
your organisation can shift into a whole new paradigm of doing business that empowers everyone
to work toward a common goal fuelled by data.
Bring on the era of AI with Microsoft 9

Microsoft Fabric powers MA²G


Based on the Microsoft Intelligent Data Platform, the MA²G architecture brings together the
best of Azure into a unified SaaS solution powered by Microsoft Fabric and Microsoft Purview,
and built on an open and governed data lakehouse. It includes support for Delta Parquet, and
emerging open standards like Hudi and Iceberg, to accommodate diverse investments.

Microsoft Fabric

Enterprise Governance,
Security, and Compliance
IT Finance Marketing
Fabric domain Fabric domain Fabric domain

OneLake Security

OneLake
Shortcuts Fabric storage Mirroring
Existing data lakes (Delta/Parquet) for proprietary data stores
Azure, AWS, Google

Innovation HR Operations
Fabric domain Fabric domain Fabric domain

A hub for all your data


Most organisations have analytics systems that are a labyrinth of specialised and disconnected
services. As businesses aim to adopt modern data capabilities, it’s increasingly important
to integrate all disparate data into a unified source. OneLake, incorporated into Fabric, unifies
all your data in a single, accessible location for comprehensive data management. Plus,
it’sbacked by built-in security and governance, ensuring the protection of all your data.
Bring on the era of AI with Microsoft 10

Simplified data preparation


For AI and machine learning models to be as accurate as possible, they must be built with clean
data and in a semi-structured way. Fabric simplifies data preparation and transformation so
that you can quickly get your data ready for custom AI deployments. Fabric includes analytics
experiences such as Data Engineering, Data Factory and Data Warehouse built into its SaaS-based
data platform so that different teams can all find the tools they need and work together.

AI-ready data that’s accessible to domains


Once you have a cleaned, robust dataset, you can start building AI and generative AI experiences
on top of your data. Fabric allows companies to organise data into domains where data
consumers can filter and fi nd content they need for AI or analytics. It also enables federated
governance so that each business unit or department can define its own rules and restrictions.
Bring on the era of AI with Microsoft 11

Enterprise Data Four ways to improve


enterprise data governance

Governance Create metadata: Well-governed


data includes metadata about its
Governance is essential to every lineage, profile, quality, business
organisation – regardless of the framework context and classification so it’s
or solution implemented – because it lays trusted and useful for all end
the bedrock for responsibly democratising parties.
data. Data governance translates your data
strategy into data ownership, rules and Map data assets: With a data
policies that improve data discoverability, map, data consumers can easily
confidence, security, compliance and and visually inspect all data
operational efficiencies. assets across all domains whether
it’s physically stored on premises
Over the years, data access and use or in the cloud.
has spread to all corners of companies
with individuals making critical business Catalogue data assets: Data
decisions that affect organisations, consumers use a data catalogue
customers and shareholders. You might to find all datasets, see if they’re
see the finance department visualising complete, and learn the relevant
billions of rows of risk data, while analysts business context associated with
in marketing identify customers for a new the data asset.
product. The range of disparate uses,
abuses and copies of data is leading Automate governance and
to widespread confusion and risk. Data security: Instead of manually
governance provides the glue that ties implementing data security for
together all the data in the analytics each data asset – which is prone
stack and ensures the right data is easily to errors and highly inefficient
accessible by the right people. – automated governance keeps
data secure while still ensuring
access of data to the right users.
Bring on the era of AI with Microsoft 12

Within Fabric, you can access a Purview Democratise data through


Hub Insights dashboard that shows you: self‑service features
To truly democratise data, organisations
• Overview report: See an overview
need to implement a solution where data
of distribution, use of endorsement
is discoverable through a data catalogue
and sensitivity labelling.
and domain users can request access
• Endorsement report: Drill down without opening a ticket. Microsoft
and analyse distribution and use Fabric and Microsoft Purview make this
of endorsement. possible. Domain teams (data analytics,
data products developers, data owners,
• Sensitivity report: Drill down
etc.) can browse a data catalogue in Fabric
and analyse distribution and use
or Purview to discover new data relevant
of sensitivity labelling.
to their use case.
• Inventory report: Get details
about labelled and endorsed items, Fabric displays information about the
then narrow down the results data, such as metadata with classification,
through date ranges and filters for lineage, business terms, related assets
workspace, item type, etc. and data owners to help domains make
decisions about which sources to explore.
• Items page: Find insights about the Once they select data sources, Fabric
distribution of items throughout makes it easy to grant access to domains
your organisation and endorsement if they do not have pre-established
coverage. permission. By combining Fabric and
• Sensitivity page: Discover Purview, you can govern your entire estate
insights about sensitivity labelling and lineage of data. From data source
throughout your entire organisation. down to Power BI reports, Purview and
Fabric work together seamlessly so you
can store, analyse and govern your data
without piecing together services from
multiple suppliers.
Bring on the era of AI with Microsoft 13

Govern and protect your data with integrated services


Governance is not only crucial to empowering your data
consumers, but also gaining their trust while meeting security and Microsoft Purview
compliance requirements. Fabric includes a set of capabilities that provides a unified
help you know, protect, manage and monitor your organisation’s data governance
sensitive information. It works in tandem with Purview toS help solution to help
govern and manage your entire data estate. Fabric governance manage and govern
and compliance is tightly integrated with Purview, allowing you your data estate.
to create a holistic, up-to-date map of your data landscape Learn more >>
with automated data discovery, sensitive data classification and
end‑to‑end data lineage.
Bring on the era of AI with Microsoft 14

Microsoft Purview provides a unified


data governance solution
Microsoft Purview allows you to create a holistic, up-to-date map of your data landscape with
automated data discovery, sensitive data classification and end-to-end data lineage. Enable
business units to access valuable, trustworthy data management and take advantage of the
following capabilities:

• Support for multi-cloud data estates: Automatically scan and catalogue all data assets –
including machine learning models and Power BI reports – across the organisation, whether
they’re on premises, in Azure or running on other public clouds.

• Governance experience: Develop clear role definitions for administrators, domain creators,
data health owners and data health readers.

• Business-friendly terminology: Assign language that follows the data governance experience
through data products, domains, quality assessments and reports.

• Data scan and search: Find the data you need across your entire estate and profile data at the
source to indicate attributes like min, max, average and thresholds.

• Data quality scores: Generate data quality scores once rules and policies are applied, giving
you insights into your data quality relative to your business rules.

• Metadata analysis: Capture metadata and data lineage to help personas to decide if data is
usable, then use profiling or data quality scans for recommendations.

• Data health controls: Ensure your rules and indicators reflect the unique standards of your
organisation with a set of cloud data management controls.

• Summarised insights: Showcase the overall health of your governed data estate with built-in
data governance reports.

• Pre-built integrations: Extend the value of Purview with integrations for solutions related to
master data management and data lineage.
Bring on the era of AI with Microsoft 15

Data Management Foundation


The purpose of data management is to ensure that data is properly collected, stored, processed,
analysed and used in a secure and efficient manner to support an organisation’s goals and
objectives. Yet, less than one-quarter of organisations have a consistent, global data management
strategy in place.

Implementing automation, frameworks, and services can help organisations bolster their data
management practices. With a data management foundation that uses Microsoft Fabric and
OneLake, you gain both an open and governed data lakehouse for storing data, as well as
automated data virtualisation that efficiently sends data to domains without overburdening
IT teams.

Microsoft Fabric

Admin Console: Fabric Domains

Domain Provisioning IT Finance Marketing

Workspace Provisioning Innovation Operations

Data Onboarding
OneLake
Shortcuts allow instant linking of data
already in Azure and other clouds, Shortcuts Fabric Storage Mirroring
Existing Delta and for proprietary
without any data duplication and
data lakes Parquet data stores
movement.
Azure, AWS, Google
Mirroring is a feature that offers
continuous and seamless access to and
replication of data from database or
data warehouse with no ETL required.
Bring on the era of AI with Microsoft 16

Every workload works with OneCopy and open formats


All compute engines – including Spark, T-SQL and KQL – automatically store their data in OneLake
in a single common format. Once data is stored in the lake, it’s accessible to all engines and does
not have to be imported or exported. Each compute type has been fully optimised to work with
Delta and Parquet as their native format and a shared.

An easier way to onboard data


Fabric helps eliminate data pipelines through capabilities such as shortcuts and mirroring, which
bring your data into one platform, without the legwork. In addition, you can also use partner
solutions that work with Fabric connectors to move data between stores.

Shortcuts: OneLake shortcuts let you easily onboard data by instantly linking data that already
exists in Azure or other clouds through a unified namespace. This eliminates data duplication or
movement, reducing latency associated with data copies and staging.

• A shortcut is a symbolic link which points from one data location to another

• Shortcuts make data from a warehouse part of your lakehouse

• You can consolidate data across items or workspaces without changing the data ownership

• Data can be reused multiple times without data duplication

• Existing ADLS Gen2 storage accounts and Amazon S3 buckets can be managed externally to
Fabric and Microsoft while still being virtualised into OneLake with shortcuts

• All data is mapped to a unified namespace and can be accessed using the same APIs, including
the ADLS Gen2 DFS APIs
Bring on the era of AI with Microsoft 17

Industry solutions: Fabric includes pre-built, industry-specific solutions that help organisations
integrate data from different sources and use rich analytics. Data solutions combine data
integration services and, in some cases, machine learning support, so organisations can face
industry-specific data challenges. These solutions include retail, healthcare, sustainability
and more.

Mirroring: Fabric offers a mirroring feature that provides continuous and seamless access to –
and replication of – data from databases or data warehouses, without ETL. Any database can be
accessed and managed via Fabric without having to switch database clients. By just providing
connection details, your database is instantly available in Fabric as a Mirrored database.

• A full editing experience of the source database is available for the Mirrored database

• Data is replicated into OneLake in Delta format and kept up to date in near real time

• All the Fabric experiences instantly work with the OneLake replica

• Analysts and data scientists can work with real-time data

• The replica protects operational databases from analytical queries

Automated data management


For any sources that don’t have mirroring or shortcuts, you can still automatically ingest data
into the ecosystem. Automated data services and templates can help improve efficiency around
data ingestion, standardisation, quality, metadata registration and access provisioning. These
enterprise-level capabilities allow data foundation teams to minimise repetitive, manual work
and they create a foundation in OneLake for domain teams to self-serve data.
Bring on the era of AI with Microsoft 18

Data-agnostic ingestion service


• Pull/push
On premises Azure • Format agnostic
or other Data-agnostic ingestion service • Data agnostic
cloud provider • Metadata driven

Data standardisation service


Pull • Analytics format conversion
Data sources Bronze Silver Gold
• Versioning
• Merging
• PII handling
Push
Generic
Data pipeline Drop Zone • Data quality
Confidential • Common Data Model
• Master data unification
• Synchronous processing
• GDPR

Data-agnostic ingestion
Automatically ingest data regardless of its attribute, format and the domain it belongs to.
Organisations can push or pull data from different sources then process it. Metadata-ingestion
frameworks or Kafka-based solutions are sample solutions that can be implemented to automate
this process.

Data standardisation
As your data gets ingested, you can standardise it through processes such as format conversions,
versioning, merging, PII handing and master data management. Use Apache Spark notebooks
within Fabric to quickly implement data standardisation practices. Additional services related
to data quality management address issues such as deduplication, threshold identification and
alignment with master data. Without proper checks on data quality, you run the risk of slowing
down time to insights.

Metadata registration and access provisioning


With your data in OneLake, the next step is registering the new data assets in your data catalogue
so they’re instantly discoverable. As a safety net, data governance scheduled scanning of the
data hub should register these new assets. Once data is added, another automated service can
provision access according to the data classification of data being ingested.
Bring on the era of AI with Microsoft 19

Domains and Data Products


Organisations are shifting their approach from running a centre of excellence – where everything
is centrally controlled – to using federated domains to provide departments more control and
autonomy.

With Microsoft Fabric, every data workload is available in


one SaaS experience for all personas in the organisation.

Domains and workspaces makes collaboration happen


Domain provisioning: Fabric allows you to group data into a domain so that users can find
the resources they need that are relevant to their field. For instance, you can create domains
by business department such as HR or finance, allowing those teams to manage their data
according to their specific regulations, restrictions and needs.

Workspace provisioning: You can create a workspace to collaborate with teammates in your
domain and create collections of items such as lakehouses, warehouses and reports.

Microsoft Fabric experiences promote org-wide collaboration


Because it’s built on a single, unified SaaS platform, Fabric brings together Power BI, Azure
Synapse and Azure Data Factory so data teams can collaborate in a single workspace, on
the same copy of data. Fabric provides each domain with core experiences designed to work
together seamlessly.
Bring on the era of AI with Microsoft 20

Each experience is tailored to a specific persona and a specific task, allowing different domains
to find the tools they need to create their own data products.

Microsoft Fabric Domains

Data Factory Synapse Data Synapse Synapse Data Synapse Real-Time Power BI Data Activator
Engineering Data Science Warehouse Analytics

Workspace 1 Workspace 2 Workspace 3

Fabric Capacity Fabric Capacity

OneLake

Data Factory offers a modern data integration experience to ingest, prepare and
transform data from a rich set of data sources. Data Factory brings Fast Copy capabilities
to both dataflows and data pipelines so you can move data between your lakehouse and
data warehouse in Fabric at blazing speed.

Synapse Data Engineering provides a world class Spark platform with great authoring
experiences, enabling data engineers to perform large scale data transformation and
democratise data through the lakehouse. The Spark integration with Data Factory also
enables notebooks and Spark jobs to be scheduled and orchestrated.

Synapse Data Science allows you to build, deploy and operationalise machine learning
models within your Fabric experience. It integrates with Azure Machine Learning to
provide built-in experiment tracking and model registry.
Bring on the era of AI with Microsoft 21

Synapse Data Warehouse provides industry-leading SQL performance and scale. It fully
separates compute from storage, allowing independent scaling of both the components.
Additionally, it natively stores data in the open Delta Lake format.

Synapse Real-Time Analytics gives you a way to focus and scale up your analytics
solution while democratising data for both citizen data scientists and advanced data
engineers. As a fully managed big data analytics platform, Real-Time Analytics utilises
a query language and engine so you can search structured, semi-structured and
unstructured data.

Power BI provides business owners the ability to access all their data in Fabric quickly
and intuitively to make better decisions with data. This experience allows organisations
to turn unrelated data sources into coherent, visually immersive and interactive insights.

Data Activator monitors data in Power BI reports and automatically takes actions when
certain patterns or conditions are detected. This allows you to build a digital nervous
system that acts across all your data, at scale and in a timely manner.
Bring on the era of AI with Microsoft 22

Copilots reduce the heavy lifting


Microsoft Fabric includes several copilots that act as interactive aides, lightening the load on
engineers, scientists and analysts so they can expedite the journey from raw data to meaningful
insights. From data preparation to report building, Copilot and other generative AI features offer
new ways to analyse data, generate code and create visualisations in Fabric and Power BI.

Copilot for Data Science and Data Engineering provides intelligent code completion,
automates routine tasks and supplies industry-standard code templates to facilitate
taskslike data enrichment and the creation of analytical models. Copilot offers
contextual code suggestions and prompts that adapt to specific tasks, helping you code
more effectively and with greater ease.

Copilot for Data Factory supports both citizen and professional data wranglers in
streamlining their workflow. It provides intelligent code to transform data, as well as
code explanations to help you understand complex tasks.

Copilot for Power BI allows you to create Power BI reports automatically. You can
generate summaries of existing reports or ask for suggestions on which reports to create
based on your data. Prompts like ‘Create a page to examine next month’s forecast’ yield
visualisations that help you spot trends and patterns quickly.
Bring on the era of AI with Microsoft 23

LLM capabilities power your generative


AI applications
As a platform for analytics and AI, Microsoft Fabric is well suited to support the use of large
language models (LLMs) for the creation of generative AI applications. Fabric and SynapseML
offer unique LLM capabilities so you can build solutions that can handle question-and-answer
tasks or document summaries, for example.

Extract insights from unstructured data: Use Fabric to tap into information stored in
unstructured documents like PDFs. You can load PDF documents into a Spark DataFrame, read the
documents using the Azure AI Document Intelligence in Azure AI Services and use SynapseML to
split the documents into chunks.

Integrate Azure OpenAI: Apply LLMs at scale by integrating Azure OpenAI Service
and SynapseML. Azure OpenAI can be used to solve natural language tasks by prompting
the completion API. Through SynapseML, you can use Apache Spark distributed computing
framework to easily process millions of prompts.

Generate embeddings: Connect Azure OpenAI Service and use SynapseML to generate
embeddings in a distributed manner that allows you to efficiently process large volumes of data.
You can also store the embeddings in a vector store using Azure AI Search and search the vector
store to answer users’ questions.
Bring on the era of AI with Microsoft 24

Get ready for


As organisations forge ahead in the era
of AI, clean data from well-managed
and highly integrated analytics systems

the era of AI is critical. Frameworks like MA²G make


sure your systems support governance,
data management and domains, allowing
organisation to create customised AI and
analytics experiences. Through Microsoft
Fabric, all the data and analytics tools you
need are available in one, end-to-end
platform.

Interested in learning more about


MA²G and Fabric?

Reach out to an Azure


sales specialist or
your Microsoft sales
representative for best
practices on analytics, help getting started
with Fabric and more. Or ask about
visiting a local Executive Briefing Centre or
Microsoft Training Centre.

© 2024 Microsoft Corporation. All rights reserved. This


document is provided ’as is’. Information and views
expressed in this document, including URL and other
Internet website references, may change without notice.
You bear the risk of using it. This document does not
provide you with any legal rights to any intellectual
property in any Microsoft product. You may copy and
use this document for your internal, reference purposes.

You might also like