What Is A Data Fabric - IBM
What Is A Data Fabric - IBM
Related solutions
Resources
Next steps
Over the last decade, developments within hybrid cloud, artificial intelligence, the
internet of things (IoT), and edge computing have led to the exponential growth of big
data, creating even more complexity for enterprises to manage. This has made the
unification and governance of data environments an increasing priority as this growth
has created significant challenges, such as data silos, security risks, and general
bottlenecks to decision making.
Data management teams are addressing these challenges head on with data fabric
solutions. They are leveraging them to unify their disparate data systems, embed
governance, strengthen security and privacy measures, and provide more data
accessibility to workers, particularly their business users.
These data integration efforts via data fabrics allow for more holistic, data-centric
decision-making. Historically, an enterprise may have had different data platforms
aligned to specific lines of business. For example, you might have a HR data platform,
a supply chain data platform, and a customer data platform, which house data in
different and separate environments despite potential overlaps. However, a data
fabric can allow decision-makers to view this data more cohesively to better
understand the customer lifecycle, making connections between data that didn’t exist
before.
Data virtualization is one of the technologies that enables a data fabric approach.
Rather than physically moving the data from various on-premises and cloud sources
using the standard ETL (extract, transform, load) processes, a data virtualization tool
connects to the different sources, integrating only the metadata required and creating
a virtual data layer. This allows users to leverage the source data in real-time.
Analyst report
See why IBM is recognized as a leader in the Gartner Magic Quadrant for Augmented
Data Quality Solutions.
Data fabric architecture
By leveraging data services and APIs, data fabrics pull together data from legacy
systems, data lakes, data warehouses, sql databases, and apps, providing a holistic
view into business performance. In contrast to these individual data storage systems,
it aims to create more fluidity across data environments, attempting to counteract the
problem of data gravity—i.e. the idea that data becomes more difficult to move as it
grows in size. A data fabric abstracts away the technological complexities engaged for
data movement, transformation and integration, making all data available across the
enterprise.
Data fabric architectures operate around the idea of loosely coupling data in platforms
with applications that need it. One example of data fabric architecture in a multi-cloud
environment may look like the below, where one cloud, like AWS, manages data
ingestion and another platform, such as Azure, oversees data transformation and
consumption. Then, you might have a third vendor, like IBM Cloud Pak® for Data,
providing analytical services. The data fabric architecture stitches these environments
together to create a unified view of data.
That said, this is just one example. There isn’t one single data architecture for a data
fabric as different businesses have different needs. The various number of cloud
providers and data infrastructure implementations ensure variation across
businesses. However, businesses utilizing this type of data framework exhibit
commonalities across their architectures, which are unique to a data fabric. More
specifically, they have six fundamental components, which Forrester (link resides
outside ibm.com) describes in the “Enterprise Data Fabric Enables DataOps” report.
These six layers include the following:
1. Data Management layer: This is responsible for data governance and security of
data.
2. Data Ingestion Layer: This layer begins to stitch cloud data together, finding
connections between structured and unstructured data.
3. Data Processing: The data processing layer refines the data to ensure that only
relevant data is surfaced for data extraction.
4. Data Orchestration: This critical layer conducts some of the most important jobs
for the data fabric—transforming, integrating, and cleansing the data, making it
usable for teams across the business.
5. Data Discovery: This layer surfaces new opportunities to integrate disparate data
sources. For example, it might find ways to connect data in a supply chain data
mart and customer relationship management data system, enabling new
opportunities for product offers to clients or ways to improve customer
satisfaction.
6. Data Access: This layer allows for the consumption of data, ensuring the right
permissions for certain teams to comply with government regulations. Additionally,
this layer helps surface relevant data through the use of dashboards and other
data visualization tools.
– Customer profiles.
– Fraud detection.
– Preventative maintenance analysis.
– Return-to-work risk models, and more.
Related solutions
IBM Cloud Pak for Data is an open, extensible data platform that provides a data fabric to
make all data available for AI and analytics, on any cloud.
Build, run and manage AI models. Prepare data and build models on any cloud using
open source code or visual modeling. Predict and optimize your outcomes.
Learn about Db2 on Cloud, a fully managed SQL cloud database configured and optimized
for robust performance.
Resources
A Create a The future Weaving
blueprint strong of data data fabric
for data in data fabric into hybrid
a foundatio multicloud
multicloud n for AI
Read the Read the Listen to this Data fabric
world
report on smartpaper webinar as can help
how your on how to IBM and businesses
organization create a Gartner investing in
can move robust data discuss the AI, machine
beyond the foundation future and learning,
database— for AI by importance Internet of
and create a
Learn focusing
Read theon of data fabric
Watch Things,
Read and
the
data
morefabric three key
smartpa and
the its core edge
IBV
where the
about data
per components.
video computing
report
right
data data is management (44:11) get more
available to
strategy areas: value from
the right access, their data.
location and governance,
the right and privacy
application and
at the right compliance.
Predict outcomes faster using a platform built with data fabric architecture.
Collect, organize and analyze data, no matter where it resides. Find out how
IBM Cloud Pak for Data can improve your business’s data governance practices
across multi-cloud environments.