Transform Data Into Actionable Insights
Transform Data Into Actionable Insights
Introduction to
Intelligent Data Platforms
Introduction to Intelligent Data Platforms 2
Analytics.....................................................................................................................................................................7
Data ingestion.....................................................................................................................................................................8
Analytical data storage....................................................................................................................................................8
Data visualization..............................................................................................................................................................9
Data governance....................................................................................................................................................11
Cataloging data and building the business glossary...........................................................................................11
Capturing data lineage.................................................................................................................................................. 12
Hybrid capabilities................................................................................................................................................12
Conclusion..........................................................................................................................................17
Introduction to Intelligent Data Platforms 3
Introduction
Data is essential to businesses today. With the explosion of digital information and the
increasing availability of powerful analytic tools, companies can leverage data to gain
deep insights into their operations, customers and markets. By harnessing the power of
data, businesses can optimise their processes, make informed decisions and identify new
opportunities for growth and innovation. For example, a 2021 survey of IT and senior business
decision makers conducted by Vanson Bourne found that the average profit growth for
companies with an enterprise data strategy in place for longer than a year was 5.97%, compared
to just 2.50% for ones with no enterprise data strategy1.
Many organisations have stitched together a collection of point solutions to meet their
evolving data needs. For example, one supplier for the operational database, another for the
analytical platform and a separate tool for data integration. Each solution may have its own
tools, processes and interfaces, requiring different management and maintenance efforts.
Integrating data across silos can be expensive and time-consuming for teams, requiring
complex engineering to keep data synchronised. Implementing data governance to gain a
comprehensive understanding of the storage and utilisation of critical data in reporting can also
present significant challenges. Different solutions with varying security features and protocols
can make it difficult to keep data secure. Obtaining timely and accurate data needed for AI
analysis may not be possible. Ultimately, the lack of integration and standardisation can affect an
organisation’s ability to manage and analyse its data efficiently, resulting in missed opportunities
and suboptimal decision-making.
An intelligent data platform designed for modern data can easily overcome these obstacles.
It encompasses data governance, data management, data analysis and the ability to power AI
workloads. By taking a holistic approach to data, businesses can realise the full potential of data
assets and achieve success in today’s digital economy. Whether the data is on-premises, in the
public or private cloud, structured or unstructured, gigabytes of data or petabytes or anywhere
in between, an intelligent data platform can support the data strategy of any organisation.
Data governance
AI
Analytics
Operational databases
Governance
Stream
Analytic
processing
Big data
Real-time Data
Data processing
message transformation
sources ingestion and
and modelling
management
Orchestration
● Operational databases: Stores, manages and tracks business information such as sales,
products and customer and employee information.
● Analytics: The tools for turning raw data into insights to make better decisions. It may
include copying data from operational databases, transforming its format and visualising
the data.
● AI and machine learning: Using algorithms to identify patterns in data to predict future
outcomes.
● Data governance: Processes, policies, roles and standards to ensure effective and efficient
use of information.
These features need to be supported by a foundation of security. Using them together allows an
organisation to get the most out of its data. This enables an organisation to work efficiently in
terms of the effort required and the computing capacity consumed.
The only constant for any organisation is change. Whether growing a business to meet customer
demands better, acquiring new businesses, meeting new regulatory requirements or responding
to an executive’s request for data to help make an important decision, a data platform will
continue to evolve. An intelligent data platform helps businesses respond faster to stay ahead of
the competition.
Time and money are your scarcest resources. You want to make sure
you’re allocating them in the highest-impact areas. Data reveals impact,
and with data, you can bring more science to your decisions.
Operational databases
An operational database is designed to manage and store data for day-to-day business
operations. These databases are optimised for fast and efficient data retrieval and updates,
making them ideal for transactional applications such as online retail or banking.
One of the difficulties organisations may face with operational databases is managing their data
estate. An organisation may use dozens or even hundreds of operational databases. Knowing
what data is stored in each can be critical for compliance with regulations such as General
Data Protection (GDPR), California Consumer Privacy Act (CCPA) and similar rules throughout
the world.
Another obstacle can be harnessing the data in an operational database for analysis and
visualisation. Analysis may require copying some of the operational data so it can be
restructured or combined with additional operational or external data. For example, an
organisation might combine sales data with weather data to help forecast the inventory based
on the predicted weather. However, keeping the data synchronised between operational and
analytical data stores can be a complex process.
Analytics
Analytics allows businesses to track progress, identify opportunities and make informed
decisions. By analysing data, businesses can uncover trends and patterns that may point to new
opportunities for growth. Analytics can also help companies improve their operational efficiency
and increase their profitability.
Three primary areas of analytics make up the key components of an intelligent data platform:
● Data ingestion
● Data visualisation
These components must work together seamlessly to implement an effective analytics solution.
Introduction to Intelligent Data Platforms 8
Integration of analytics with the other components of the data platform, such as operational
databases, data governance and security, is also critical. The term ‘integration tax’ refers to
the cost of integrating different data sources and systems into a single platform. Without an
intelligent data platform, the integration tax can be significant and often prevents organisations
from fully taking advantage of their data.
Data ingestion
Data ingestion is the process of obtaining and importing data from various sources to a storage
medium where it can be accessed, used and analysed by an organisation. Data can be streamed
in real time or ingested in batches.
● Raw: Data that is simply a copy of the data from the source system.
● Refined: Data that is cleaned up. For example, if the source data from the cash registers in
a retail store, removing voided transactions would refine the data.
● Curated: Data that is summarised for reporting and analysis purposes. For example, if sales
data is to be analysed at the store level, aggregated data from all of the cash registers in a
store would count as curated data.
An intelligent data platform makes data ingestion and data integration easier by simplifying
the process of moving data throughout the platform. For example, tables can be automatically
copied from an operational database to analytical storage, avoiding the need to write code for
each table.
● Isolation from the operational database to ensure that analytical processing does not
impact the operations of the system by competing for resources or blocking updates.
● The ability to combine data from multiple data sources. For example, a utility bill may also
show temperature data along with usage.
The analytical data storage could be a data warehouse that uses a relational database model,
a data lake or a data lakehouse, which combines attributes of both a data warehouse and
a data lake.
An intelligent data platform supports all of these models, so an organisation can use any of
them or a combination that makes the most sense. For example, suppose an organisation is
trying to measure greenhouse gas emissions from an industrial operation. Sensor data may
initially be stored in a data lake. Once the data is summarised, the aggregated data may be
stored in a relational data warehouse for reporting and visualisation.
Data visualisation
A good data visualisation tool can help tell a story with data. This critical part of any intelligent
data platform allows for translating the volumes of data collected, curated and enriched into a
visual representation that business users understand.
A data visualisation tool should also support data democratisation, the principle and process
of making company data easily accessible to all members of an organisation. The primary goal
of data democratisation is to empower any user, technical or non-technical, to uncover insights
in the data to make better-informed business decisions and gain a competitive advantage.
Allowing people across the organisation to access the data they need and act on it can be critical
to a company’s success.
Data visualisation tools allow the creation of executive dashboards showing overall data and
then allow the user to manipulate it in various ways, such as looking at sales by date, geographic
region, department or product category. Some data visualisation tools also incorporate
advanced analytics features such as natural language processing to let users ask a question
about data and instantly get an answer. Another example of an AI feature that may be included
in data visualisation tools is the ability to detect anomalies or outliers in the data.
Data visualisation tools may store a copy of the data to allow high-performance interactions.
Others issue queries directly against the source system. Some allow for both options.
There are a tremendous number of options to consider within the analytics area. It is important
to consider each component not only individually, but also how they will work together with the
other parts of the analytics solution and overall data platform. For example, seamless integration
between the analytics data storage and the data visualisation tool will allow developers to spend
more time creating the visualisations the organisation needs and less time massaging data.
Introduction to Intelligent Data Platforms 10
One example of machine learning is a mobile phone company that is interested in predicting
which customers are likely to switch providers. There will likely be several attributes that
could affect the outcome, including customer demographics, the number of times they called
customer service, the number of lines on their plan and how recently they upgraded their
phone. The machine learning algorithm learns from this data what attributes increase or
decrease the likelihood of an outcome. Then new data that does not yet have an outcome can
be provided to the algorithm, and the algorithm will predict what the outcome is likely to be.
A natural language processing (NLP) model is another type of AI. It is a deep learning algorithm
that can recognise, summarise, translate, predict and generate text and other content using
knowledge gained from massive datasets. There are many applications for NLP models including
summarising news and email, translating text, customer service chatbots and coding.
For example, a wealth management organisation could use NLP to allow their financial advisers
to ask questions and receive easily digestible answers based exclusively on the firm’s internal
content. In addition, AI and machine learning can be used to support day-to-day operations,
for example, by detecting potentially fraudulent credit card transactions or enriching data for
analytics by evaluating whether the sentiment of a comment is positive, negative or neutral.
Introduction to Intelligent Data Platforms 11
Data governance
Data governance refers to the collection of processes, policies, roles, metrics and standards
necessary to manage data effectively. By implementing sound data governance practices,
organisations can establish robust data management solutions to ensure their data is secure,
private, accurate and usable throughout the data life cycle.
A robust data governance solution provides all decision-makers within an organisation with a
single source of truth, i.e., the same data sets, terminology and view, giving more opportunities
for internal flexibility. It also ensures clean data management throughout an organisation’s
governance process, meaning procedures correctly generate, handle and protect the data to
keep it in compliance.
An intelligent data platform provides a unified data governance solution to help manage and
govern on-premises, multicloud and Software-as-a-Service (SaaS) data. It lets organisations
create a holistic, up-to-date map of their data landscape with automated data discovery,
sensitive data classification and end-to-end data lineage.
When it comes to data, the governance of that data is also catalogued in the business glossary.
When an employee looks up ‘customer sales data’, they should be able to see exactly what
that data is, who is responsible for that data, whether or not they can access it and the kinds of
reports it’s used for. A business glossary helps improve communication. When all employees
have an agreed-upon meaning of a term, they can use it confidently to communicate and
collaborate across all parts of the organisation.
Introduction to Intelligent Data Platforms 12
• 71% have four or fewer data stewards and 26% have none
IDC, Chief Data Officers: The New Business Leaders, August 2020
● Why did we collect the data? ● Where was the data collected?
● When was the data collected? ● Who did we collect the data from?
Data lineage tools, such as those found in Microsoft Purview, a part of the Microsoft Intelligent
Data Platform, can visually represent this lineage. The goal of a data catalogue is for all the
data systems and data assets an organisation controls to connect and report their lineage to
the catalogue. Once this lineage metadata is available to the data catalogue, the catalogue can
present it to employees to help them build informed data governance use cases.
Hybrid capabilities
To accelerate time to insight and operational efficiency, organisations are looking for a platform,
typically a cloud solution, that can help streamline data processes. However, many customers
need to keep some of their applications and data stores on-premises. They may have regulatory
and compliance reasons, or in certain industries, such as manufacturing, they may need data
at the edge to be ingested, stored and analysed right away. In fact, most enterprise customers
looking to realise value with data and AI will have a hybrid and/or multi-cloud environment.
These organisations are also looking for centralised governance of their data estate, applications
and databases as well as a platform that allows them to discover, catalogue, ingest, analyse
and govern all their data – no matter where the data lives and what type of data they have. It is
important for an intelligent data platform to have the governance and security capabilities to
support a hybrid model.
Introduction to Intelligent Data Platforms 13
Let’s consider a hospital that has a database supporting an application that monitors patients’
vital signs. The operational database needs to be in the hospital to mitigate the risk of a wide
area network outage. However, the scalability and tools available in the public cloud make it
a better option for analysing the data, looking for trends and ensuring the hospital’s quality
protocols that ensure patient health are followed. An intelligent data platform makes it easy to
work with data, wherever it may be, in a governed and secure fashion.
Network security
Access management
Threat protection
Information protection
Customer data
Network security
The network is the first line of defence to protect an organisation’s data. It ensures that data
access is only allowed from specified network locations. For example, access to a database that
supports a company’s website may be restricted to the web servers and a small number of
other addresses required for database administration. Outbound traffic can also be restricted to
prevent data exfiltration. Protecting data assets using network access greatly reduces the risk of
brute-force attacks.
Introduction to Intelligent Data Platforms 14
Access management
Access management has two components:
● Authentication: This is the process of proving the user is who they claim to be. A directory
mechanism is the most common method for authentication. It provides capabilities such
as multi-factor authentication (MFA) and conditional access. It also provides the benefit of
centralised administration.
Threat protection
Threat protection can be used to track database activities and analyse logs to detect unusual
behaviour and potential exploits. Alerts can be created for suspicious activities such as SQL
injection, potential data exfiltration and brute-force attacks. Details of suspicious activities and
recommendations can be viewed through cloud security posture management (CSPM) or a
cloud workload protection platform (CWPP).
Key vault
A key vault is a cloud service for securely storing and accessing secrets. A secret is anything that
requires tightly controlled access, such as API keys, passwords, certificates or cryptographic keys.
Using a key vault helps avoid storing passwords or other secrets in code or configuration files
that can be exposed to unauthorised use. Instead, an administrator stores the secrets in the key
vault and authorises users or applications to access specific secrets. The secrets are encrypted in
transit between the key vault and the client, and access to the key vault is logged.
Introduction to Intelligent Data Platforms 15
Data innovation leaders, those with the most mature practices around
using data to improve products, services, customer experience, etc.,
are finding bottom-line success that eludes less mature organisations.
The technology that allows an intelligent data platform to deliver these benefits includes:
● The virtually unlimited scale of the cloud provides computational power and storage
power that can help process even the largest volumes of data.
● The ability to store the data in a data warehouse, data lake or data lakehouse (which
combines features of both a warehouse and a lake) makes it easy to consume the data,
whether it is for reporting and visualisation or for AI and machine learning.
● Data visualisation allows decision-makers to easily view the data they need to make the
best decisions for the organisation.
● Integrated AI and machine learning capabilities to help organisations unlock insights and
discover intelligent connections.
No matter how an organisation uses data, an intelligent data platform allows an organisation
to focus more on delivering results and less on tedious data management and data movement
activities.
Introduction to Intelligent Data Platforms 17
Conclusion
The value of data to a business is clear. An intelligent data platform harnesses data to help serve
customers better and make more timely decisions, whether that data is structured or streaming,
on-premises or in the cloud or gigabytes or petabytes in size.
The analytics capabilities of an intelligent data platform support any use case, whether it is
analysing what has already happened or using AI to predict what is most likely to occur in the
future based on the data.
Data visualisation capabilities provide easy to understand reports and dashboards to users
throughout the organisation, whether they need summarised information or detailed data to
make the best decisions.
Finally, security is implemented throughout the intelligent data platform with multiple layers to
ensure that data is protected.
Save
up to 90% Save
up to 59%
Azure SQL Managed Instance Azure Synapse Analytics Microsoft Intelligent
costs up to 90% less than costs up to 59% less than Data Platform
Amazon RDS for SQL Server. Snowflake. Customer Stories
Learn more Learn more Explore