Six Data Integration Reference Architectures
Six Data Integration Reference Architectures
Six Data Integration Reference Architectures
Governance – Business leaders expect IT to track, maintain, and protect data • Profit increase (74%)
at every stage of the lifecycle Source: IDC, InfoBrief sponsored by Qlik, “Data as the
New Water: The Importance of Investing in Data and
Analytics Pipelines,” June 2020.
Agility – Leaders hope for the automation of the design, creation, and
continuous updating of data warehouses and data lakes on any cloud platform
to speed decision making
Qlik Data Integration meets these requirements and more. It’s the most effective data
integration for delivering all types of data from any source to the right people as quickly
and safely as possible. Qlik Data Integration automates the creation of data streams
and efficiently moves them to applications, data warehouses, and data lakes, delivering
business-ready data to Qlik Sense or other analytics solutions.
1 2 3 4 5 6
Data Warehouse Data Lake Data Lakehouse Event-Driven Event-Driven Data Mesh
Power agile data warehousing Continuously provide accurate, Architect for a single source of Data – Lambda Data – Kappa Form a decentralized and distributed
with automation to quickly design, timely, and trusted transactional truth for all analytic initiatives – Reliably update a data lake and Handle real-time data processing data fabric foundation where domain
build, deploy, manage, and catalog data sets for business analytics. artificial intelligence (AI), business efficiently train ML models using and continuous data reprocessing data product owners use common
purpose-built data warehouses Automate the entire data pipeline intelligence (BI), machine learning three layers. Understand the using a single stream processing data infrastructure via self-service to
(especially cloud-based) faster (from real-time data ingestion to (ML), streaming analytics, data batch layer, operating on all data, engine. develop pipelines that share data in a
than traditional solutions. the creation and provisioning of science, and more. produces the most accurate governed, open approach. Adhere to
Data engineers: Invest in less
analytics-ready datasets) without Data engineers: Unify both results but at high latency; the core principles: a.) domain-oriented,
expensive hardware and solve
Data engineers: Meet or exceed manual scripting. data lake and data warehouse speed layer shows real-time views decentralized data ownership and
multi-layered, Lambda architecture
the demands for analytics- Data engineers: Realize faster automation in one user interface in a low-latency, near real-time architecture for scale, b.) data-
redundancy by replaying data
ready data marts that enable data- return on data lake investments to plan and execute either model; and the serving layer as-a-product as an architectural
instead of maintaining two
driven insights at the speed of while confidently meeting growing with ease. supports queries from batch and unit (built, deployed, maintained),
code bases (batch and speed
change. demands for analytics-ready speed layer results. c.) self-service data infrastructure
layers) to process unique events
datasets in real time. for the autonomous creation and
Data engineers: Predict upcoming continuously in real time
consumption of data products,
events accurately. while meeting standard
and d.) federated governance
quality of service.
and interoperability standards to
aggregate and correlate independent
data products.
Data engineers: Derive value from
analytical data at scale while the
data landscape, use cases, and
responses constantly change.
REFERENCE ARCHITECTURE
1 Data Warehouse
Speed and simplify your data warehouse lifecycle – design, build, deploy, manage and catalog purpose-built
data warehouses – with automation for faster time to insights.
BUSINESS
BATCH LOAD Cloud Data Warehouse INTELLIGENCE 4 Custom Transformation
Create flexible, fit-for-purpose data
pipelines to transform raw data into data
MAINFRAME Application that is ready for analytics
Automation
DATA SCIENCE
5 Machine Learning to Enrich Data
AutoML to enrich data with prediction,
APPLICATIONS
scoring, classification and more
REVERSE ETL 8
RAW DATA CONFORMED DATA CONSUMABLE DATA 6 Data Profiling
MODERNIZATION Assess the quality and structure of data
FILES
CATALOG SYNC
sources to fix data quality issues and
promote good data governance
7 Qlik Analytics
Foundational Empower all your users to explore freely
Services at the speed of thought with hyperfast
SaaS AND ASSOCIATIVE AI – INSIGHT
MONITOR ORCHESTRATE GOVERN API HYBRID CLOUD ENGINE ADVISOR AI – AUTOML calculations, always in context, at scale
8 Reverse ETL
Replicating enriched data from the
warehouse back to the operational systems
Qlik Capabilities
of record
Others
2 Data Lake
Automate your data lake pipeline – from real-time ingestion to processing and refining raw data and making
it accessible to consumers – without writing code for greater speed and agility.
Qlik Analytics
7
Foundational
Empower all your users to explore freely
Services
at the speed of thought with hyperfast
MONITOR ORCHESTRATE GOVERN API
SaaS AND ASSOCIATIVE AI – INSIGHT
AI – AUTOML calculations, always in context, at scale
HYBRID CLOUD ENGINE ADVISOR
8 Reverse ETL
Replicating enriched data from the
warehouse back to the operational systems
Qlik Capabilities of record
Others
3 Data Lakehouse
Combine the low-cost, broad data access and structured management of data lakes and data warehouses to
apply the full, current data set toward business intelligence, data analytics and machine learning for flexibility,
simplicity, and efficiency. Catalog & Lineage
1
5 4 1 6 Discover, govern, and protect data
Data Ingestion Data Transformation Data Machine Catalog Qlik by leveraging a layer of common
Sources
Profile Learning & Lineage Analytics enterprise metadata
3 2 2
Customs Data Lake Data Warehouse
Transformations Automation Automation Data Lake & Warehouse Automation
2
Automate the process of providing
SAP
continuously updated, accurate, and
REAL-TIME (CDC)
PROFILE AND trusted data sets for business analytics
MANIPULATE ENRICH DATA
USER-DEFINED MULTI-ZONED MODEL-DRIVEN DATA
Other Custom Transformation
3
INCREMENTAL Initiatives Create flexible, fit-for-purpose data pipelines
DATABASE
LOAD to transform raw data into data that is ready
PUSHDOWN
<SQL> for analytics
Reverse ETL
7 Replicating enriched data from the warehouse
CATALOG SYNC
FILES back to the operational systems of record
Foundational
Qlik Capabilities Services
SaaS AND ASSOCIATIVE AI – INSIGHT
MONITOR ORCHESTRATE GOVERN API HYBRID CLOUD ENGINE ADVISOR AI – AUTOML
Others
MAINFRAME 4 Monitor
BUSINESS
INTELLIGENCE Monitor data ingestion tasks from a single
pane of glass view
Cloud Data Lake Cloud Data Warehouse
5 Orchestrate
FILES Orchestrate data ingestion tasks from a
single pane of glass view
APPLICATION MODERNIZATION
RAW DATA PERFORMANCE CONFORMED DATA CONFORMED DATA CONSUMABLE DATA
DATA
& LIVE VIEWS
Foundational
Qlik Capabilities Services
SaaS AND ASSOCIATIVE AI – INSIGHT
MONITOR ORCHESTRATE GOVERN API HYBRID CLOUD ENGINE ADVISOR AI – AUTOML
Others
4 5 6 Qlik Data Integration Reference Architectures | 8
REFERENCE ARCHITECTURE
2 Qlik Analytics
SAP Empower all your users to explore freely
1 at the speed of thought with hyperfast
calculations, always in context, at scale
REAL-TIME (CDC)
Other Monitor
3
KAFKA KINESIS EVENT HUBS PUB/SUB CONFLUENT Initiatives Monitor data ingestion tasks from a single
DATABASE
pane of glass view
4 Orchestrate
Orchestrate data ingestion tasks from a
Cloud Data Lake Cloud Data Warehouse BUSINESS single pane of glass view
MAINFRAME INTELLIGENCE
5 API
APIs to automate and integrate with
other applications for monitoring and
FILES orchestration
DATA SCIENCE
LOG DATA
RAW DATA PERFORMANCE CONFORMED DATA CONFORMED DATA CONSUMABLE DATA MODERNIZATION
& LIVE VIEWS
APPLICATION
DATA
Foundational
Services
SaaS AND ASSOCIATIVE AI – INSIGHT
MONITOR ORCHESTRATE GOVERN API HYBRID CLOUD ENGINE ADVISOR AI – AUTOML
Qlik Capabilities 3 4 5
Others
6 Data Mesh
Create a foundation for deriving value from analytical data at scale as your data landscape,
use cases, and responses continually change.
BATCH LOAD
BUSINESS
INTELLIGENCE 4 Machine Learning to Enrich Data
SaaS Data AutoML to enrich data with prediction,
Domain 1 scoring, classification and more
Application RAW LIVE PERFORMANCE CONSUMABLE CONFORMED
DATA VIEWS VIEWS DATA
Automation DATA Data Profiling
5
DATA SCIENCE
Assess the quality and structure of data
MAINFRAME sources to fix data quality issues and
promote good data governance
Data
Domain 2 6 Qlik Analytics
RAW CONFORMED CONSUMABLE
REVERSE ETL
DATA Empower all your users to explore freely
APPLICATIONS 7 DATA DATA MODERNIZATION
at the speed of thought with hyperfast
calculations, always in context, at scale
7 Reverse ETL
FILES CATALOG SYNC Data Replicating enriched data from the
Domain N warehouse back to the operational systems
of record
Foundational
Qlik Capabilities Services
SaaS AND ASSOCIATIVE AI – INSIGHT
MONITOR ORCHESTRATE GOVERN API HYBRID CLOUD ENGINE ADVISOR AI – AUTOML
Others
By quickly delivering data to the user without typical business friction, Qlik Data
Integration powers the agility necessary to drive needed business value out of
scattered and disparate data.
qlik.com
© 2024 QlikTech International AB. All company signs, names, logos, product names, and/or trade names referenced herein, whether or not appearing with the symbols ® or ™, are trademarks of QlikTech Inc or its affiliates. All other products, services, and company names mentioned herein may be trademarks of their
respective owners and are acknowledged as such. For a list of Qlik trademarks please visit: https://www.qlik.com/us/legal/trademarks 1-DM-04