Data Architectures in Azure For Analytics & Big Data: October 20, 2018
Data Architectures in Azure For Analytics & Big Data: October 20, 2018
in Azure
for Analytics & Big Data
October 20, 2018
Melissa Coates
Source: https://docs.microsoft.com/en-us/azure/architecture/data-guide/big-data/
Azure
Source: http://azureplatform.azurewebsites.net/
A public cloud
computing
platform and
infrastructure for
building,
deploying, and
managing MSFT-
specific and third
party software and
services through a
global network of
Microsoft-
managed
datacenters
Azure Technologies
Data Storage: Relational Databases
IaaS PaaS
Infrastructure as a Service Platform as a Service
SMP MPP
Symmetric Multi-Processing Massively Parallel Processing
Managed Database
Managed Instance
Relational database of your Azure SQL
Azure SQL Database
choice in a virtual machine Data Warehouse
Multi-Modal
Multi-Model
HDInsight
HBase Azure
Cluster Table Storage
Azure
CosmosDB
HDInsight HDInsight
Interactive Hadoop Cluster
Query Cluster
(Hive LLAP)
Compute: Streaming & Event Processing
PaaS
Platform as a Service
HDInsight: Azure
Spark, Hive, Pig, Automation
Scoop, Oozie
Reference Architectures
Azure Blob
Storage
Reporting &
Analysis Tools
Power BI
Azure
Azure Analysis Services
SQL
Database Excel
Enterprise Data Warehousing and BI
Multi-Structured Data
Data Mart(s)
Azure Data
Lake Storage
Azure
DW: Structured Data SQL
Azure SQL Database
Data Warehouse
Power BI
Semantic Layer
Azure Excel
Analysis Services
Data Science and Artificial Intelligence
Multi-Structured Data Data Science and AI
Azure Data
Lake Storage
Azure Azure Machine Azure Cognitive
Databricks Learning HDInsight Services
DW: Structured Data
Azure SQL
Data Warehouse Data Mart(s)
Power BI
Azure
SQL
Database Excel
Unified Data Science & Data Engineering
Data Lake: Multi-Structured Data Structured Data
Azure Data Azure SQL
Scheduled
Lake Storage Notebook Database
Job
Raw Data
Operationalized
Curated Analytics
Data
Azure
Databricks
Exploratory
Data Science Analytics
Sandbox
Big Data Interactive Querying (SQL on Hadoop)
Hive Metastore
Data Lake
Azure Data Azure SQL
Lake Storage Database
HDInsight
Interactive
HiveQL
Query Cluster
(Hive LLAP)
Big Data Batch Processing Big Data Job Processing
Data Lake: Multi-Structured Data
Azure Data Lake Analytics
Azure Data U-SQL
U-SQL Job Processing
Lake Storage Extensions
Job 1 Job 2
Python
ADLA Catalog
Database
Tables Procedures
External Cognitive
Views Functions Data Services
Sources
Schemas Assemblies
Azure
SQL DB
SQL Server Azure
in Azure VM SQL DW
IoT + Batch Data (Lambda Architecture)
Speed Layer Serving Layer
Streaming
Dashboard
Batch Layer
Azure
Analysis Services
Azure
Data Lake
Storage
Azure
SQL Data
Power BI Excel
Warehouse
Operational BI (Embedded BI)
Published Reports Embedded Visuals
Power BI Custom
Service Application
Premium
Source Data Data Model + Reports Capacity
REST
Azure SQL Power BI App API
Database Desktop Workspace calls
Web Application
Web Page
Diagnostics Backups
App Service
Plan
Storage Storage
Account Account
Wrap-Up
More Info
Azure Solution Architectures:
https://azure.microsoft.com/en-us/solutions/architecture/
More Info
Azure Data Architecture Guide:
https://docs.microsoft.com/en-us/azure/architecture/data-guide/
Thanks!