0% found this document useful (0 votes)
439 views7 pages

Azure Data Solutions

This document provides an overview of implementing Azure data solutions. It discusses Azure cloud services including storage, SQL Database, data warehouse, data factory, event hub, IoT hub, stream analytics, databricks, and synapse analytics. It describes the capabilities and use cases of each service, and provides examples of creating and managing resources on each service.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
439 views7 pages

Azure Data Solutions

This document provides an overview of implementing Azure data solutions. It discusses Azure cloud services including storage, SQL Database, data warehouse, data factory, event hub, IoT hub, stream analytics, databricks, and synapse analytics. It describes the capabilities and use cases of each service, and provides examples of creating and managing resources on each service.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Implementing an Azure Data Solutions

Introduction to Azure

 Introduction to Azure Cloud


 What is difference between Azure Cloud and On-Premises
 What is Subscriptions and Resource Groups
 Different offerings of Cloud IaaS, PaaS and SaaS
 Creation of Virtual Machine

Introduction to Storage

 Azure Storage
o Azure Blob
o Table
o Message
o Queue
 Azure Data Lake Store Gen I & Gen II
o What is Data Lake
o Data Lake vs. Hadoop
o Blob Storage vs. Data Lake
o Hierarchical Namespace
o Ingestion through different tools i.e.; Azure Data Explorer, AzCopy, Azure CLI,
Powershell

Introduction to Azure SQL Database

 Introduction to Azure SQL Database


 Why choosing SQL Server in Azure
 Azure IaaS vs. PaaS database offerings
 IaaS vs. Managed Instance
 SQL Server PaaS deployment options
 Demo - Azure Single Database
 Purchasing models and Service Tier
 Azure Database vs. Azure Data Warehouse
 Elastic Database Pool
o Introduction
o Azure Elastic Database
o Demo - Azure Elastic Database
 Managed Instance Database
o Introduction
o Azure Managed Instance Database
o Difference between on-premises and managed instance
o Migration options for Managed Instance
o Service tiers for Managed Instance
o Demo - Managed Instance

 Azure Database Security


o Introduction
o Azure Database and Managed Instance Security options
o Encrypting Data at Rest and Motion
o High Availability vs. Disaster Recovery
o RTO vs. RPO
o Azure SQL Database High Availability and Disaster Recovery options
o Azure SQL Database Scaling
 Installation of SQL Server 2016 and above in Virtual Machine
 Creation of External Table or PolyBase in On-Premise SQL Server
o Creation of Master Key
o Creation of Database Scoped Credential
o Creation of External Data Source
o Creation of External File Format
o Creation of External Table
 Creation of External Table or PolyBase in Azure SQL Data Warehouse
o Creation of Master Key
o Creation of Database Scoped Credential
o Creation of External Data Source
o Creation of External File Format
o Creation of External Table
 Different Distribution or Shredding Patterns
o ROUND ROBIN
o HASH
o REPLICATION
 Cross Query Databases in Azure SQL Database
o Creation of Master Key
o Creation of Database Scoped Credential
o Creation of External Data Source
o Creation of External Table
 Creation of Elastic Pools in Azure SQL Server between Databases

Data Warehouse Internals and Architecture


 Introduction
 Azure Synapse MPP Architecture
 Storage and Sharding patterns
 Data Distribution and Distributing Keys
 Data Types and Table Types
 Partitioning
 Data Warehouse Concepts
 Dimensions and Facts
 Types of Dimensions and Facts
 Different types of Schemas in Data Warehouse
 Relationship types in Data Warehouse
 Best Practices for Fact and Dimension tables
 Demo - Analyze Data distribution before migration to Azure Synapse

Azure Data Factory

 Introduction to Azure Data Factory


 Creation of Linked Services, Datasets, Pipelines
 Creation of Integration Runtime and different types
 Slowly Changing Dimensions
 Design and implement a Type 1 slowly changing dimension with mapping data flows
 Debug data factory pipelines
 Understand the Azure SSIS Integration Runtime
 Set-up Azure SSIS Integration Runtime
 Run SSIS Package in Azure Data Factory
 Migrate SSIS Packages to Azure Data Factory
 Integrate SQL Server Integration Services Packages within Azure Data Factory
 Activities
o Copy
o Data flow
o Stored Procedure
o Lookup
o ForEach
o Get Metadata
o Filter Activity
o Spark
o U-SQL
o Databricks Notebooks
o Web
o If Condition
o Delete
 Data Flows
o Derived Column
o Join
o filter
o exists
o conditional split
o Lookup, Exists
o Select
o Aggregate
o Rank
o Filter
o Sort
o Alter Row
 Dynamic Queries in ADF
 Sending mails through Logic Apps
 Few more Activities ......
 Dataset and Pipeline Parameterization
 Monitor -- Azure and Visually
 Setup Alerts from Azure Data Factory

Realize Integrated Analytical Solutions with Azure Synapse Analytics

 Introduction
 What is Azure Synapse Analytics
 How Azure Synapse Analytics works
 When to use Azure Synapse Analytics
 Create Azure Synapse Analytics workspace
 Exercise - Create and manage Azure Synapse Analytics workspace
 Describe Azure Synapse Analytics SQL
 Explain Apache Spark in Azure Synapse Analytics
 Exercise - Create pools in Azure Synapse Analytics
 Orchestrate data integration with Azure Synapse pipelines
 Exercise-Identifying Azure Synapse pipeline components
 Visualize your analytics with Power BI
 Understand hybrid transactional analytical processing with Azure Synapse Link
 Use Azure Synapse Studio
 Understand the Azure Synapse Analytical processes
 Explore the Data hub, Develop hub, Integrate hub
 Explore the Monitor hub, Manage hub
 Describe a modern data warehouse
 Define a modern data warehouse architecture
 Exercise - Identify modern data warehouse architecture components
 Design ingestion patterns for a modern data warehouse
 Understand data storage for a modern data warehouse
 Understand file formats and structure for a modern data warehouse
 Prepare and transform data with Azure Synapse Analytics
 Serve data for analysis with Azure Synapse Analytics

Azure Synapse Analytics

Introduction
 Why Warehouse in cloud
 Traditional vs. Modern Warehouse architecture
 What is Synapse Analytics Service
 Create Dedicated SQL Pool and Spark Pool
 Create Azure Synapse Analytics Studio Workspace
 Analyze Data using Dedicated SQL Pool and Spark Pool
 Analyze Data using Apache Spark Notebook
 Analyze Data using Serverless SQL Pool
 Azure Synapse Benefits

Azure Event Hub, IoT Hub and Azure Stream Analytics

 Introduction to Azure Event Hub, IoT Hub and Stream Analytics


 Azure Stream Analytics Job
 Azure Stream Analytics Components
 Azure Stream Analytics Job
 Batching Streaming using Azure Event Hub
 Real Time Streaming using Azure IoT Hub
 Types of Window Functions
o Tumbling Window
o Hoping Window
o Sliding Window
o Session Window

Azure Databricks

 Spark Basics
 Why Spark is difficult? Why Databricks Evolved?
 Why Databricks in Cloud? Introduction to Azure Databricks
 Demo
 Provision Databricks, Clusters and workbook
 Mount Data Lake to Databricks DBFS
 Explore, Analyze, Clean, Transform and Load Data in Databricks
 Azure Databricks Clusters
 Azure Databricks other Important Components
 Databricks - Monitoring
 How to create Cluster
 How to work with Databricks File System
 How to create notebooks and Integrate with ADF
 How to import and export the Notebooks
 How to connect to blob, SQL DB from Databricks
 How to read data files from Azure Blob and Azure Data Lake Store
 Using Scala, R, Python, Spark SQL Language
 Creating Data Frames
 Converting Data Frames into Temporary Table or Temporary View
 Incremental and Full Load with Azure SQL Data Warehouse
 Understand the architecture of Azure Databricks spark cluster
 Understand the architecture of spark job
 Read data in CSV format
 Read data in JSON format
 Read data in Parquet format
 Read data stored in tables and views
 Write data
 Describe a DataFrame
 Use common DataFrame methods
 Use the display function
 Exercise: Distinct articles
 Describe the difference between eager and lazy execution
 Describe the fundamentals of how the Catalyst Optimizer works
 Define and identify actions and transformations
 Describe the column class
 Work with column expressions
 Perform date and time manipulation
 Use aggregate functions
 Exercise: Deduplication of data
 Describe the Azure Databricks platform architecture
 Perform data protection
 Describe Azure key vault and Databricks security scopes
 Secure access with Azure IAM and authentication
 Describe security
 Exercise: Access Azure Storage with key vault-backed secrets
 Describe the open source Delta Lake
 Exercise: Work with basic Delta Lake functionality
 Describe how Azure Databricks manages Delta Lake
 Exercise: Use the Delta Lake Time Machine and perform optimization
 Describe Azure Databricks structured streaming
 Perform stream processing using structured streaming
 Work with Time Windows
 Process data from Event Hubs with structured streaming
 Describe bronze, silver, and gold architecture
 Perform batch and stream processing
 Schedule Databricks jobs in a data factory pipeline
 Pass parameters into and out of Databricks jobs in data factory
 Integrate with Azure Synapse Analytics
 Understand workspace administration best practices
 List security best practices
 Describe tools and integration best practices
 Explain Databricks runtime best practices
 Understand cluster best practices

Azure Cosmos DB

Introduction to NoSQL DB
 Introduction to NoSQL
 SQL vs. NoSQL
 Types of NoSQL
 NoSQL Offerings by Microsoft
Introduction to Cosmos DB
 Cosmos DB Features
 Cosmos DB - Multi Model 5 APIs
 Table Storage vs. Cosmos DB
 Provision Cosmos DB Account

On-Premise Databases Migration


 DMS -- Database Migration Service
 On-Premise SQL Server to Azure Virtual Machine
 On-Premise SQL Server to Azure SQL Server

You might also like