Implementing An Azure SQL Data Warehouse
Implementing An Azure SQL Data Warehouse
• Cloud-based database
• Relational and nonrelational
• Enterprise workloads
• Integrated with Azure
• Fully managed service
• Benefits include:
• Massive parallel processing
• Advanced query optimization
• Columnstore indexes
• PolyBase integration
• Auditing
• Scalability
Scalability and Cost
• No upfront cost
• Storage
• Adjusts automatically
• Cost based on storage used
• Compute
• Determines execution performance
• Data Warehouse Unit (DWU)
• Increase or decrease DWU
• Cost based on DWU used
• Pause and start
Security and Availability
• Security
• Firewall
• Add logins
• Set authorisation
• Auditing
• Availability
• Can restore in different region
• Can choose restore point in last seven days
PolyBase
Creating a Server
Creating a Database
Configuring the Server Firewall
Connecting to Azure Database Using SQL Server
Management Studio
• Demonstration: Creating and Configuring an
Azure SQL Data Warehouse Database
Creating a Server
• Logical server
• Specify:
• Server name that has not been used
• Server admin logon
• Password
• Location nearest to you
• Create database
• Name of database
• Drag slider to change DWU performance
• Create a new server or use existing server
• Source
• Create a new resource group or use existing resource
group
• DWU settings
• Scale
• Pause/start
Configuring the Server Firewall
• Resource class
• Concurrency slots
• Query may use more than one concurrency slot
• Dependent on resource class and DWU service level
• Concurrent queries
• Maximum of 32 queries
• Maximum slots dependent on DWU service level
• Memory allocation
• Dependent on resource class and DWU service level
Data Distribution
• Advantages
• Straightforward
• Multiple tables
• Specify distribution type
• Notification of incompatibility
• Check compatibility
• Migrate schema
• Migrate data
• bcp commands to export and import
Other Migration Tools
• Options for loading data into an Azure SQL Data Warehouse include:
• Azure Feature Pack for Integration Services (SSIS)
• Downloadable extension for SSIS that facilitates the movement of data between on-
premises and cloud
• SSIS
• Add Azure SQL Data Warehouse connection in data flows
• Use SQL Agent to schedule regular transfer of data
• Bulk Copy Program (bcp)
• Useful for small data, use bcp to copy data to flat files and load into the data warehouse
destination
• AZCopy
• Copy data from flat files into Blob storage, and use PolyBase to load into data warehouse
• Import/Export
• For data larger than 10TB, bcp data to files, copy to disks and ship to Microsoft
• PolyBase and T-SQL
• Move UTF-8 formatted data in text files to Azure Blob storage or HDInsight, then use T-
SQL command to load into the data warehouse
• PolyBase uses the massively parallel processing (MPP) architecture for fast loading
Differences Between SQL Server and Azure SQL
Data Warehouse Schemas
• Scheduling
• JSON templates
• Edit parameters in script
• Replace \ with \\
Creating a Data Factory
• New dataset
• Edit parameters in JSON script
• name
• linkedServiceName
• tableName
• frequency
• interval
Setting Up a Pipeline Activity to Copy Data
• New pipeline
• Edit parameters in JSON script
• name
• start
• end
• Dataset properties
• Data slices
Lab: Implement an Azure SQL Data Warehouse
Logon Information
Virtual machine: 20767C-MIA-SQL
User name: ADVENTUREWORKS\Student
Password: Pa55w.rd
• Review Question(s)