0% found this document useful (0 votes)
30 views14 pages

2 - Data Integration Using Integration Services

Uploaded by

kaimo.private
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views14 pages

2 - Data Integration Using Integration Services

Uploaded by

kaimo.private
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 14

ETL using SQL Server

Integration Services
Lec. 2 - Summer
2024

Ejada Internal Use Only


Agenda
I. Lecture Objectives
II. Extract-Transform-Load
III. What is SSDT
IV. SSIS Overview
V. SSIS Project Structure
VI. SSIS Package Breakdown
VII. Summary & References
VIII. Task: Integrate The Data Into DW

2
Ejada Internal Use Only
Lecture Objectives

By the end of the lecture, you will be able to:


 Understand What is ETL and SSDT

 Identify the core components of SSIS, Tool Capability and Features

 Integrate The Data From Source to Destination using ETL Packages

Prerequisites: • BI professionals and Consultants:


BI Developer, Data analysts and Data Engineers.
• Basic knowledge of data warehouse
• Technically proficient business users.
schema topology.

3
Audience:
Ejada Internal Use Only
Extract-Transform-Load (ETL)
Extract, transform, and load (ETL) is the process of combining
required data from multiple sources, apply transformations
(Cleaining, Formatting, ...) and then load the result into a large,
central repository called a data warehouse.

Ejada Internal Use Only


SSDT Overview

Microsoft’s SQL Server Data Tools (SSDT) is a Visual


Studio development solution for Business
Intelligence Projects,
Covering all Phases of BI Workflow. SSDT Includes:

• Integration Services (SSIS) packages for ETL Process


• Analysis Services (SSAS) for Semantic Layer and Data
Modeling
• Reporting Services (SSRS) for reports (Out of Program Scope)

Ejada Internal Use Only


SSIS Overview

Microsoft SQL Server Integration Services (SSIS) is


a platform for building high-performance data
integration solutions, including extraction,
transformation, and load (ETL) packages for data
warehousing, Some of the key points are:

• Ease of use Tool with Drag and Drop GUI


• Fast and Performant Tool
• Supports Various Data Sources
• Supports Error and Event Handling
• Can Easily Handle Different Load Modes and Slowly
Changing Dimensions
Ejada Internal Use Only
SSIS Project Main Elements
• SSIS Packages:
the collection of tasks executed in an orderly fashion needed to merge data into a single dataset
and load the destination table in a single step. An SSIS package can use control flow, manager,
tasks, variables, event handlers, parameters, and more to achieve this.

• Connection Managers:
Used to set up a link between SSIS and an external data source. SSIS includes several distinct
types of connection managers

• Project Parameters:
Allow you to assign values to properties for packages at the time of package execution.

Ejada Internal Use Only


Components Of SSIS Package

Control Flow Data Flow


Control flow helps you encapsulates the data flow engine
arrange components for that moves data between sources
easier execution. These and destinations, and lets the user
components include transform, clean, and modify data
tasks (data flow, SQL as it is moved
execution) and
containers.
Package Event Handlers
Parameter
Allow you to assign values to Ability of handling events
properties within a single in the run-time such as
package at the time of before/pre-executing the
package execution task or container, post-
execution, error, etc.

Ejada Internal Use Only


Summary of Control Flow Tasks
• Data Flow Task:
Encapsulated Sequence of ETL Tasks that covers data journey from source
extraction then some transformations to destination insertion.

• Execute SQL Tasks:


Used To Execute SQL Statements within available data repository

• Execute Package Task:


Reference any available package to be executed.

• Container:
Logically Encapsulate Related Tasks and used to control the precedence of
execution, has three types (Sequence, For loop, and Foreach loop).

• Expression Task:
Simply an If-Else Conditional Split for different flows.

More Advanced Tasks:


- Analysis Services Processing Task: used to process tables in semantic layer
- Bulk Insert Task: used to insert enormous data volume

Ejada Internal Use Only


Summary of Data Flow Tasks
• Source/Destination Connect:
Connect to Available Data Repository in Connection Manager or other
Types Like Raw,Excel,and Flat Files

• Derived Column:
Adding a new column in the data, can be derived from existing columns
or from metadata of the execution like current timestamp.

• Data Conversion:
Convert Types and Names of Source Column.

• Merge Join:
Similar to inner and Outer Join in SQL that join different tables.

• Union All:
Combines rows from different sources into one output.

• Conditional Split:
Split The Source into many branches Based on Given Criteria.

More Advanced Tasks:


- Slowly Changing Dimension: handle dimensions that changes frequently
Ejada Internal Use Only
Summary

 ETL Process is used to Migrate Data From Source/s To Destination

 SSDT is a Visual Studio Extension for BI Solutions including SSIS, SSAS, SSRS

 SSIS is High-Performance Drag and Drop Data Integrator and ETL Tool

References:
• [[ 4 HOURS ]] SSIS Complete Tutorial - { End to End } Full Course
• SSIS Tutorial For Beginners | SQL Server Integration Services (SSIS)
• Free ETL Tutorial - Learn ETL using SSIS | Udemy

Ejada Internal Use Only


Task: Integrate the Data into DW

Task Description:
It is Required to Integrate The Data from the Staging Database into the
Data Warehouse.

Task Goals:
There are two SSIS Packages Required in this Task:
A. Dimension Package for Customer, Product Dimensions Population
B. Fact Package for Fact Table Population

12
Ejada Internal Use Only
Advanced Points For Further Study

• Incremental Load
• SSIS Catalog and SQL Agent Server
• Error and Event Handling

Ejada Internal Use Only


Thank You

Ejada Systems Company Limited ‫شركة إجادة للنظم‬


‫المحدودة‬ www.ejada.com | [email protected]
Ejada Internal Use Only

You might also like