2 - Data Integration Using Integration Services
2 - Data Integration Using Integration Services
Integration Services
Lec. 2 - Summer
2024
2
Ejada Internal Use Only
Lecture Objectives
3
Audience:
Ejada Internal Use Only
Extract-Transform-Load (ETL)
Extract, transform, and load (ETL) is the process of combining
required data from multiple sources, apply transformations
(Cleaining, Formatting, ...) and then load the result into a large,
central repository called a data warehouse.
• Connection Managers:
Used to set up a link between SSIS and an external data source. SSIS includes several distinct
types of connection managers
• Project Parameters:
Allow you to assign values to properties for packages at the time of package execution.
• Container:
Logically Encapsulate Related Tasks and used to control the precedence of
execution, has three types (Sequence, For loop, and Foreach loop).
• Expression Task:
Simply an If-Else Conditional Split for different flows.
• Derived Column:
Adding a new column in the data, can be derived from existing columns
or from metadata of the execution like current timestamp.
• Data Conversion:
Convert Types and Names of Source Column.
• Merge Join:
Similar to inner and Outer Join in SQL that join different tables.
• Union All:
Combines rows from different sources into one output.
• Conditional Split:
Split The Source into many branches Based on Given Criteria.
SSDT is a Visual Studio Extension for BI Solutions including SSIS, SSAS, SSRS
SSIS is High-Performance Drag and Drop Data Integrator and ETL Tool
References:
• [[ 4 HOURS ]] SSIS Complete Tutorial - { End to End } Full Course
• SSIS Tutorial For Beginners | SQL Server Integration Services (SSIS)
• Free ETL Tutorial - Learn ETL using SSIS | Udemy
Task Description:
It is Required to Integrate The Data from the Staging Database into the
Data Warehouse.
Task Goals:
There are two SSIS Packages Required in this Task:
A. Dimension Package for Customer, Product Dimensions Population
B. Fact Package for Fact Table Population
12
Ejada Internal Use Only
Advanced Points For Further Study
• Incremental Load
• SSIS Catalog and SQL Agent Server
• Error and Event Handling