Bi 06 Etl
Bi 06 Etl
Bi 06 Etl
Outline
Introduction
to SSIS
Learn by doing Exercise 4
More about SSIS features - Package
development tools
Youtube Videos
More videos
An Overview of SSIS
Part I 957, Part II 957, Part III 955, Part IV 959, Part V 611
Introduction to SSIS
SSIS
SSIS
SSAS
SSAS
BIDS
SSRS
SSRS
SAS
SAS
EG
EG
SAS
SAS
EM
EM
5
ETL Topics
Dimension Processing
Extract changed row from the operational database
Handling slowly changing dimensions
De-duplication and fuzzy transforms
Fact Processing
Package structure
Control flow
Data flow
Event handler
Package explorer
Connection tray
Features
Event driven
Layered
Drag-and-drop
programming
Data I/O definitions are
done using Connection
Managers
Control
Flow
Data
Flow
Event
Handler
10
SSIS
Architectur
e
11
Control Flow
Bulk
Data sources
Data destinations
Data transformations
Error flows
14
Dynamic Packaging
Modifying
mapping
frequency
How much history is needed
1
7
1
8
De-Duplication
Two
Fuzzy Lookup
Fuzzy Grouping
19
LEARN BY DOING
EXERCISE 4
Loading dimensions
ProductType
ProductSubType
Product
Country
Plant (using SQL Command)
Material (using SQL Command, Aggregate item)
MachineType (copied from the Material loading task)
Machine (copied from the MachineType loading task)
Note: DimBatch and the fact table will be loaded in the next
exercise.
Debugging
Step by step
Understand the error messages
Watch database loading status
10
DimBatch
DimMachine
ManufacturingFact
DimProduct
7
DimMachineType
DimPlant
DimMaterial
DimCountry
DimProductSubType
DimProductType
Aggregate
SQL Coding
22
23
Package Items
Analysis Services Processing Task, Analysis Services Execute DDL Task, Data
Mining Query Task
Transfer Tasks
Scripting Tasks
Execute Package Task, Execute DTS 2000 Package Task, Execute Process Task,
Message Queue Task, Send Mail Task, WMI Data Reader Task, WMI Event Watcher
Task
File System Task, FTP Task, Web Service Task, XML Task
Transfer Database Task, Transfer Error Messages Task, Transfer Logins Task
Transfer Objects Task, Transfer Stored Procedures Task
Maintenance Tasks
Custom Tasks
24
flow transformation
Aggregate
Derived Column
Data Conversion
Sort
25
Connection managers
Container Managers
Foreach
Loop Container
For Loop Container
Sequence Container
30
32
33
34
Set:
flow transformation