Integration Services Tutorial
Overview
SQL Server Integration Services (SSIS) is the integration and ETL (extract, transform, load) tool in the Microsoft Data Platform stack. SSIS is typically used in data warehousing
scenarios, but can also be used in common data integration use cases or just to move
data around. SSIS is used behind the scenes in the Maintenance Plans of SQL Server and
in the Import/Export wizard.
Some of the things SSIS can do:
Transferring data from a source to a destination. This is done in memory and you can perform data manipulation tasks on the data while it is in memory, which makes SSIS one of the faster tools on the market.
You can perform simple FTP tasks.
SSIS can send emails to notify people.
SSIS is capable of robust error and event handling.
You can define a workflow with constraints to conditionally execute certain tasks.
And if all that isn’t enough, you can always extend SSIS with .NET code.
We will go through a number of topics in order to create our package.
Overview
In this section, we’ll briefly discuss the history of the Integration Services product and the
tools you would use to create SSIS projects.
History
Before SSIS, SQL Server came with Data Transformation Services (DTS), which was part
of SQL Server 7 and 2000. For SQL Server 2005, the teams at Microsoft decided to
revamp DTS. Ultimately, they ended up with a replacement for DTS instead of just an upgrade, and because it was such a drastic change, it was decided to name the product Integration Services instead of DTS. This name change came late in the product development cycle and that’s why some objects still refer to DTS, for example the command line tools DTEXEC (to execute SSIS packages) and DTUTIL (to deploy packages to a server).
Integration Services was launched with SQL Server 2005 and the most basic core
functionality is still the same today. It was a drastic change from DTS and it quickly became a popular ETL tool due to its speed, flexibility and support for various sources.
With SQL Server 2008, lots of performance improvements were made to SSIS and new
sources were introduced as well. SQL Server 2008 R2 didn’t introduce any noticeable
changes for SSIS.
SQL Server 2012 was a major release for SSIS. It introduced the concept of the project
deployment model, where entire projects with their packages are deployed to a server,
instead of individual packages. The SSIS of SQL Server 2005 and 2008 is now referred to
as the (legacy) package deployment model. SSIS 2012 made it easier to configure
packages and it came with a centralized storage and management utility: the catalog. We’ll
dive deeper into those topics later on in the tutorial.
SQL Server 2014 didn’t include any changes for SSIS itself, but new sources and transformations were added to the product on the side. This was done through separate downloads on CodePlex (an open-source code website) or through the SQL Server Feature Pack.
Examples are the Azure feature pack (to connect to cloud sources and objects) and
the balanced data distributor (to divide your data stream into multiple pipelines).
In SQL Server 2016 there were some updates to the SSIS product. Instead of deploying entire projects, you can now deploy packages individually again. There are additional
sources – especially cloud and big data sources – and some important changes were
made to the catalog. You can find an overview of all new features here and here.
During all these years, SSIS has built itself a reputation for being a stable, robust and fast
ETL tool with support for many sources. However, it’s still mainly an on-premises solution; there is – at the time of writing – no real cloud alternative.
Each SSIS version is tied to a version of Visual Studio:
SSIS 2005 – VS 2005. Templates were called Business Intelligence Development Studio (BIDS).
SSIS 2012 – VS 2010. Templates renamed to SQL Server Data Tools (SSDT).
SSIS 2016 – VS 2015. Database tools and business intelligence tools are combined into one single product: SSDT. Separate download.
If you want to follow along in this SSIS tutorial for SQL Server 2016, you can download the
latest version of SSDT here. Make sure you download the templates for Visual Studio
2015. At the time of writing, SQL Server 2017 hasn’t been released yet, but with the latest
version of SSDT for Visual Studio 2015 (SSDT 17.2) you can already develop projects for
SQL Server 2017.
Since SQL Server 2016, it’s possible to develop projects for earlier versions of SSIS within
the same version of Visual Studio. In the latest version, you can develop projects for SQL
Server 2017, 2016, 2014 and 2012. The tip Backwards Compatibility in SQL Server Data
Tools for Integration Services explains the concept in more detail.
Overview
Like any ETL tool, Integration Services is all about moving and transforming data. In this tutorial, we’ll want to extract data from a certain source and write it to a destination. In many cases, either the source or the destination will be a relational database,
such as SQL Server. In this tutorial, we’ll use the Wide World Importers sample database.
Explanation
We will set up databases that can be used for testing and learning more about SSIS.
In this section of the tutorial, we will restore the Wide World Importers database from a
backup, as it is the easiest option. If you want to learn more about Wide World Importers,
you can check out these tips.
You can download a backup of the Wide World Importers database from here. This
backup currently contains data from 2013-01-01 till 2016-05-31. Make sure you have a
SQL Server instance available (installing and configuring SQL Server is not part of this
tutorial). Since there are new features present in this backup, you need SQL Server 2016
or later. If you upgrade to SP1, you can use all features that were previously Enterprise
edition only, such as compression or columnstore indexes. You can find more information
on which features are present in which edition in this overview. You could also use SQL
Server 2016 Developer edition, which is free and has all the features available.
We are going to restore the backup using SQL Server Management Studio (SSMS). Right-
click on the Databases node and select Restore Database…
Next, we have to choose the backup file we downloaded from GitHub. By default, the
explorer shows the folder that has been configured as the default backup folder for the
SQL Server instance. Either move your backup file to that folder, or navigate to the
directory where you have saved the .bak file.
Click OK twice until you’re back in the Restore Database menu. We have one more thing
to do before we can restore the backup. Go to the Files pane.
In Files, choose to relocate all of the database files to the default SQL Server folders
(which you can configure during the SQL Server set-up).
Now you can click on OK to start the restore procedure. Depending on your machine, this
might take some time. To restore the Wide World Importers data warehouse, you can
follow the exact same steps. You can find the .bak file here.
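If you prefer T-SQL over the SSMS dialogs, a restore along these lines should also work. This is only a sketch: the backup path and the logical file names are assumptions, so list them first with RESTORE FILELISTONLY and adjust the MOVE targets to your instance’s folders.

-- Inspect the logical file names inside the backup (path is an assumption)
RESTORE FILELISTONLY
FROM DISK = N'C:\Backups\WideWorldImporters-Full.bak';

-- Restore the database, relocating each file
RESTORE DATABASE WideWorldImporters
FROM DISK = N'C:\Backups\WideWorldImporters-Full.bak'
WITH
    MOVE N'WWI_Primary' TO N'C:\Data\WideWorldImporters.mdf',
    MOVE N'WWI_UserData' TO N'C:\Data\WideWorldImporters_UserData.ndf',
    MOVE N'WWI_Log' TO N'C:\Data\WideWorldImporters.ldf',
    MOVE N'WWI_InMemory_Data_1' TO N'C:\Data\WideWorldImporters_InMemory_Data_1',
    RECOVERY;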
If you have followed all the steps, you should now have two new databases in your SQL Server instance: WideWorldImporters and WideWorldImportersDW.
Overview
In this chapter, we’re going to create our SSIS project. For this, we need Visual Studio
2015. You need to download SQL Server Data Tools 2015, which will install a shell of
Visual Studio 2015. If you want a full-blown Visual Studio, so you can also tackle other
types of projects such as R projects or .NET projects, you can download Visual Studio 2015
Community Edition (which is free if you subscribe to Visual Studio Dev Essentials). With
the latest version of SSDT 2015 (SSDT 17.2 at the time of writing), you can create SSIS
projects for SQL Server 2012, 2014, 2016 and 2017.
Keep in mind that if you want source control integration with Team Foundation Server, you
need the full-blown Visual Studio. There is no Team Foundation Explorer plug-in for Visual
Studio 2015, so you can’t use the shell of SSDT.
Creating a Project
Start Visual Studio. If it’s the first time, you might get a prompt asking which settings Visual
Studio should use. You can pick the Business Intelligence settings. When Visual Studio
has started, go to File > New > Project.
In the New Project menu, enter a name for the project and specify a location to save the
project.
When you create a project, Visual Studio will create a solution first and add the project to
that solution. By default, the solution has the same name as the project. If you want to add
multiple projects to one solution, you might want to change the solution name. If you have
source control integrated into Visual Studio, you will have an extra checkbox asking you if
you want to add the project to source control.
When you click OK, the solution and the project will be created and an empty package will
be added to the project. You can view the project structure in the Solution
Explorer window:
When there’s only one project, the solution will not be displayed.
Let’s take a look at our development environment for creating SSIS packages. Keep in
mind that most of the windows are dockable, which means you can move them around, so
it’s possible you do not have the exact same view as in this screenshot.
1. This is your canvas. Here you drag items from the toolbox and you connect them with each
other to create a workflow. This will be discussed in more detail in the next sections of the
tutorial. The package canvas has multiple tabs:
1. The control flow. Here you can have multiple tasks which you can connect with each other.
The control flow is important as it defines what your package actually does.
2. The data flow. This is a special task of the control flow. Here you move data around between
sources and destinations, and you can transform the data while it is in memory.
3. Parameters. You can define parameters to make your package more flexible.
4. Event Handlers. These are special “control flow”-like canvasses where you can define tasks that will only execute if a specific event occurs. Event handlers fall outside the scope of this tutorial.
5. Package Explorer. A tree-view of all the objects inside your package.
2. The SSIS Toolbox. Here you can find all tasks and transformations for the control and data
flow. You can drag them from the toolbox into the canvas. There’s also another window just
called “Toolbox”. It’s used for other types of projects such as Reporting Services, so don’t confuse it with the SSIS Toolbox. If you can’t find the SSIS Toolbox, right-click on the canvas
and select SSIS Toolbox from the context menu.
3. The connection managers. A connection manager defines a connection to a specific object.
This can be a flat file, a database, a folder and so on. Tasks and transformations use a
connection manager to create a connection to the object.
4. The Solution Explorer. A tree view of all the objects in the project or solution.
5. The properties window. Here you can view and change the properties of almost all objects
within an SSIS package.
6. The toolbars. The most important item is the green arrow, which you can use to start the
debugger. The debugger will execute the SSIS package within Visual Studio.
Everything mentioned here will be explained in more detail in the following sections of the tutorial.
There’s only one window missing from this view: the variables. When you create your first
SSIS project, this window is hidden. You can right-click on the canvas and select Variables
to open the window.
Variables are used to make your package more flexible and change properties on the fly
when a package is running. The difference between parameters and variables is that
parameters cannot change value once the package has started executing, while variables
can. Parameters are used as input for the package before it starts.
Let’s go to the next section to learn more about the control flow.
Overview
Now that we’ve created our SSIS project in the previous chapter, it’s time to start to
explore the control flow and its abilities. The control flow allows you to execute different
tasks and organize a workflow between the tasks. In this section, we’ll give an overview of
the objects you can add to the control flow.
SSIS Tasks
In an SSIS package, you can add tasks to the control flow. A task is a unit of work and you
have different kinds of tasks to perform different kinds of work. Explaining all tasks would
take us too far in this tutorial, so here’s an overview of some of the most common tasks:
Execute SQL Task: These tasks will execute a SQL statement against a relational database.
Data Flow Task: These special tasks can read data from one or more sources,
transform the data while in memory and write it out against one or more
destinations. We’ll describe the data flow in more detail in the next sections of the
tutorial.
Analysis Services Processing Task: You can use this task to process objects of
an SSAS cube or Tabular model.
Execute Package Task: With this task, you can execute other packages from
within the same project. You can also pass variable values to the called package.
Execute Process Task: Allows you to call an executable (.exe). You can specify
command line parameters. With this task, you can for example unzip files, execute
batch scripts and so on.
File System Task: This task can perform manipulations in the file system, such as
moving files, renaming files, deleting files, and creating directories et cetera.
FTP Task: Allows you to perform basic FTP functionalities. However, this task is
limited because it doesn’t support FTPS or SFTP.
Script Task: This task is essentially a blank canvas. You can write .NET code (C#
or VB) that performs any task you want.
Send Mail Task: Here you can send an email. Ideal for notifying users that your package is done running or that something went wrong.
In the screenshot, you can see an Execute SQL Task that has been added to the control
flow:
There are of course more tasks. Some are for working with Azure or big data systems,
others are for performing DBA tasks (and are essentially the building blocks of SQL Server
Maintenance Plans). You can find an overview in the documentation.
SSIS Containers
Next to tasks, you also have containers. These give you more power over how tasks are
executed. You can add one or more tasks to a single container.
For Loop Container: With this container, you can execute all tasks inside for a
fixed number of executions. This is equivalent to for loops in a programming
language.
Foreach Loop Container: This container doesn’t execute a fixed number of times like the for loop; instead, the number of executions is determined by a collection. This can be for example the number of files in a directory or the number of rows in a table. This makes this container more flexible than the For Loop Container.
Sequence Container: This container simply groups tasks together. The tasks will
execute together. This container is useful to split your control flow into logical units
of work.
Here you can see a couple of tasks inside a sequence container. When you execute the sequence container, all three tasks will start at the same time.
It’s time to start building an SSIS package. In this chapter, we’ll add tasks to the control
flow and learn how you can start the debugger to execute the package. We’ll also look at how the execution of different tasks can be related to each other.
Drag an Execute SQL Task from the SSIS Toolbox onto the control flow. You can see there’s a red error icon on the task. That’s because we haven’t defined a database connection yet.
Double click the task to open it. In the editor, open the connection dropdown and click
on <New Connection…>.
If you have already created connection managers, you can pick one from the list in the
next window. However, you can also create a new one by clicking the New… button at the
bottom.
This will open a connection manager editor. You need to enter the server name and select
a database from the dropdown list. You can also optionally specify a username and
password if you don’t want to use Windows Authentication.
Click OK two times to go back to the Execute SQL Task editor. You can either directly type
a SQL statement in the SQLStatement property or you can click on the ellipsis to open up
the editor. This editor is basically a notepad editor and it has no additional functionality.
You are most likely better off writing SQL statements in Management Studio and copy
pasting them into the editor. Let’s enter a very basic statement: SELECT 1.
We can now run the package to test our Execute SQL Task. You can click on the green
arrow or just hit F5. This will start the debugger which will run the package.
When the task has finished, you will see a green icon in the corner of the task. You can
click on the stop icon in the task bar to stop the debugger or you can click on the sentence
below the connection manager window.
When the package is running, an extra tab is added called Progress. Here you can see all
of the informational messages, errors and warnings generated by the SSIS package as
well as timing information.
When the debugger stops, the Progress tab is renamed to Execution Results.
You can create a precedence constraint by selecting the first task and dragging the green arrow onto the other task. Now when we execute the package, the first task will be executed and then the other.
The green arrow signifies a “Success” precedence constraint, which means the second
task will only be executed if the first task is successful. You can change the behavior of the
precedence constraint by double clicking on the arrow:
You can change the precedence constraint to “Failure”, which means the second task will only be executed if the first task fails. With “Completion”, the second task will execute once the first task has finished, but it doesn’t matter if the first task was successful or not.
When you have multiple arrows going into one single task, you can change the constraint
to AND or OR. With AND, all tasks need to be successful before the task starts. With OR,
only one task needs to be successful. In the following screenshot, only one of the two top
tasks must finish successfully so the last task can start.
With precedence constraints and containers, you can create complex workflows:
Overview
In this section, we will introduce the Integration Services Data Flow. It’s one of the more
important features of SSIS and one of the reasons SSIS is considered one of the fastest
ETL tools. We’ll also give an overview of the more important transformations you can do in the data flow.
The data flow is a special task of the control flow. It needs a canvas of its own, so there’s
an extra tab for the data flow, right next to the control flow.
The data flow is a construct where you can read data from various sources into the
memory of the machine that is executing the SSIS package. While the data is in memory,
you can perform different kinds of transformations. Because it’s in memory, these are very
fast. After the transformations, the data is written to one or more destinations (a flat file, an
Excel file, a database, etc.). In most cases, not all data is read into memory at once (although this is possible if you use certain kinds of transformations); instead, the data is read into buffers. Once a buffer is filled by the source component, it is passed on to the next transformation, which does its logic on the buffer. Then the buffer is passed to the following transformation and so on, until it is written to the destination. You can imagine the data flow as a pipeline with data flowing through.
To create a data flow, you can drag it from the SSIS toolbox to the control flow canvas.
Another option is to simply go to the data flow tab, where you will be greeted with the
following message:
Clicking the link will create a new data flow task for you. You end up with an empty
canvas, just like in the Control Flow.
As you can see in the screenshot above, the SSIS toolbox will change once you go to the
data flow canvas. All the tasks are now replaced with transformations, sources and
destinations for the data flow. At the top, you also have a dropdown box that lets you
easily switch between multiple data flows if you have any.
There are other types of sources and destinations available. You can take a look at
the documentation to learn more. You also have the possibility to use a .NET script
component to make your own source or destination. This is similar to a .NET script task;
you can use C# or VB, but now there are special methods and classes included to handle
the buffers of the data flow.
In the next two sections of the tutorial, we’ll configure a source, some transformations and
a destination.
Overview
In this section, we’ll get our hands dirty in the data flow. We’ll read data from our sample database and look at the tools we can use to inspect this data.
Drag the Source Assistant from the SSIS Toolbox onto the data flow canvas. In the assistant, double-click on New… to create a new connection manager, while SQL Server is still selected as the source type. In the connection manager editor, enter the server name and select the WideWorldImporters database. Click OK.
The assistant will now put an OLE DB Source component on the data flow and create a
connection manager. If you want to re-use the same connection manager across different
packages, you can right-click the connection manager and choose “Convert to Project
Connection”. This will upgrade the connection manager to the project level, where it is
shared between all packages of the project.
Double click on the OLE DB Source to open its editor. There are different options to read
data from the database:
However, it’s almost always better to write a SQL statement instead of using the dropdown
(table or view option) to select a table. With the dropdown, you select all rows and all
columns and you do not have the option to do some transformations using the SQL
language (such as grouping, sorting and aggregating the data). Change the option to SQL
command. This will give you a text box where you can enter your T-SQL statement. If you
want, you can use a graphical query builder to construct your statement, but most of the
time it’s easier to just write it in Management Studio and copy paste it in the source
component. You can use the following SQL statement:
SELECT
 [CityID]
,[CityName]
,[StateProvinceID]
,[LatestRecordedPopulation]
FROM [WideWorldImporters].[Application].[Cities];
This selects all the cities from the WideWorldImporters database. The source table is
a system-versioned table, so we get the latest data when we execute this statement.
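As an aside, because the table is temporal, you could also retrieve the data as it existed at an earlier point in time with the FOR SYSTEM_TIME clause. A quick sketch; the date is arbitrary:

SELECT [CityID], [CityName], [StateProvinceID], [LatestRecordedPopulation]
FROM [WideWorldImporters].[Application].[Cities]
FOR SYSTEM_TIME AS OF '2015-01-01';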
When you copy and paste the SQL statement into the source component, you can hit
preview to take a look at the data:
In the Columns tab, you can inspect all the columns returned by the query defined in the first tab. You can deselect columns to remove them from the output and you can rename columns as well, although it’s better to do these manipulations directly in the query.
Every column has a data type associated with it. The data flow expects that this metadata
doesn’t change. If you would change the data type in the source table (for example change
cityID to a date if that were possible), the data flow would throw an error. Sometimes SSIS
doesn’t realize though metadata has changed. In that case, you can just deselect all
columns and select them again (using the checkbox right next to Name) to quickly refresh
the metadata of all columns.
Click OK to close the editor. To be able to run the data flow and see the data flowing
through, you need to add one more transformation. Let’s use the Multicast as a dummy.
Connect the source component to the Multicast with the blue arrow.
The arrows are not exactly precedence constraints like in the control flow. They tell the data flow in which direction the data flows. You have two types: the normal blue output arrow and the red arrow. The red arrow is the error output of the transformation. If some rows have an error (for example a data type mismatch in the source), you can redirect them to another destination so you can inspect them later. If you click on the source again, you can see the red arrow.
You can find more information about error handling in the tip How to serialize error logging
in SSIS.
Finally, right-click on the blue arrow and choose “Enable Data Viewer”.
This will add some sort of “debug” window on your output path. When the data flow runs,
you can inspect the rows in the current memory buffer. Let’s start the package. The first
buffer contains 9,637 rows and they are shown in the data viewer.
You can copy the data to inspect them in another tool, such as Excel for example. To fetch
the next buffer, click on the little green arrow in the data viewer. When you close the data
viewer, the data flow will run till all the rows have been fetched from the source. In total,
37,940 rows are read from the source:
In the next chapter, we’ll add some transformations to enrich the data.
Overview
In this section, we build further upon the data flow created in the previous section. We will
add an extra column using the Derived Column transformation and fetch extra data using
the Lookup component.
Drag a Derived Column transformation from the SSIS Toolbox onto the data flow and open its editor. In the editor, you can drag variables, parameters and existing columns to the expression. On the right, you also have a library of functions available. You can also choose whether you want to replace existing columns or add a new column:
Let’s add a new column that contains the date of today (using the GETDATE() function)
and trim the existing CityName column. You can drag the column name and
the Trim function to the editor:
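The two expressions end up looking like the following; the name of the new column (LoadDate here) is just an example, so pick whatever fits your naming conventions:

GETDATE()       (added as a new column, e.g. LoadDate)
TRIM(CityName)  (replacing the existing CityName column)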
The Derived Column transformation is very powerful, but the one-line expression editor
can be frustrating. Let’s add a multicast and a data viewer to inspect the results:
Next, add a Lookup component to the data flow; we will use it to fetch the StateProvinceName for each city. In the General pane of its editor, there are a couple of important settings:
Cache mode. This defines how the reference dataset is loaded. With full cache, the entire dataset is loaded into memory at the start of the data flow. This allows for very quick matching between the datasets. With partial cache, only a part of the dataset is loaded into memory. If there’s a cache miss, the data will be fetched from the database and put in the cache, possibly evicting older data. With no cache, nothing is loaded into memory and a query needs to be sent to the database for every row, which is quite slow.
Connection type. You can either choose a cache connection manager (for when you want to pre-load your reference datasets and use them in multiple packages or data flows) or a regular OLE DB connection manager. The default is a standard OLE DB connection manager. Notice there’s no option to use ADO.NET or ODBC.
No match behavior. Here you specify how the lookup component should behave if
no match was found for a row.
o Ignore failure. The row is passed to the Match Output and the columns from the
reference dataset get NULL values.
o Redirect rows to error output. All rows without a match are sent to the error output
(the red arrow).
o Fail Component. The default, but a bit drastic. The data flow and package will stop if
no match is found.
o Redirect rows to no match output. A new output is created to which all rows without a match are sent.
Let’s set this option to “Redirect rows to no match output”. In the next tab, you need to
define the reference dataset. You can use the dropdown box to select a table, but just as
with the source, a SQL statement is preferred. Select only the columns you need to make
the match and of course the columns you want to return. We can use this T-SQL
statement:
SELECT
[StateProvinceID]
,[StateProvinceName]
FROM [Application].[StateProvinces];
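Conceptually, the Lookup behaves like a left outer join between the data flow and this reference query. In T-SQL, the match we’re about to configure would look roughly like this; rows where StateProvinceName comes back NULL correspond to the no match output:

SELECT c.[CityID]
      ,c.[CityName]
      ,c.[StateProvinceID]
      ,c.[LatestRecordedPopulation]
      ,sp.[StateProvinceName]
FROM [Application].[Cities] AS c
LEFT OUTER JOIN [Application].[StateProvinces] AS sp
    ON sp.[StateProvinceID] = c.[StateProvinceID];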
The last tab we need to edit is the Columns tab, where we specify how the matching will take place and which columns we want to return. You need to drag the key columns from the input columns onto the key columns of the lookup columns. Then you can check each column from the lookup columns that you want returned; in our case the StateProvinceName.
Close the editor. When we now attach a multicast to the lookup component, we can
choose which output we want:
Let’s attach multicasts on both outputs, combined with data viewers so we can test the
lookup component:
As you can see, the StateProvinceName column has been added to the buffer and there
were no rows sent to the no match output.
Overview
In this chapter, we are going to write the data to a destination. Make sure you have
finished the previous section of the tutorial to have a finished data flow.
Drag the Destination Assistant from the SSIS Toolbox onto the data flow, select the SQL Server destination and double-click on New… to create a new connection manager. We will write the data to the WideWorldImportersDW database.
Click OK twice to close the editors. The destination assistant will add a new OLE DB
Destination to the canvas. Connect the Lookup to this destination with the Lookup Match
Output:
Open up the destination editor. Make sure the correct connection manager is selected
and Table or view – Fast Load is selected as data access mode. You can select a table
from the dropdown menu (for the destination it’s fine to use the dropdown), but we are
going to create a new table first.
Click on New… next to the dropdown. This will open up an editor with the CREATE TABLE
statement, based on the metadata of the data flow.
If you click OK, the table will be created in the database specified by the connection
manager. You might want to change the table name first though.
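For our data flow, the generated statement will look something along these lines. The table name (DimCity) is just an example, and the exact data types may differ since SSIS derives them from the data flow metadata:

CREATE TABLE [DimCity] (
    [CityID] int,
    [CityName] nvarchar(50),
    [StateProvinceID] int,
    [LatestRecordedPopulation] bigint,
    [LoadDate] datetime,
    [StateProvinceName] nvarchar(50)
)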
Make sure the new table is selected in the dropdown menu. Leave all the other settings
as-is. In the Mapping pane, we can map columns from the input to the columns of the
destination table. Since all columns have the same name, they are mapped automatically.
You can map columns by dragging them from the left list to the right, or you can map them in the grid below. If your columns have the same name (recommended) but they haven’t been mapped automatically, you can right-click anywhere in the space above the grid and select Map Items by Matching Names from the context menu. This will save you quite some time with bigger tables.
When the mapping is finished, you can click OK to close the editor. The data flow is now
finished.
To make the package re-runnable, we can add an Execute SQL Task to the control flow that runs before the data flow. Open up its editor, choose the WideWorldImportersDW connection manager and type the SQL statement to truncate the table:
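Assuming the destination table was named DimCity, as in our example, the statement is simply:

TRUNCATE TABLE [dbo].[DimCity];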
When we take a look at the destination table, we can see 37,940 rows have been inserted.
In the following chapters of the tutorial, we’ll learn how we can deploy our package to the
server and how we can execute it over there.
Overview
Now that our SSIS package development is finished, we can deploy it to the server. There
we can schedule and execute the package as well.
Right-click on the project in the Solution Explorer and choose Deploy. This will start the SSIS deployment wizard. Keep in mind this will deploy the entire project, with all packages included. If you want to deploy an individual package, you can right-click on the package itself and choose Deploy (since SSIS 2016).
In the first step of the wizard, we need to choose the destination (several steps are
skipped since we started the wizard from Visual Studio). Enter the server name and make
sure the SSIS catalog has already been created on that server. If you want, you can also
create a folder to store the project in.
At the next step, you get an overview of the actions the wizard will take. Hit Deploy to start
the deployment.
The project has now been deployed to the server and you can find it in the catalog:
To execute the package, right-click on it in the catalog and select Execute…. You will be taken to a dialog where you can edit certain properties, such as the connection managers, parameters if any, the amount of logging and so on.
Click on OK to start the execution of the package. A pop-up will open asking you if you want to open one of the catalog’s built-in reports.
Click Yes. This will take you to the Overview report, where you can see the package has executed successfully.
To learn more about the catalog reports, check out the tip Reporting with the SQL Server
Integration Services Catalog.
To schedule the package, we create a new job in SQL Server Agent. In the General pane, enter a name for the job, choose an owner and optionally enter a description:
In the job step configuration, you can enter a name for the step. Choose the SQL Server
Integration Services Package type, enter the name of the server and select the package.
In the configuration tab, you can optionally set more properties, just like when executing a package manually. Click OK to save the job step. In the Schedules tab, you can define one or more schedules to execute the package at predefined points in time. Click New… to create a new schedule. In the schedule editor, you can choose between multiple types of schedules: daily, weekly or monthly. You can also schedule packages to run only once. In the example below we have scheduled the job to run every day at 1AM, except in the weekend.
Click OK twice to exit the editors. The job is now created and scheduled.
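If you’d rather script the schedule than click through the dialogs, the same weekdays-at-1AM schedule can be attached with the msdb stored procedures. This is a sketch: the job name below is hypothetical, so replace it with the name you gave your job.

USE msdb;
GO
EXEC dbo.sp_add_jobschedule
    @job_name = N'Load WideWorldImporters',  -- hypothetical job name
    @name = N'Weekdays at 1AM',
    @freq_type = 8,               -- weekly
    @freq_interval = 62,          -- day bitmask: Mon(2)+Tue(4)+Wed(8)+Thu(16)+Fri(32)
    @freq_recurrence_factor = 1,  -- every week
    @active_start_time = 010000;  -- HHMMSS, i.e. 01:00:00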
Overview
In the last chapter of this tutorial we’ll look at a couple of performance optimizations you
can implement in your SSIS packages. After all, you want to move data around as quickly
as possible.
Think about whether you want to perform tasks in SSIS or whether you can do them somewhere else. For example, sorting data will be faster in SQL Server T-SQL code than in SSIS.
Perform tasks in parallel if possible, but don’t overdo it. Going parallel can improve performance, but this is heavily influenced by the available memory and the number of processors. There is a certain overhead to parallelism; if there’s too much parallelism, the system will go slower instead of faster. Carefully test to find the optimum balance.
Most performance issues are related to the data flow. As with the control flow, think about whether SSIS or transformations in SQL will be faster. Try to visualize the data flow as a pipeline with data flowing through. You want to maximize the flow rate to get data to the destination as quickly as possible. There are two important properties you can set to influence the memory buffers: DefaultBufferMaxRows and DefaultBufferSize.
The actual buffer size will be determined by which of the two properties is reached first.
You can set AutoAdjustBufferSize to True to make sure that the specified number of rows
in DefaultBufferMaxRows is always met.
Some guidelines:
Don’t use the dropdown box to select the source table. Write a SQL statement and include
filtering, grouping and sorting in the SQL code.
Only select columns you actually need.
Keep the data types of the columns small. The more rows you can fit in a single memory
buffer, the better.
SSIS Transforming Data Performance Optimizations
Don’t use blocking transformations (e.g. sort and aggregate component). They read all data
in memory before even sending one single row to the output. Asynchronous
transformations are to be avoided as well since they modify the memory buffer. You can
find a good overview in this blog post.
Avoid the Slowly Changing Dimension Wizard. It uses the OLE DB Command, which
executes SQL statements row-by-row, which is slow and results in excessive logging.
Don’t use the OLE DB Command, as stated in the previous point.
SSIS Writing Data Performance Optimizations
Writing data is typically the slowest part of the process. Here are some tips to optimize the
process:
The OLE DB Destination is the fastest adapter for SQL Server at the moment, provided you use the Fast Load option.
Make sure you use a table lock (which is enabled by default).
To speed up inserts, you can disable constraints and drop and recreate indexes.
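A sketch of that pattern, which you would wrap around the data flow in Execute SQL Tasks; the table and index names are illustrative:

-- Before the data flow: stop checking constraints and disable a non-clustered index
ALTER TABLE [dbo].[DimCity] NOCHECK CONSTRAINT ALL;
ALTER INDEX [IX_DimCity_StateProvinceID] ON [dbo].[DimCity] DISABLE;

-- ... the data flow loads the table here ...

-- After the data flow: rebuild the index and re-validate the constraints
ALTER INDEX [IX_DimCity_StateProvinceID] ON [dbo].[DimCity] REBUILD;
ALTER TABLE [dbo].[DimCity] WITH CHECK CHECK CONSTRAINT ALL;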
For instance, suppose we are working with stock market data, and every day we are getting billions of rows in .csv (comma separated values) format. Our task is to copy the data inside this .csv file to a SQL Server database table every day. We usually have two approaches to do this bulk load in SSIS:
Drag and drop a Data Flow Task, and inside the data flow drag and drop a Flat File Source and an OLE DB Destination and copy the data. This approach is useful if we want to perform any SSIS transformations.
Use the SSIS Bulk Insert Task. This approach is more powerful compared to the previous one because, internally, the Bulk Insert Task uses the Bulk Copy (BCP) operation, which is very fast in SQL Server.
The Bulk Insert Task exposes several options:
CodePage: Specify the code page of the data in the data file. Generally used for other languages.
DataFileType: Specify the data-type value to use in the load operation.
BatchSize: Specify the number of rows in a batch. The default is the entire data file. If you set BatchSize to zero, the data is loaded in a single batch. For instance, if we set the batch size to 100, then each batch acts as one transaction, and if the task fails after some time, the batches that already loaded successfully will not be rolled back.
LastRow: Specify the last row to copy.
FirstRow: Specify the first row from which copying starts.
SortedData: Specify the ORDER BY clause in the bulk insert statement.
The default is false.
MaxErrors: Specify the maximum number of errors that can occur before the bulk insert operation is canceled. A value of 0 indicates that an infinite number of errors are allowed.
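These options map closely onto the T-SQL BULK INSERT statement the task issues behind the scenes. A minimal sketch, with a hypothetical file and table:

BULK INSERT [dbo].[StockQuotes]  -- hypothetical destination table
FROM 'C:\Imports\quotes.csv'     -- hypothetical file path
WITH (
    DATAFILETYPE = 'char',
    FIELDTERMINATOR = ',',
    ROWTERMINATOR = '\n',
    FIRSTROW = 2,        -- skip a header row
    BATCHSIZE = 100000,  -- each batch commits as its own transaction
    MAXERRORS = 10       -- cancel the load after ten bad rows
);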
Check constraints: Checks the column data.
Enable identity insert: Select to insert existing values into an identity column.
Table lock: Select to lock the table during the bulk insert.
Introduction
An SSIS package’s control flow is useful for executing multiple tasks and designing a workflow for execution. A container in the control flow plays an essential role in workflow design. We can see the following containers in the SSIS Toolbox:
Sequence Container
The sequence container in SSIS is useful for grouping tasks together. We can split the control flow into multiple logical units using it. We will explore the Sequence container further in this article.
We can define variables under the scope of tasks inside a sequence container
It follows a parent-child relationship with the underlying tasks. We can change a property of a sequence container (the parent), and it is propagated to the tasks inside it (the children)
It provides flexibility to manage the tasks in a container
Suppose you have a control flow for executing the following SQL tasks daily:
Currently, we have a similar procedure on each task that runs daily. It is also running on a fixed schedule
by SQL Server Agent. Now, due to some business requirements, your development team created
separate stored procedures for each day of the week.
We could create separate SSIS packages for each day and schedule separate SQL Server Agent jobs, but that increases the complexity of managing the packages:
Sequence Container in SSIS package solves this problem for us. Let’s explore the solution.
Drag a sequence container from the SSIS toolbox to the control flow area. Currently, it does not have
any tasks associated with it:
Now, drag and drop SQL task 1 inside the Sunday container. You get the following error message that a connected task cannot be moved to a new container, because SQL task 1 is connected to other tasks using precedence constraints:
We can either remove the precedence constraints, or select all SQL tasks together and move them into the container as one. Once we select all the tasks together, you can see bold outlines for each task:
Now, move them together inside the Sunday sequence container and resize the container so that we can fit another sequence container on the screen as well. I have renamed the tasks and given them shorter names:
Make similar copies of the sequence container in the SSIS package for the rest of the week with
appropriate scripts.
Note: We are not covering the configuration of individual tasks inside the container. You should have
basic SSIS knowledge before using this article.
Now, my SSIS package looks like below with a Sequence container in SSIS for each day of the week.
Currently, if we execute the SSIS package, it will execute each sequence container individually.
In the following screenshot, we can see that for each sequence container, task 1 fails and this marks the whole container as failed:
It did not execute task 3 because task 3 has multiple precedence constraints and, by default, all inputs to a task should be true.
Changing the constraint to a logical OR changes the solid precedence lines to dotted lines. Fix the issue and execute the package again, and we can see each sequence container runs the tasks inside it:
Now, we need to execute the Sequence container based on the day of the week. For this, right-click on
the package and add a variable:
Click on Add variable and provide a name and data type for the variable. By default, the variable scope is at the package level. We will use this variable to hold the current day of the week:
Add a new Execute SQL Task and rename it to find the day of the week:
Double-click on this task, and it opens the editor window. Make the following changes in this editor:
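The main change is the SQLStatement, combined with setting the ResultSet property to Single row. The exact query in the screenshot may differ, but it is a one-liner along these lines:

SELECT DATENAME(WEEKDAY, GETDATE()) AS [Day];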
This query uses the DATENAME function and GETDATE function to find today’s day of the week. For
example, it returns Wednesday for 27/11/2019.
Navigate to the Result set and map the query output with the SSIS variable:
Click OK and join the precedence constraint from the SQL task to the Sunday sequence container. Double-click the constraint, set its evaluation operation to Expression and use the following expression:
@[User::Day]=="Sunday"
You can click on the test to verify the expression. It gives the following message for successful
validation:
Click OK, and you can see the following configuration for precedence constraint with Sunday sequence
container in SSIS:
Similarly, add precedence constraints from the SQL task to the respective sequence containers. Make sure to change the expression for the particular day of the week. You can refer to the following table for the expressions:
Monday @[User::Day]=="Monday"
Tuesday @[User::Day]=="Tuesday"
Wednesday @[User::Day]=="Wednesday"
Thursday @[User::Day]=="Thursday"
Friday @[User::Day]=="Friday"
Saturday @[User::Day]=="Saturday"
For example, I am running this package on 27/11/2019, which is a Wednesday. Let’s execute the SSIS package. It should execute only the sequence container for Wednesday:
Here we go. In the following screenshot, we can see that only the Wednesday Sequence container in
SSIS is executed:
Disabling a Sequence container greys out the tasks inside it as well:
We can design nested Sequence containers as well. In the following screenshot, we added a Sequence container inside the Sunday Sequence container. Once task 2 is successful, it triggers the nested container’s execution:
We can collapse or expand a Sequence container in SSIS package with a click on the arrow as shown
below:
We can configure sequence container properties as well. A few useful properties are:
Conclusion
In this article, we demonstrated the Sequence container in the SSIS package. It is useful for combining tasks and defining the package workflow. You should practice using this container as per your requirements.