Informatica Performance Tuning
Informatica Performance Tuning
Informatica Performance Tuning
Agenda:
1. Performance Tuning Overview
2. Identify Bottlenecks
3. Optimizing at different levels :
Target
Source
Mapping
Session
System
2
Load time is very critical to meet SLA needs of the data availability
in the reports.
How do you improve performance?
Identify Bottlenecks
Eliminate Bottlenecks
Test Load option to see if any improvement in the performance
Add partitions
Change one variable at a time
5
Identify Bottlenecks :
10
11
2.Using Thread statistics: This is the way where we get statistics from
a session log file. Before going we need to know few points about
Thread.
DTM (Data Transformation manager) create a master thread to run our
sessions. For each target load order group in a mapping, the master
thread can create several threads. The types of threads depend on the
session properties and the transformations in the mapping. The number
of threads depends on the partitioning information for each target load
order group in the mapping.
1. Mapping Threads
2. Pre- and Post-Session Threads
3. Reader Threads
4. Transformation Threads
5. Writer Threads
Thread analysis is to decide the mapping performance depending upon
the statistics of threads. we can use these statistics to identifying the
source, target, or transformation bottlenecks.
From session log file we will have 4 entries which give details about
performance.
12
14
Drop indexes and key constraints : The loading of data will be slow
on indexes or key constraints defined tables. Use pre-session
commands to drop indexes before session loading. After loading the
data the constraints or indexes need to be built again using postsession commands.
Increase checkpoint intervals: The performance of loading depends
on how many less check points do we have. To do so increase the
checkpoint interval in the database
17
18
19
20
22
23
25
26
1.Grid:
A Load Balancer distributes tasks to nodes without overloading any
node.
27
5.Caches:
Limit the Number of Connected Ports
With a 64-bit platform, the Integration Service is not limited to the 2 GB
cache limit of a 32-bit platform.
If the allocated cache is not large enough to store the data, the
Integration Service stores the data in a temporary disk file, a cache file.
Performance slows each time the Integration Service pages to a
temporary file.
The Transformation_readfromdisk or Transformation_writetodisk
counters for any Aggregator, Rank, or Joiner transformation indicate the
number of times the Integration Service pages to disk to process the
transformation.
6.Target-Based Commit :
If the commit interval is too high, the Integration Service may fill the
database log file and cause the session to fail.
28
7.Real-time Processing:
Increase the flush latency to improve throughput
Source-based commit interval determines how often the Integration
Service commits real-time data to the target. To obtain the fastest
latency, set the source-based commit to 1.
8.Staging Areas:
The Integration Service can read multiple sources with a single pass,
which can reduce the need for staging areas.
9.Log Files:
Workflows and sessions always create binary logs which can be
accessed in the Administrator tool.
10.Error Tracing:
Set the tracing level appropriately. To debug use Verbose. Use Terse
when you do not want to log error messages for reject data.
29
11.Post-Session Emails:
configure the session to write to log file when you configure post-session
email to attach a session log. Enable flat file logging
31
Use the database partitioning partition type for source and target
databases. Enable parallel queries/inserts
SQ : pass-through partition
Filter : round-robin partition
Sorter : hash auto-keys partitioning. Delete default partition at
Aggregator
Performance Counters :
All transformations have counters. The Integration Service tracks the
number of input rows, output rows, and error rows for each
transformation. Some transformations have performance counters:
right-click the session in the Workflow Monitor and choose Properties.
Click the Properties tab in the details dialog box.
Errorrows
Readfromcache and Writetocache
Readfromdisk and Writetodisk
Rowsinlookupcache
33
If these counters display any number other than zero, you can increase
the cache sizes to improve session performance.
34
35