Tera Data 1222
Tera Data 1222
KEYLABSTRAINING.COM
[email protected]
Introduction
to
Informatica
PowerCenter
KEYLABSTRAINING.COM
[email protected]
Data Warehousing
Data warehousing is the entire process of data extraction,
transformation, and loading of data to the warehouse and
the access of the data by end users and applications
KEYLABSTRAINING.COM
[email protected]
Data Mart
A data mart stores data for a limited number of subject
areas, such as marketing and sales data. It is used to
support specific applications.
An independent data mart is created directly from source
systems.
A dependent data mart is populated from a data
warehouse.
KEYLABSTRAINING.COM
[email protected]
Data Sources
ETL Software
S
T
A
G
I
N
G
Transaction Data
Prod
IBM
Mkt
IMS
HR
VSAM
Fin
Ascential
Oracle
Ext ract
Acctg
Syba se
SAP
Sagent
Web Data
Clickstream
Infor mix
SAS
External Data
Demographic
HarteHanks
Data Stores
Users
SQL
A
R
E
A
O
P
E
R
A
T
I
O
N
A
L
Data Analysis
Tools and
Applications
ANALYSTS
Cognos
Teradata
IBM
Load
Informatica
D
A
T
A
Data
Warehouse
Dat a Marts
Essbase
Marketing
Meta
Dat a
SAS
MANAGERS
Fi nance
Queri es,Reporting,
DSS/EIS,
Data Mining
Micro Strategy
EXECUTIVES
Sales
Microsoft
S
T
O
R
E
Clean/Scrub
Trans form
Fi rst logic
KEYLABSTRAINING.COM
[email protected]
Siebel
Business
Objects
OPERATIO NAL
PERSONNEL
Web
Browser
CUSTOMERS/
SUPPLIERS
KEYLABSTRAINING.COM
[email protected]
Components Of Informatica
Repository Manager
Powercenter Designer
Workflow Manager
Workflow Monitor
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Architecture
KEYLABSTRAINING.COM
[email protected]
Process Flow
Informatica Server moves the data from source to target
based on the workflow and metadata stored in the
repository.
A workflow is a set of instructions how and when to run
the task related to ETL.
Informatica server runs workflow according to the
conditional links connecting tasks.
Session is type of workflow task which describes how to
move the data between source and target using a mapping.
Mapping is a set of source and target definitions linked by
transformation objects that define the rules for data
transformation.
KEYLABSTRAINING.COM
[email protected]
Sources
Power Mart and Power Center access the following sources:
Relational. Oracle, Sybase, Informix, IBM DB2, Microsoft SQL
Server, and Teradata.
File. Fixed and delimited flat file, COBOL file, and XML.
Extended. If you use Power Center, you can purchase additional
Power Connect products to access business sources such as
PeopleSoft, SAP R/3, Siebel, and IBM MQSeries.
Mainframe. If you use Power Center, you can purchase Power
Connect for IBM DB2 for faster access to IBM DB2 on MVS.
Other. Microsoft Excel and Access.
KEYLABSTRAINING.COM
[email protected]
Targets
Power Mart and Power Center can load data into the
following targets:
Relational. Oracle, Sybase, Sybase IQ, Informix, IBM DB2,
Microsoft SQL Server, and Teradata.
File. Fixed and delimited flat files and XML.
Extended. If you use Power Center, you can purchase an
integration server to load data into SAP BW. You can also purchase
Power Connect for IBM MQSeries to load data into IBM MQSeries
message queues.
Other. Microsoft Access.
You can load data into targets using ODBC or native drivers,
FTP, or external loaders.
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Repository
The Informatica repository is a set of tables that stores the
metadata you create using the Informatica Client tools.
You create a database for the repository, and then use the
Repository Manager to create the metadata tables in the
database.
You add metadata to the repository tables when you
perform tasks in the Informatica Client application such as
creating users, analyzing sources, developing mappings or
mapplets, or creating sessions. The Informatica Server
reads metadata created in the Client application when you
run a session. The Informatica Server also creates
metadata such as start and finish times of a session or
session status.
Contd :KEYLABSTRAINING.COM
[email protected]
Repository Contd..
When you use Power Center, you can develop global and
local repository to share metadata:
Global repository. The global repository is the hub of the domain.
Use the global repository to store common objects that multiple
developers can use through shortcuts. These objects may include
operational
or
application
source
definitions,
reusable
transformations, mapplets, and mappings.
Local repositories. A local repository is within a domain that is
not the global repository. Use local repositories for development.
From a local repository, you can create shortcuts to objects in
shared folders in the global repository. These objects typically
include source definitions, common dimensions and lookups, and
enterprise standard transformations. You can also create copies
of objects in non-shared folders.
KEYLABSTRAINING.COM
[email protected]
Repository Architecture
Repository Client
Repository Server
---------------------------Repository Agent
Repository Database
KEYLABSTRAINING.COM
[email protected]
Creating a Repository
To create Repository
1.
Launch the Repository Manager by choosing Programs-Power
Center (or
Power Mart) Client-Repository Manager from the
Start Menu.
2.
In
Repository.
the
Repository
Manager,
choose
Repository-Create
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Folders
Folders provide a way to organize and store all metadata in the
repository, including mappings, schemas, and sessions. Folders are
designed to be flexible, to help you organize your data warehouse
logically. Each folder has a set of properties you can configure to
define how users access the folder. For example, you can create a
folder that allows all repository users to see objects within the folder,
but not to edit them. Or you can create a folder that allows users to
share objects within the folder.
Shared Folders
When you create a folder, you can configure it as a shared folder.
Shared folders allow users to create shortcuts to objects in the
folder. If you have reusable transformation that you want to use in
several mappings or across multiple folders, you can place the object
in a shared folder.
For example, you may have a reusable Expression transformation
that calculates sales commissions. You can then use the object in
other folders by creating a shortcut to the object.
KEYLABSTRAINING.COM
[email protected]
Folder Permissions
Permissions allow repository users to perform tasks within a
folder. With folder permissions, you can control user access to the
folder, and the tasks you permit them to perform.
Folder permissions work closely with repository privileges.
Privileges grant access to specific tasks while permissions grant
access to specific folders with read, write, and execute qualifiers.
However, any user with the Super User privilege can perform all
tasks across all folders in the repository. Folders have the
following types of permissions:
Read permission. Allows you to view the folder as well as objects in
the folder.
Write permission. Allows you to create or edit objects in the folder.
Execute permission. Allows you to execute or schedule a session or
batch in the folder.
KEYLABSTRAINING.COM
[email protected]
Creating Folders
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Questions/Comments
KEYLABSTRAINING.COM
[email protected]
Designer
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Importing Sources
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Creating Targets
You can create target definitions in the Warehouse Designer for file and
relational sources. Create definitions in the following ways:
Import the definition for an existing target. Import the target
definition from a relational target.
Create a target definition based on a source definition. Drag
one of the following existing source definitions into the Warehouse
Designer to make a target definition:
Creating targets
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Contd..
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Transformations
KEYLABSTRAINING.COM
[email protected]
Transformations
A transformation is a repository object that generates,
modifies, or passes data
The Designer provides a set of transformations that
perform specific functions
Data passes into and out of transformations through
ports that you connect in a mapping or mapplet
Transformations can be active or passive
KEYLABSTRAINING.COM
[email protected]
Transformations
Active transformations
Aggregator
performs aggregate calculations
Filter
serves as a conditional filter
Router
serves as a conditional filter (more than one filters)
Joiner
allows for heterogeneous joins
Source qualifier
represents all data queried from the source
Passive transformations
Expression
performs simple calculations
Lookup
looks up values and passes to other objects
Sequence generator generates unique ID values
Stored procedure
calls a stored procedure and captures return
values
Update strategy
allows for logic to insert, update, delete, or reject
data
KEYLABSTRAINING.COM
[email protected]
Transformations Contd..
Create the transformation. Create it in the Mapping Designer
as part of a
Developer as a reusable
transformation.
Configure the transformation. Each type of transformation
has a unique set of
KEYLABSTRAINING.COM
[email protected]
Expression Transformation
You can use the Expression transformations to calculate values in
a single row before you write to the target.
For example, you might need to adjust employee salaries,
concatenate first and last
names, or convert strings to
numbers.
You can use the Expression transformation to perform any nonaggregate calculations.
You can also use the Expression transformation to test conditional
statements before
you output the results to target tables or
other transformations.
KEYLABSTRAINING.COM
[email protected]
Expression Transformation
Calculating Values
To use the Expression transformation to calculate values for a single
row, you must include the following ports:
Input or input/output ports for each value used in the calculation.
For
example, when
configuration
expression.
Variable Port : Variable Port is used like local variable inside Expression
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Description
SQL Query
Defines a custom query that replaces the default query the Informatica Server uses
to read data from sources represented in this Source Qualifier
User-Defined
Join
Specifies the condition used to join data from multiple sources represented in the
same Source Qualifier transformation
Source Filter
Specifies the filter condition the Informatica Server applies when querying records.
Number of
Sorted
Ports
Indicates the number of columns used when sorting records queried from relational
sources. If you select this option, the Informatica Server adds an ORDER BY to
the default query when it reads source records. The ORDER BY includes the
number of ports specified, starting from the top of the Source Qualifier.
When selected, the database sort order must match the session sort order.
Tracing Level
Sets the amount of detail included in the session log when you run a session
containing this transformation.
Select Distinct
Specifies if you want to select only unique records. The Informatica Server includes a
SELECT DISTINCT statement if you choose this option.
KEYLABSTRAINING.COM
[email protected]
Joiner Transformation
While a Source Qualifier transformation can join data originating from a common source database, the
Joiner transformation joins two related
heterogeneous sources residing in different locations or file systems. The combination of sources can be
varied. You can use the following sources:
If two relational sources contain keys, then a Source Qualifier transformation can easily join the sources
on those keys. Joiner transformations typically combine information from two different sources that do
not have matching keys, such as flat file sources.
The Joiner transformation allows you to join sources that contain binary data.
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
KEYLABSTRAINING.COM
[email protected]
Description
Case-Sensitive
String
Comparison
Cache Directory
Specifies the directory used to cache master records and the index to
these records. By default, the caches are created in a directory
specified by the server variable $PMCacheDir. If you override the
directory, be sure there is enough disk space on the file system. The
directory can be a mapped or mounted drive.
Join Type
Specifies the type of join: Normal, Master Outer, Detail Outer, or Full
Outer.
KEYLABSTRAINING.COM
[email protected]
Lookup Transformation
Used to look up data in a relational table, view, synonym or Flat File.
It compares Lookup transformation port values to lookup table column
values based on the lookup condition.
Connected Lookups
Receives input values directly from another transformation in the
pipeline
For each input row, the Informatica Server queries the lookup table or
cache based on the lookup ports and the condition in the transformation
Passes return values from the query to the next transformation
Un Connected Lookups
Receives input values from an expression using the
KEYLABSTRAINING.COM
[email protected]
Lookup Transformation
KEYLABSTRAINING.COM
[email protected]
Unconnected lookup
KEYLABSTRAINING.COM
[email protected]
Static Cache
Dynamic Cache
KEYLABSTRAINING.COM
[email protected]
Within a session. When you configure a session, you can instruct the Informatica
Server to either treat all records in the same way (for example, treat all records as
inserts), or use instructions coded into the session mapping to flag records for
different database operations.
Within a mapping. Within a mapping, you use the Update
transformation to flag records for insert, delete, update, or reject .
KEYLABSTRAINING.COM
[email protected]
Strategy
Setting
Description
Insert
Delete
Treat all records as deletes. For each record, if the Informatica Server finds a
corresponding record in the target table (based on the primary key value),
the Informatica Server deletes it. Note that the primary key constraint must
exist in the target definition in the repository.
Update
Treat all records as updates. For each record, the Informatica Server looks for
a matching primary key value in the target table. If it exists, the Informatica
Server updates the record. Again, the primary key constraint must exist in
the target definition.
Data
Driven
setting you choose depends on your update strategy and the status of data in target tables:
Setting
Use To
Insert
Populate the target tables for the first time, or maintaining a historical
data warehouse. In the latter case, you must set this strategy for the
entire data warehouse, not just a select group of target tables.
Delete
Update
Update target tables. You might choose this setting whether your data
warehouse contains historical data or a snapshot. Later, when you
configure how to update individual target tables, you can determine
whether to insert updated records as new records or use the updated
information to modify existing records in the target.
Data
Driven
Exert finer control over how you flag records for insert, delete, update,
or reject. Choose this setting if records destined for the same table
need to be flagged on occasion for one operation (for example, update),
or for a different operation (for example, reject). In addition, this
setting provides the only way you can flag records for reject.
KEYLABSTRAINING.COM
[email protected]
Informatica Online
Training By
Keylanstraining.com
Ph:+919550645679(IND).
KEYLABSTRAINING.COM
[email protected]