PCX 961 GettingStartedGuide en
PCX 961 GettingStartedGuide en
(Version 9.6.1)
Version 9.6.1
June 2014
This software and documentation contain proprietary information of Informatica Corporation and are provided under a license agreement containing restrictions on use
and disclosure and are also protected by copyright law. Reverse engineering of the software is prohibited. No part of this document may be reproduced or transmitted in
any form, by any means (electronic, photocopying, recording or otherwise) without prior consent of Informatica Corporation. This Software may be protected by U.S.
and/or international Patents and other Patents Pending.
Use, duplication, or disclosure of the Software by the U.S. Government is subject to the restrictions set forth in the applicable software license agreement and as
provided in DFARS 227.7202-1(a) and 227.7702-3(a) (1995), DFARS 252.227-7013©(1)(ii) (OCT 1988), FAR 12.212(a) (1995), FAR 52.227-19, or FAR 52.227-14
(ALT III), as applicable.
The information in this product or documentation is subject to change without notice. If you find any problems in this product or documentation, please report them to us
in writing.
Informatica, Informatica Platform, Informatica Data Services, PowerCenter, PowerCenterRT, PowerCenter Connect, PowerCenter Data Analyzer, PowerExchange,
PowerMart, Metadata Manager, Informatica Data Quality, Informatica Data Explorer, Informatica B2B Data Transformation, Informatica B2B Data Exchange Informatica
On Demand, Informatica Identity Resolution, Informatica Application Information Lifecycle Management, Informatica Complex Event Processing, Ultra Messaging and
Informatica Master Data Management are trademarks or registered trademarks of Informatica Corporation in the United States and in jurisdictions throughout the world.
All other company and product names may be trade names or trademarks of their respective owners.
Portions of this software and/or documentation are subject to copyright held by third parties, including without limitation: Copyright DataDirect Technologies. All rights
reserved. Copyright © Sun Microsystems. All rights reserved. Copyright © RSA Security Inc. All Rights Reserved. Copyright © Ordinal Technology Corp. All rights
reserved.Copyright © Aandacht c.v. All rights reserved. Copyright Genivia, Inc. All rights reserved. Copyright Isomorphic Software. All rights reserved. Copyright © Meta
Integration Technology, Inc. All rights reserved. Copyright © Intalio. All rights reserved. Copyright © Oracle. All rights reserved. Copyright © Adobe Systems
Incorporated. All rights reserved. Copyright © DataArt, Inc. All rights reserved. Copyright © ComponentSource. All rights reserved. Copyright © Microsoft Corporation. All
rights reserved. Copyright © Rogue Wave Software, Inc. All rights reserved. Copyright © Teradata Corporation. All rights reserved. Copyright © Yahoo! Inc. All rights
reserved. Copyright © Glyph & Cog, LLC. All rights reserved. Copyright © Thinkmap, Inc. All rights reserved. Copyright © Clearpace Software Limited. All rights
reserved. Copyright © Information Builders, Inc. All rights reserved. Copyright © OSS Nokalva, Inc. All rights reserved. Copyright Edifecs, Inc. All rights reserved.
Copyright Cleo Communications, Inc. All rights reserved. Copyright © International Organization for Standardization 1986. All rights reserved. Copyright © ej-
technologies GmbH. All rights reserved. Copyright © Jaspersoft Corporation. All rights reserved. Copyright © is International Business Machines Corporation. All rights
reserved. Copyright © yWorks GmbH. All rights reserved. Copyright © Lucent Technologies. All rights reserved. Copyright (c) University of Toronto. All rights reserved.
Copyright © Daniel Veillard. All rights reserved. Copyright © Unicode, Inc. Copyright IBM Corp. All rights reserved. Copyright © MicroQuill Software Publishing, Inc. All
rights reserved. Copyright © PassMark Software Pty Ltd. All rights reserved. Copyright © LogiXML, Inc. All rights reserved. Copyright © 2003-2010 Lorenzi Davide, All
rights reserved. Copyright © Red Hat, Inc. All rights reserved. Copyright © The Board of Trustees of the Leland Stanford Junior University. All rights reserved. Copyright
© EMC Corporation. All rights reserved. Copyright © Flexera Software. All rights reserved. Copyright © Jinfonet Software. All rights reserved. Copyright © Apple Inc. All
rights reserved. Copyright © Telerik Inc. All rights reserved. Copyright © BEA Systems. All rights reserved. Copyright © PDFlib GmbH. All rights reserved. Copyright ©
Orientation in Objects GmbH. All rights reserved. Copyright © Tanuki Software, Ltd. All rights reserved. Copyright © Ricebridge. All rights reserved. Copyright © Sencha,
Inc. All rights reserved.
This product includes software developed by the Apache Software Foundation (http://www.apache.org/), and/or other software which is licensed under various versions
of the Apache License (the "License"). You may obtain a copy of these Licenses at http://www.apache.org/licenses/. Unless required by applicable law or agreed to in
writing, software distributed under these Licenses is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
implied. See the Licenses for the specific language governing permissions and limitations under the Licenses.
This product includes software which was developed by Mozilla (http://www.mozilla.org/), software copyright The JBoss Group, LLC, all rights reserved; software
copyright © 1999-2006 by Bruno Lowagie and Paulo Soares and other software which is licensed under various versions of the GNU Lesser General Public License
Agreement, which may be found at http:// www.gnu.org/licenses/lgpl.html. The materials are provided free of charge by Informatica, "as-is", without warranty of any
kind, either express or implied, including but not limited to the implied warranties of merchantability and fitness for a particular purpose.
The product includes ACE(TM) and TAO(TM) software copyrighted by Douglas C. Schmidt and his research group at Washington University, University of California,
Irvine, and Vanderbilt University, Copyright (©) 1993-2006, all rights reserved.
This product includes software developed by the OpenSSL Project for use in the OpenSSL Toolkit (copyright The OpenSSL Project. All Rights Reserved) and
redistribution of this software is subject to terms available at http://www.openssl.org and http://www.openssl.org/source/license.html.
This product includes Curl software which is Copyright 1996-2013, Daniel Stenberg, <[email protected]>. All Rights Reserved. Permissions and limitations regarding this
software are subject to terms available at http://curl.haxx.se/docs/copyright.html. Permission to use, copy, modify, and distribute this software for any purpose with or
without fee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.
The product includes software copyright 2001-2005 (©) MetaStuff, Ltd. All Rights Reserved. Permissions and limitations regarding this software are subject to terms
available at http://www.dom4j.org/ license.html.
The product includes software copyright © 2004-2007, The Dojo Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to
terms available at http://dojotoolkit.org/license.
This product includes ICU software which is copyright International Business Machines Corporation and others. All rights reserved. Permissions and limitations
regarding this software are subject to terms available at http://source.icu-project.org/repos/icu/icu/trunk/license.html.
This product includes software copyright © 1996-2006 Per Bothner. All rights reserved. Your right to use such materials is set forth in the license which may be found at
http:// www.gnu.org/software/ kawa/Software-License.html.
This product includes OSSP UUID software which is Copyright © 2002 Ralf S. Engelschall, Copyright © 2002 The OSSP Project Copyright © 2002 Cable & Wireless
Deutschland. Permissions and limitations regarding this software are subject to terms available at http://www.opensource.org/licenses/mit-license.php.
This product includes software developed by Boost (http://www.boost.org/) or under the Boost software license. Permissions and limitations regarding this software are
subject to terms available at http:/ /www.boost.org/LICENSE_1_0.txt.
This product includes software copyright © 1997-2007 University of Cambridge. Permissions and limitations regarding this software are subject to terms available at
http:// www.pcre.org/license.txt.
This product includes software copyright © 2007 The Eclipse Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to terms
available at http:// www.eclipse.org/org/documents/epl-v10.php and at http://www.eclipse.org/org/documents/edl-v10.php.
This product includes software licensed under the terms at http://www.tcl.tk/software/tcltk/license.html, http://www.bosrup.com/web/overlib/?License, http://
www.stlport.org/doc/ license.html, http:// asm.ow2.org/license.html, http://www.cryptix.org/LICENSE.TXT, http://hsqldb.org/web/hsqlLicense.html, http://
httpunit.sourceforge.net/doc/ license.html, http://jung.sourceforge.net/license.txt , http://www.gzip.org/zlib/zlib_license.html, http://www.openldap.org/software/release/
license.html, http://www.libssh2.org, http://slf4j.org/license.html, http://www.sente.ch/software/OpenSourceLicense.html, http://fusesource.com/downloads/license-
agreements/fuse-message-broker-v-5-3- license-agreement; http://antlr.org/license.html; http://aopalliance.sourceforge.net/; http://www.bouncycastle.org/licence.html;
http://www.jgraph.com/jgraphdownload.html; http://www.jcraft.com/jsch/LICENSE.txt; http://jotm.objectweb.org/bsd_license.html; . http://www.w3.org/Consortium/Legal/
2002/copyright-software-20021231; http://www.slf4j.org/license.html; http://nanoxml.sourceforge.net/orig/copyright.html; http://www.json.org/license.html; http://
forge.ow2.org/projects/javaservice/, http://www.postgresql.org/about/licence.html, http://www.sqlite.org/copyright.html, http://www.tcl.tk/software/tcltk/license.html, http://
www.jaxen.org/faq.html, http://www.jdom.org/docs/faq.html, http://www.slf4j.org/license.html; http://www.iodbc.org/dataspace/iodbc/wiki/iODBC/License; http://
www.keplerproject.org/md5/license.html; http://www.toedter.com/en/jcalendar/license.html; http://www.edankert.com/bounce/index.html; http://www.net-snmp.org/about/
license.html; http://www.openmdx.org/#FAQ; http://www.php.net/license/3_01.txt; http://srp.stanford.edu/license.txt; http://www.schneier.com/blowfish.html; http://
www.jmock.org/license.html; http://xsom.java.net; http://benalman.com/about/license/; https://github.com/CreateJS/EaselJS/blob/master/src/easeljs/display/Bitmap.js;
http://www.h2database.com/html/license.html#summary; http://jsoncpp.sourceforge.net/LICENSE; http://jdbc.postgresql.org/license.html; http://
protobuf.googlecode.com/svn/trunk/src/google/protobuf/descriptor.proto; https://github.com/rantav/hector/blob/master/LICENSE; http://web.mit.edu/Kerberos/krb5-
current/doc/mitK5license.html; http://jibx.sourceforge.net/jibx-license.html; and https://github.com/lyokato/libgeohash/blob/master/LICENSE.
This product includes software licensed under the Academic Free License (http://www.opensource.org/licenses/afl-3.0.php), the Common Development and Distribution
License (http://www.opensource.org/licenses/cddl1.php) the Common Public License (http://www.opensource.org/licenses/cpl1.0.php), the Sun Binary Code License
Agreement Supplemental License Terms, the BSD License (http:// www.opensource.org/licenses/bsd-license.php), the new BSD License (http://opensource.org/
licenses/BSD-3-Clause), the MIT License (http://www.opensource.org/licenses/mit-license.php), the Artistic License (http://www.opensource.org/licenses/artistic-
license-1.0) and the Initial Developer’s Public License Version 1.0 (http://www.firebirdsql.org/en/initial-developer-s-public-license-version-1-0/).
This product includes software copyright © 2003-2006 Joe WaInes, 2006-2007 XStream Committers. All rights reserved. Permissions and limitations regarding this
software are subject to terms available at http://xstream.codehaus.org/license.html. This product includes software developed by the Indiana University Extreme! Lab.
For further information please visit http://www.extreme.indiana.edu/.
This product includes software Copyright (c) 2013 Frank Balluffi and Markus Moeller. All rights reserved. Permissions and limitations regarding this software are subject
to terms of the MIT license.
This Software is protected by U.S. Patent Numbers 5,794,246; 6,014,670; 6,016,501; 6,029,178; 6,032,158; 6,035,307; 6,044,374; 6,092,086; 6,208,990; 6,339,775;
6,640,226; 6,789,096; 6,823,373; 6,850,947; 6,895,471; 7,117,215; 7,162,643; 7,243,110; 7,254,590; 7,281,001; 7,421,458; 7,496,588; 7,523,121; 7,584,422;
7,676,516; 7,720,842; 7,721,270; 7,774,791; 8,065,266; 8,150,803; 8,166,048; 8,166,071; 8,200,622; 8,224,873; 8,271,477; 8,327,419; 8,386,435; 8,392,460;
8,453,159; 8,458,230; and RE44,478, International Patents and other Patents Pending.
DISCLAIMER: Informatica Corporation provides this documentation "as is" without warranty of any kind, either express or implied, including, but not limited to, the
implied warranties of noninfringement, merchantability, or use for a particular purpose. Informatica Corporation does not warrant that this software or documentation is
error free. The information provided in this software or documentation may include technical inaccuracies or typographical errors. The information in this software and
documentation is subject to change at any time without notice.
NOTICES
This Informatica product (the "Software") includes certain drivers (the "DataDirect Drivers") from DataDirect Technologies, an operating company of Progress Software
Corporation ("DataDirect") which are subject to the following terms and conditions:
1. THE DATADIRECT DRIVERS ARE PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT
LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT.
2. IN NO EVENT WILL DATADIRECT OR ITS THIRD PARTY SUPPLIERS BE LIABLE TO THE END-USER CUSTOMER FOR ANY DIRECT, INDIRECT,
INCIDENTAL, SPECIAL, CONSEQUENTIAL OR OTHER DAMAGES ARISING OUT OF THE USE OF THE ODBC DRIVERS, WHETHER OR NOT
INFORMED OF THE POSSIBILITIES OF DAMAGES IN ADVANCE. THESE LIMITATIONS APPLY TO ALL CAUSES OF ACTION, INCLUDING, WITHOUT
LIMITATION, BREACH OF CONTRACT, BREACH OF WARRANTY, NEGLIGENCE, STRICT LIABILITY, MISREPRESENTATION AND OTHER TORTS.
Table of Contents i
Chapter 4: Developing a Mapping. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Developing a Mapping Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Step 1. Create a Mapping. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Step 2. Add a Transformation to the Mapping. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
Step 3. Add a Target to the Mapping. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Step 4. Run the Mapping and Review Results. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
PowerCenter Express Tips. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Appendix A: Glossary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
ii Table of Contents
Preface
The PowerCenter Express Getting Started Guide is written for developers. It provides a tutorial to help first-
time users learn how to use Informatica Developer and Informatica Administrator for PowerCenter Express
tasks. This guide assumes that you have an understanding of flat file concepts, relational database concepts,
and the database engines in your environment.
Informatica Resources
The site contains product information, user group information, newsletters, access to the Informatica How-To
Library, the Informatica Knowledge Base, Informatica Product Documentation, and access to the Informatica
user community.
Informatica Documentation
The Informatica Documentation team takes every effort to create accurate, usable documentation. If you
have questions, comments, or ideas about this documentation, contact the Informatica Documentation team
through email at [email protected]. We will use your feedback to improve our
documentation. Let us know if we can contact you regarding your comments.
The Documentation team updates documentation as needed. To get the latest documentation for your
product, navigate to Product Documentation from http://mysupport.informatica.com.
iii
contact the Support YouTube team through email at [email protected] or send a tweet to
@INFASupport.
Informatica Marketplace
The Informatica Marketplace is a forum where developers and partners can share solutions that augment,
extend, or enhance data integration implementations. By leveraging any of the hundreds of solutions
available on the Marketplace, you can improve your productivity and speed up time to implementation on
your projects. You can access Informatica Marketplace at http://www.informaticamarketplace.com.
Informatica Velocity
You can access Informatica Velocity at http://mysupport.informatica.com. Developed from the real-world
experience of hundreds of data management projects, Informatica Velocity represents the collective
knowledge of our consultants who have worked with organizations from around the world to plan, develop,
deploy, and maintain successful data management solutions. If you have questions, comments, or ideas
about Informatica Velocity, contact Informatica Professional Services at [email protected].
iv Preface
CHAPTER 1
Introduction to PowerCenter
Express
This chapter includes the following topics:
You can use PowerCenter Express to extract data from multiple sources, transform the data according to
business logic that you build in the client application, and load the transformed data to targets. You can also
run a profile to analyze the structure and content of your data, and to determine the quality of your data.
You can access data in relational databases, flat files, web services, and social media web sites.
PowerCenter Express includes the Informatica domain, application services that process data, repositories to
store metadata, Informatica Administrator (the Administrator tool), and Informatica Developer (the Developer
tool).
The Administrator tool, the repositories, and application services run on a single machine. The Developer tool
can run on one or more machines.
An organization wants to consolidate data from three flat files that contain customer records from different
geographic areas. After analyzing the content of the data, the organization wants to load the data to a
relational database table.
To complete this task, the organization uses the Developer tool to run a profile on the data, consolidate the
data, and write the consolidated data to a relational database. Then, the organization uses the Administrator
tool to monitor the progress of the profile job and the progress of the workflow that consolidates the data.
1
PowerCenter Express Architecture
The PowerCenter Express application clients, application services, and repositories are components that run
within the Informatica domain. The Informatica domain is the fundamental administrative unit in Informatica.
The following figure shows the PowerCenter Express components that run within the Informatica domain:
Application Clients
A group of clients that you use to access underlying Informatica functionality. Application clients send
requests to the Service Manager or application services. The Service Manager runs the application services
and performs domain functions including authentication, authorization, and logging.
The Informatica domain includes the following application clients for PowerCenter Express:
Informatica Developer
Informatica Developer (the Developer tool) is an application client that developers can use to design and
implement data integration and data federation solutions.
Informatica Administrator
Informatica Administrator (the Administrator tool) is an application client that consolidates the
administrative tasks for domain objects such as services, connections, and licenses. Administrators
manage the domain and the security of the domain through the Administrator tool.
Application Services
A group of services that represent server-based functionality. Administrators configure the application
services that are required by the application clients.
The Informatica domain includes the following application services for PowerCenter Express:
Repositories
Repositories are a group of relational databases that store metadata about objects and processes required to
handle user requests from application clients.
The Informatica domain includes the following databases and directory for PowerCenter Express:
Model repository
The Model repository is a relational database that stores the metadata for projects. The Model repository
also stores run-time and configuration information for applications that are deployed to a Data Integration
Service.
Profile warehouse
The profile warehouse is a relational database that the Data Integration Services uses to store profile
results.
Informatica Developer
Informatica Developer (the Developer tool) is an application client that you can use to design and implement
data integration solutions.
You can use the Developer tool to import metadata, create connections, and create logical data objects.
You can use the Developer tool to complete the following tasks:
Create a physical data object.
Import metadata to create a physical data object or manually create the physical data object. Create
physical data objects for sources, lookups, and targets that you want to use in a mapping, profile, or
mapping specification.
Develop mappings.
Develop mappings to run data integration jobs.
Informatica Developer 3
Informatica Developer User Interface
The Developer tool is an application that you use to design and implement data integration solutions. The
Developer tool workbench includes an editor and views.
You edit objects, such as mappings, in the editor. The Developer tool displays views, such as the Properties
view, based on which object is selected in the editor and your selection of which views you want to display.
Outline view
Displays objects that are dependent on an object selected in the Object Explorer view.
Properties view
Displays the properties for an object that is selected in the editor.
Tags view
Displays tags that define an object in the Model repository based on business usage.
Alerts view
Displays connection status alerts.
You can hide views and move views to another location in the Developer tool workbench. Click Window >
Show View to select the views you want to display.
Help view
Displays context-sensitive online help.
Progress view
Displays the progress of operations in the Developer tool, such as a mapping run.
Search view
Displays the search results. You can also launch the search options dialog box.
Informatica Administrator
Informatica Administrator (the Administrator tool) is an application client that consolidates the administrative
tasks for domain objects such as services, connections, and licenses.
You manage the domain and the security of the domain through the Administrator tool.
Note: If you have PowerCenter Express Personal Edition, you do not have access to the administer security.
Informatica Administrator 5
Informatica Administrator User Interface
The Administrator tool is an application that you use to manage the Informatica domain and the security of
the Informatica domain. The Administrator tool interface contains tabs, header items, views, a navigator, and
a contents panel.
1. Navigator
2. View in the tab
3. Tab
4. Header area
5. Contents panel
The tabs and views that are available in the Administrator tool differ based on your product license and user
permissions. The Navigator displays a hierarchy of objects. The types of objects in the Navigator differ based
on the tab that you select. The contents panel displays details about the object that you select in the
Navigator.
Logs
View log events for the domain and services within the domain.
Monitoring
View the status of profile jobs, preview jobs, mapping jobs, and workflows for the Data Integration
Service.
Security
Manage users, groups, roles, and privileges. If you have PowerCenter Express Personal Edition, you do
not have access to the Security tab.
Help
Access help for the current tab, determine the Informatica version, and configure the data usage policy.
To set up the Developer tool, you connect to the Model repository and create a project and folder to store
your work. You select the default Data Integration Service to preview data and run mappings.
Lesson Concepts
The Informatica domain is a collection of services that perform data integration jobs and monitoring jobs. You
manage the domain through the Administrator tool.
The Model Repository Service manages the Model repository. The Model repository is a relational database
that stores the metadata for projects and folders. A project stores objects that you create in the Developer
tool. A project can also contain folders that store related objects that are part of the same business
requirement.
The Data Integration Service performs data integration jobs for the Developer tool. Data integration jobs
include previewing data, and running profiles, mappings, and workflows.
Lesson Objectives
In this lesson, you complete the following beginner-level tasks:
8
• Create a project to store the objects that you create in the Developer tool.
• Create a folder in the project that can store related objects.
• Select the default Data Integration Service to preview data and to run mappings in the Developer tool.
Lesson Prerequisites
Before you start this lesson, verify the following prerequisites:
Lesson Timing
Set aside 10 to 15 minutes to complete the tasks in this lesson.
If you installed Informatica services and the Informatica client at the same time on the same machine, the
installation process adds the domain to the Developer tool. You can skip this step and continue to “Step 2.
Start Informatica Developer” on page 11.
1. From the Windows Start menu, click Informatica PowerCenter Express > Launch Informatica
Administrator.
The default browser opens to the Administrator tool login page.
2. Enter the user name and password provided to you.
3. Click Log In.
The Domain tab of the Administrator tool opens.
4. Record the name of the domain that appears in the Domain Navigator.
The default domain name is Domain_<machine_name>, where <machine_name> is the name of the
machine where Informatica services is installed.
1. From the Windows Start menu, click Informatica PowerCenter Express > Launch Informatica
Developer.
The Welcome page of the Developer tool appears. If you have started the Developer tool before, the
Developer tool opens to the Workbench.
The steps that you use to connect to the Model repository depend on the following installation methods:
You installed Informatica services and the Informatica client at the same time on the same machine.
The installation process adds the domain and Model repository. You can connect to the Model repository
the first time that you set up the Developer tool.
When you add the domain, use the domain name, host name, and port number that you recorded in “Step 1.
Log In to the Administrator Tool” on page 9.
1. From the Developer tool menu, click File > Connect to Repository.
The Connect to Repository dialog box appears.
3. Click Add.
The New Domain dialog box appears.
4. Enter the domain name, host name, and port number for the domain.
The following table lists the default values for the domain:
8. Click OK.
The Connect to Repository dialog box appears.
9. Click Browse to select the Model Repository Service associated with the Model repository.
The Choose Service dialog box appears.
10. Expand the domain and select the Model Repository Service.
The following figure shows the selected Model Repository Service:
1. In the Object Explorer view, right-click the Model Repository Service associated with the Model
repository to which you want to connect.
2. Select Connect.
The Connect to Domain dialog box appears.
3. Enter the user name and password provided to you.
1. From the Developer tool menu, click File > New > Project.
The New Project dialog box appears.
2. Enter Tutorial for the project name.
If you installed Informatica services and the Informatica client at the same time on the same machine, the
installation process selects the default Data Integration Service. You can skip this step and continue to the
next lesson.
5. Click OK.
The Informatica Marketplace provides prebuilt solutions to augment, extend, or enhance your data
integration implementation.
To access Informatica Marketplace, click the Marketplace button on the toolbar. The Developer tool
opens a Marketplace tab in the editor. You must register as a user before you can log in to the
Marketplace for the first time. You might need to close the Connection Explorer view to better see the
Marketplace content.
Lesson Concepts
A physical data object is the representation of data that is based on a flat file or relational database. You
import physical data objects into the Developer tool so that you can read data from data sources, look up
data from data sources, or write data to data sources. You can use a physical data object as a source or a
target in a mapping. You can review the source data before you transform the data.
A profile is a set of metadata that describes the content and structure of a data set. Data profiling is often the
first step in a project. You can run a profile to evaluate the structure of data and verify that data columns
contain the types of information that you expect. If a profile reveals problems in data, you can define
processes in your project to fix the problems. For example, if a profile reveals that a column contains values
of greater than expected length, you can design data quality processes to remove or fix the problem values.
Lesson Objectives
In this lesson, you complete the following tasks:
Lesson Prerequisites
Before you start this lesson, complete the first lesson in this tutorial.
Lesson Timing
Set aside 15-20 minutes to complete the tasks in this lesson.
21
Step 1. View a Mapping Source
View a mapping source to review the data that the source contains.
1. In the Object Explorer view, select the SALES_TRANSACTIONS flat file data object.
2. Right-click the object and select Profile.
The New window opens.
3. Select Profile and click Next.
The New Profile window opens.
4. Click Browse, select the Tutorial_Objects folder, and click OK.
Export data.
You can export the data that displays in the Data Viewer view to a tab-delimited flat file, such as a TXT
or CSV file. Export data when you want to create a local copy of the data. To export the data, right-click
a row of data in the Data Viewer view and select Export Data.
You can also export column values and column pattern data from profile results to a tab-delimited flat
file, such as a TXT or CSV file. To export column values and column pattern data from a profile, click the
Export Patterns to File button ( ) in the Details section of the Results view.
Developing a Mapping
This chapter includes the following topics:
Lesson Concepts
A mapping is a set of inputs and outputs that represent the data flow between sources and targets. You link
the sources and targets with transformation objects that define the rules for data transformation. Use an
Aggregator transformation in a mapping to perform aggregate calculations, such as averages and sums.
After you validate and save a mapping, you can run the mapping in the Developer tool. When you run a
mapping, the Data Integration Service uses the instructions configured in the mapping to read, transform, and
write data.
You can also add a mapping to an application and deploy the application to the Data Integration Service.
Deploy a mapping to the Data Integration Service so that you can run mappings from the command line.
Lesson Objectives
In this lesson, you complete the following tasks:
• Create a mapping that uses the SALES_TRANSACTIONS flat file data object as a source.
• Add an Aggregator transformation to the mapping that calculates the total and average sales by
dealership.
• Add a flat file data object to the mapping as a target so that the mapping writes the total and average
sales to a flat file.
• Run the mapping and review the results in the target file.
29
Lesson Prerequisites
Before you start this lesson, complete the earlier lessons in this tutorial.
Lesson Timing
Set aside 25-30 minutes to complete the tasks in this lesson.
8. In the Properties view of the Aggregator transformation, click the Ports tab.
9. For the DEALERSHIP port, select the checkbox in the Group By column.
When you group values, the Data Integration Service generates one row for each group.
10. Select Output Only and click the New button ( ) two times to create two output ports.
The Developer tool adds two ports under Output Only with default names of Field and Field1.
11. Select each output port and change the name, type, precision, and scale.
The following table describes the values to enter:
TotalSales decimal 10 2
AverageSales decimal 10 2
5. Click Finish.
The t_DealershipSales data object opens.
6. In the Columns section of the Overview view, click the New button ( ) three times to create three
columns.
The Developer tool adds three columns with default names of Field, Field1, and Field2.
7. Select each column and change the name, native type, precision, and scale.
The following table describes the values to enter:
DealerID string 10 0
TotalSales number 10 2
AverageSales number 10 2
DEALERSHIP DealerID
TotalSales TotalSales
AverageSales AverageSales
1. Right-click the editor of the m_DealershipSales mapping, and select Run Mapping.
The Run Mapping dialog box appears and displays the progress of the mapping run.
2. Click Window > Show View > Progress to view the progress of the mapping run.
3. Click the hyperlinked text that shows the result of the mapping run.
4. Click Go to Log.
The mapping log opens.
5. To save the log to a file, click File > Save a Copy As and choose a directory.
When you link ports automatically, you can link by position or by name. When you link ports
automatically by name, you can specify a prefix or suffix by which to link the ports.
To link ports automatically, select Mapping > Auto Link, select the to and from objects, and then select
whether to link the ports by name or position. If you link ports by name, you can click Show Advanced
to specify a prefix or suffix for port names.
You can convert the mapping objects to icons and align the icons in the editor. To align mapping objects
as icons, click Layout > Arrange All Iconic.
A quick outline displays objects that are dependent on a mapping object selected in the editor. You can
use the quick outline to sort dependent objects by name or by type, or to search for dependent objects.
Select an object in the quick outline to navigate to the object in the editor.
To display the quick outline, select the mapping or an object in the mapping and then click Navigate >
Quick Outline.
Use the point-and-click method to add functions and ports to a port expression.
When you create an expression, you can enter the expression manually or use the point-and-click
method. To minimize errors when you create an expression, select functions and ports from the point-
and-click interface.
To add a function to an expression, double-click the function on the Functions tab. To add a port to an
expression, double-click the port name in the Ports tab.
Running a Workflow
This chapter includes the following topics:
Lesson Concepts
A workflow is a graphical representation of a set of events, tasks, and decisions that define a business
process. You add a Mapping task to a workflow to run a mapping from the workflow. You connect workflow
objects with sequence flows to specify the order that the Data Integration Service runs the objects.
After you validate and save a workflow, you add the workflow to an application and deploy the application to
the Data Integration Service. You run a workflow instance from the deployed application. The Data
Integration Service uses the instructions configured in the workflow to run the events and tasks and to
evaluate the decisions.
Lesson Objectives
In this lesson, you complete the following tasks:
• Review a sample mapping that joins data from two flat file sources.
• Create a workflow and add a Mapping task that runs the sample mapping.
• Deploy the workflow to the Data Integration Service and then run an instance of the workflow.
Lesson Prerequisites
Before you start this lesson, complete the earlier lessons in this tutorial.
Lesson Timing
Set aside 10-15 minutes to complete the tasks in this lesson.
40
Step 1. Review the Mapping
Open and review a sample mapping that uses a Joiner transformation to join data from two flat file sources.
1. In the Object Explorer view, expand Mappings in the TransformationSamples folder located in the
Samples project.
2. Double-click the m_Customers_Transactions_Joiner mapping.
The Developer tool opens the mapping in the editor. The mapping contains two flat file sources. The
Read_UNIQUE_CUSTOMERS source contains information about each customer. The
Read_SALES_TRANSACTIONS source contains information about each sales transaction, including a
CUST_NO column that identifies the customer that made the transaction.
The following figure shows the m_Customers_Transactions_Joiner mapping open in the editor:
6. Select the Mapping task in the editor and click the General tab in the Properties view.
7. Enter mt_m_Customers_Transactions_Joiner for the Mapping task name.
8. Verify that the m_Customers_Transactions_Joiner mapping is selected.
9. In the editor, hold the cursor over the Start event to display a yellow circle. Drag the yellow circle to the
Mapping task.
A sequence flow appears between the objects. Sequence flows connect workflow objects and determine
the order that the Data Integration Service runs the objects in the workflow.
10. Hold the cursor over the Mapping task to display a yellow circle. Drag the yellow circle to the End event.
11. Right-click in the editor, and select Align All to Grid.
The Developer tool arranges the objects in the editor based on data flow and aligns them to a grid.
3. Click Next.
4. Enter Tutorial_Application for the application name and click Finish.
5. In the Deploy Completed dialog box, click Run Object to run an instance of the deployed workflow.
6. In the Run Object dialog box, verify that the workflow is selected and then click OK.
The Developer tool deploys the workflow and then the Data Integration Service runs an instance of the
workflow.
You can use the infacmd wfs startWorkflow command to run additional instances of the same workflow
from the deployed application. For more information, see the PowerCenter Express Command
Reference.
Maximize the editor in the Developer tool when a workflow contains many objects.
If you are developing a workflow that contains many objects, you might want to maximize the editor so
that you can view all objects in the workflow. To maximize the editor, click Window > Navigation >
Maximize Active View or Editor.
Use the editor to find mapping and workflow objects that are open in the editor.
You can find mapping and workflow objects that are open in the editor. To display the fields below the
editor, select Edit > Find/Replace. To find an object, specify a search string and the types of objects to
find.
Monitoring a Workflow
This chapter includes the following topics:
Lesson Concepts
Use the Monitoring tool to monitor the status of running workflow instances. You can view the state, unique
ID, start time, and elapsed time for each workflow instance. You can abort or cancel a running workflow
instance.
You also use the Monitoring tool to view logs for workflow instances. The workflow log file includes
information about errors, task processing, and expression evaluation in sequence flows. When a workflow
instance includes a Mapping task, the Data Integration Service generates a separate log file for the mapping.
The mapping log file includes any errors encountered during the mapping run and load summary and
transformation statistics.
Lesson Objectives
In this lesson, you complete the following tasks:
Lesson Prerequisites
Before you start this lesson, complete the earlier lessons in this tutorial.
Lesson Timing
Set aside 5-10 minutes to complete the tasks in this lesson.
47
Step 1. Access the Monitoring Tool
Access the Monitoring tool from the Developer tool to begin the lesson.
1. Click Window > Show View > Progress to open the Progress view.
2. Click the Menu button ( ) and select Monitor Jobs.
The Choose Data Integration Service dialog box opens.
3. Select the Data Integration Service to which you deployed the workflow, and then click OK.
The Monitoring tool opens.
1. In the Navigator of the Monitoring tool, expand the application named Tutorial_Application, and then
select Workflows.
The contents panel displays the workflow instance that was run from the deployed application.
2. In the contents panel, expand the workflow instance, and then expand the Mapping task.
The contents panel displays details about the workflow instance, the Mapping task, and the mapping that
the Mapping task ran.
3. Select the workflow instance in the contents panel and then click Actions > View Logs for Selected
Object.
The File Download dialog box appears. Notice that the name of the workflow log uses the following
format: <workflow instance ID>_log.txt
4. Click Open to view the workflow log file.
5. Close the workflow log file.
When you monitor workflow instances, you can filter the workflow instances by state. For example, you
might want to filter the workflow instances to display only the Failed workflow instances in the contents
panel.
To filter workflow instances by state, click the arrow above the State column in the contents panel and
select the state that you want to filter by.
View monitoring reports for the Data Integration Service or for an application.
You can view monitoring reports about objects that run on the Data Integration Service. For example,
you can view the Longest Duration Workflows report to determine the workflows that were running the
longest during the specified time period.
To view monitoring reports, select the Data Integration Service or an application in the Navigator. In the
contents panel, click the Reports view.
Creating a Connection
This chapter includes the following topics:
Lesson Concepts
A connection is a repository object that defines a connection in the domain configuration repository. Create
connections to access data from relational databases, ODBC data sources, social media data sources, or
third-party web services.
In this lesson, you create a connection to a relational database. You can create a connection to any of the
following relational database types:
• IBM DB2
• Microsoft SQL Server
• Oracle
In addition, you can create a JDBC or an ODBC connection to access tables in a relational database through
JDBC or ODBC. The properties that you configure for a connection depend on the database type.
Note: All of the physical data objects in the Samples project in the Model repository are flat file data objects.
Flat file data objects do not require a connection. As a result, the domain configuration repository does not
contain sample connections.
Lesson Objectives
In this lesson, you complete the following task:
51
Lesson Prerequisites
Before you start this lesson, verify the following prerequisites:
• You have the relational database host name and port number.
• You have a username and password to connect to the relational database.
Lesson Timing
Set aside 5-10 minutes to complete the task in this lesson.
1. In the Developer tool, click the Create Connection button ( ) in the tool bar.
The New Database Connection dialog box appears.
6. Click Next.
7. Enter a username and password to access the database.
8. Configure the remaining connection properties. The properties depend on the connection type.
9. Click Test Connection to verify that you entered the connection properties correctly and that you can
connect.
10. Click Finish.
The Add Connection dialog box appears.
11. Click Yes to add the connection to the Connection Explorer view.
The new connection appears in the Connection Explorer view.
You can use the connection to import data from the database as a physical data object, preview data in
the database, run a profile on data in the database, and run mappings that read from and write to the
database.
Use the Connection Explorer view in the Developer tool to view data in a relational database.
You can add relational database connections to the Connection Explorer view. You can connect to the
relational database by right-clicking the database in the Connection Explorer view and selecting
Connect. Expand the database schema to view the tables, views, and synonyms in the schema.
You can use the Developer tool and the Administrator tool to create and manage connections. Any
connection that you create in one tool is available in the other tool.
To manage connections in the Administrator tool, click the Connections view in the Domain tab.
Glossary
application
A deployable object that can contain data objects, mappings, and workflows.
application service
A service that runs on the node in the Informatica domain. You manage application services in Informatica
Administrator or through the infacmd command program. Configure each application service based on your
environment requirements.
associated service
An application service that is associated with another application service. For example, the Model Repository
Service is associated with the Data Integration Service.
column profile
A type of profile that determines the characteristics of columns in a data source, such as value frequency,
percentages, patterns, and datatypes.
deploy
To make objects within an application accessible to end users. Depending on the types of objects in the
application, end users can then run mappings or workflows.
domain
A domain is the fundamental administrative unit for the Informatica node and services.
folder
A container for objects in the Model repository. Use folders to organize objects in a project and create folders
to group objects based on business needs.
Informatica Administrator
Informatica Administrator (the Administrator tool) is an application that consolidates the administrative tasks
for domain objects such as services, nodes, licenses, and grids. You manage the domain and the security of
the domain through the Administrator tool.
Informatica Developer
Informatica Developer (the Developer tool) is an application that you use to design data integration solutions.
The Model repository stores the objects that you create in the Developer tool.
Appendix A: Glossary 57
mapping
A set of inputs and outputs linked by transformation objects that define the rules for data transformation.
mapplet
A reusable object that contains a set of transformations that you can use in multiple mappings or validate as
a rule.
native authentication
The authentication method used to authenticate users logging in to PowerCenter Express applications. The
Service Manager stores group and user account information and performs authentication in the domain
configuration database.
node
A logical representation of a machine or a blade. Each node runs a Service Manager that performs domain
operations on that node.
permission
The level of access a user has to a metadata object. Even if a user has the privilege to perform certain
actions, the user may also require permission to perform the action on a particular object. You define object
permissions in Metadata Manager. An object can inherit permissions from the containing folder or resource.
pipeline branch
A segment of a pipeline between any two mapping objects.
predicate expression
An expression that filters the data in a mapping. A predicate expression returns true or false.
predicate optimization
Optimization method that simplifies or rewrites the predicate expressions in a mapping. With predicate
optimization, the Data Integration Service attempts to apply predicate expressions as early as possible to
increase mapping performance.
privilege
An action that a user can perform in Informatica Developer. You assign privileges to users and groups for the
Informatica domain and the Repository Service.
privilege group
An organization of privileges that defines common user actions.
58 Glossary
profile
An object that contains rules to discover patterns in source data. Run a profile to evaluate the data structure
and verify that data columns contain the type of information that you expect.
project
The top-level container to store objects created in Informatica Developer. Create projects based on business
goals or requirements.
rule
Reusable business logic that defines conditions applied to source data when you run a profile. Use rules to
further validate the data in a profile and to measure data quality progress. You can create a rule in
Informatica Analyst or Informatica Developer.
semi-join optimization
Optimization method that reduces the number of rows extracted from the source. With semi-join optimization,
the Data Integration Service modifies the join operations in a mapping. The Data Integration Service applies
the semi-join optimization method to a Joiner transformation when a larger input group has rows that do not
match a smaller input group in the join condition. The Data Integration Service reads the rows from the
smaller group, finds the matching rows in the larger group, and performs the join operation.
pushdown optimization
Optimization method that pushes transformation logic to a source or target database. With pushdown
optimization, the Data Integration Service translates the transformation logic into SQL queries and sends the
SQL queries to the database. The database runs the SQL queries to process the data.
service process
A run-time representation of a service running on a node.
transformation
A repository object in a mapping that generates, modifies, or passes data. Each transformation performs a
different function.
Appendix A: Glossary 59