Data Integration Strategy: Mark Mitchell Senior Product Specialist - EMEA
Data Integration Strategy: Mark Mitchell Senior Product Specialist - EMEA
Mark Mitchell
Senior Product Specialist - EMEA
The development, release and timing of any Informatica product described herein remains at the sole
discretion of Informatica. This information should not be relied upon in making a purchasing decision.
1
Data Integration Product Strategy
Data
Integration
Projects
Admin & Deploy and manage changes in large scale Administrator Low
Operate environments. High availability,Informatica
grid, 8.x
pushdown. Unified administration, security.
Foundation Services
(Repository, Grid, HA, Security, Admin., Logging, Licensing, etc.)
Connectivity Services
Galileo Release
1H 2009
Da Vinci Release
2H 2008
Mission-Critical Deployments
(Admin, Operator)
CHARTER
• Delivering enterprise grade data integration
• Supporting Integration Competency Centers (ICCs)
• Focus: Administrators and Operators
• Delivery timeframe: GA Released 12 Oct 2007
KEY CAPABILITIES
Glossary
Management
Profile, Cleanse
& Integrate
Analyst
Specify
Rules
Developer
Active
Collaborate
Scorecard
• Globalization
Gramm-Leach-Bliley Act, 14%
• Reduce costs through outsourcing
& offshoring California database breach notification act, 15%
HIPAA, 40%
• Support
Glen Carter
A Health Care Provider needs to outsource the analysis of health related data to a
third party marketing research firm
Packaged Packaged
Applications Applications
PowerCenter +
Relational and Relational and
Flat Files Data Masking Option Flat Files
Customer Customer
CUSTID FULLNAME CREATEDDATE CUSTID FULLNAME CREATEDDATE
117 Andrew Davies 4/16/1996 448 Kan Crone 3/2/1976
638 Elizabeth Murphy 1/14/1998 259 Ludie Dowden 9/5/1982
890 Richard Block 4/6/2000 913 Jarad Bayne 11/19/2004
• ACCTID is masked using Mask Format to preserve the structure, two alphabetic characters
followed by a hyphen followed by five numeric characters
• CREATEDDATE is masked using Range masking, to generate a random date between 01/01/1950
and 01/01/2010
• BALANCE needs to be blurred plus or minus 10% in order to preserve the distribution of balances
across all accounts
Customer Customer
CUSTID FULLNAME CREATEDDATE CUSTID FULLNAME CREATEDDATE
117 Andrew Davies 4/16/1996 448 Kan Crone 3/2/1976
638 Elizabeth Murphy 1/14/1998 259 Ludie Dowden 9/5/1982
890 Richard Block 4/6/2000 913 Jarad Bayne 11/19/2004
• Customer and Customer Accounts tables have to be masked consistently to preserve referential
integrity
• Maintain repeatability. For example, mask “117” to “448” again and again
•Change repeatable value for different runs. For example, mask “117” to “448” for test environment
but to “772” for development environment
• Phone Number
• Generate a random phone number but preserve the incoming
phone format
• Email Address
• Generate a random email address of the correct format with @, .,
etc
• URL
• Generate a random URL value with the correct format
• IP
• Generate a random IP address within the same network range
Customer
PHONE EMAIL SSN CREDITCARD
(206) 923-3477 [email protected] 275-85-8158 4552-7473-4192-6624
Customer
PHONE EMAIL SSN CREDITCARD
(988) 676-4900 [email protected] 275-53-0840 4552-7464-3620-2545
• Generate an SSN with the correct format but that has not been issued so far
• Generate a valid credit card Number while preserving the issuer identifier number
Customer
FULLNAME STREET CITY STATE
John Smith 100 Cardinal way Redwood City CA
Customer
FULLNAME STREET CITY STATE
• Address Masking. For example, mask 100 Cardinal way to 6 Meadows Pkwy
35
Informatica BPM Functionality
Data Service Orchestration and Human Workflow
• Eclipse based
• Visual and Source editors for BPMN, XFORM,
WSDL, XSD etc.
• Drag and drop interface eliminates coding (and
errors!)
• Import and Export of standard artifacts (WSDL, XSD
etc.)
• Single-click Deploy
• BPEL engine
• Executes BPEL code generated by Orchestration
Designer or by third party
• Interaction with external participants is exclusively
based on Web Services technology (WSDL)
• Supports long running processes
• Newer versions of processes can be deployed without
terminating existing versions
Zoom and
Timeline Control Shows
Event
Information
Shows
Proces
s Path
• Sequencing
• Start a process after the completion of another process or after a specific time has
been reached
• Synchronization of master data
• Synchronize master data between multiple independent data sources
• Conditional Logic
• Take differentiated action depending on the outcome of another process activity
• Different handlers for System and Business exceptions
• Human Workflow
• Complex decisions requiring human intervention
• Looping
• Iteratively execute a process activity based on standard looping criteria (for, while,
repeat-until)
44