Integration Patterns For Virtual MDM Implementations - WSN1
Integration Patterns For Virtual MDM Implementations - WSN1
Integration Patterns For Virtual MDM Implementations - WSN1
4. Closing Notes
INTEGRATION PATTERNS
OVERVIEW
Salesforce
Billing
Acquire Manage DW
Deliver
Patterns for Data Acquisition Patterns for managing the Patterns for delivering data
and Ingestion Virtual MDM hub from Virtual MDM hub
Deal with all kinds data Employing methods to Landing prepared data Execution of Data
Early DQ subroutines
sources prepare data for MDM onboarding onboarding flows
• New Source
addition
SaaS Applications
• Optional in • Using ETL Tool • ETL Job
• Transformation
Virtual MDM
• Landing • Message
• Used for Messages for processing
Standardization MDM to
• Delta Loads • Add/Update
only for better process
matching type Client API
• Messaging
• Using Client calls
XaaS APIs
• Events • Sometimes
used for • Add/Update
deterministic • Using Services Web Service or
de-dupe per Rest API calls
Other Cloud
Hosted / Bluemix source • Using Change
( CDC can be rolled into
Applications • Data Model Data Capture
ETL Jobs )
Change
Loading data into Virtual MDM in a Batch ( applicable for Initial Loads and Batch Loads )
ETL Tools
Change Data Capture
Virtual MDM offerings typically provide certain core interface packages for –
Managing data in the Virtual MDM hub
Performing certain actions like Searches etc.
Extension points to perform additional processing in regular out-of-box flows
• The IBM InfoSphere Metadata Asset Manager (IMAM) tool is used to import
the schema metadata into the InfoSphere Information Server common
repository
• Attribute Completeness
How many Last Name fields are populated, how Last Name fields have no
value (NULLs), and so forth
• Attribute Validity
This includes passing in a set of anonymous (ANON) values and checking
which ones are ANONs
Validating with an external source
Validating reference data
• Attribute conformance
Data of birth (DOB) falls within a set range, such as from 1910 to 2013
Format checking in general, such as Drivers License (DL) always use digits
12 © 2015 IBM Corporation
Loading data into Virtual MDM in a Batch
XMI Files
Persist Stage
XMeta configuration in
SCM or (MDM, ASCL)
XMeta
DSX Model
DevOps (DSX)
Repository
Compile,
[ MDM Design MetaData ] Deploy & Test
Jobs [ Job MetaData ]
15 © 2015 IBM Corporation
Points to note about MDM Connector Stage
Data Reads and Writes of MDM Connector Stage currently support InfoSphere MDM SE
and Healthcare offering which run on InfoSphere MDM SE
memget and memput calls are currently supported
Implementations need variations and enhancements for the read/write capabilities
served by the memget and memput API callouts and this should be available in a future
release
The challenges around this approach is that there is no standards for Data Acquisition from sources which also elevates
the problem of end-to-end automation of the Data Acquisition and Ingestion flows
A sample InfoSphere DataStage Job is provided to assist in designing this “end-to-end” automation of the Data Acquisition
and Ingestion flows
MDM
Workbench
MDM SE
Metadata
IMAM Import
Job
Template
InfoSphere Information Server InfoSphere MDM SE
XML
2. Invoke utility on
Copy to filesystem MDM SE Server JMX Job
Utility
mpxdata run job
1. Drop prepared
source extracts
17
Sources
© 2015 IBM Corporation
Data Acquisition and Ingestion - Processing source systems’
messages and Events
Most of the topics around Managing the Virtual MDM Hub relate to Governance
Integration with IBM’s Business Process Manager, IBM Operational Decision Manager,
Cognos etc. will fall under this area
These is a full session to cover this area
https://events.na.collabserv.com/portal/wippages/register.php?id=c248284145&l=en-US
• Master Data Stewardship & Governance by Jay Limburn - 30 Jun 2015, 10-11AM EST
The key topics we will cover under this section are –
Federated Query across Virtual MDM Hubs
Integration with InfoSphere Information Server’s QualityStage
Specialized Integrations for Free Text Searches – IBM’s Watson Explorer Integration
Working with Events from InfoSphere MDM SE
20 © 2015 IBM Corporation
Managing the Virtual MDM Hub – QualityStage Integration
AttributeMaps/ pre-
interaction Handler InfoSphere MDM SE
MDM Configuration
Workbench
AddressStdHandler
Callback deploy
OR
Handler Custom Handler
Japanese
Address Std. Name and
Job Address Std.
Job
https://www.youtube.com/watch?v=T5jlMjOrxsQ
21 © 2015 IBM Corporation
Managing the Virtual MDM Hub - Federated Queries
Provides cross-
organization identity
governance without a
centralized database
Ideal Information
Exchange and cross-
jurisdiction public sector
use cases
Customer’s
Products
from MDM
Customer info
from MDM
Unstructured internal
information related to
customer
Indexed 3rd party
information related
to customer
These event’s obviously have implications and the key to those are the ones which may
entail Stewardship actions or delivery of the output of the associated views to consuming
systems
Will be covered in the next section, where we look into Patterns for delivering data from
Virtual MDM hub
APIs providing real- Handling Key events Getting data out of the Dropping off data to
Managing adherence various end-points
time access to golden provided by the Virtual Virtual MDM Hub
to Rules and Policies
views MDM Hub based on requirements
On-Premise
End points
Warehouses
Big Data
Environments
SaaS Applications
• Using Client • Processing • Using ETL Tool • Using ETL
APIs MDM Events to Tool
notify end • Selective
• Using Services points or Extracts • Pre-
routing processing
• Using Flex • Full Extracts data
Search • Routing MDM
Events to ETL • Mapping to
XaaS
Tools End-points
Other Cloud
Hosted / Bluemix
Applications
Bluemix
ERP
Handling Events
OutBound Brokers
Custom Message Processing
Hooking Queues to ETL for Processing
Columns to
Output Link
Map
Information Server
Meta Data Repository
MDM
29 Database © 2015 IBM Corporation
Integration Patterns for Virtual MDM Implementations
SPECIALIZED INTEGRATIONS
Client Value
• Enhanced customer service
• Quicker, more accurate issue resolution
• Increased upsell/cross opportunity
Usage
• Enhance detection of relationships between
entities
• Enhance entity resolution from the evidence
hidden within the documents.
• Enrich knowledge base by adding additional
information to MDM records
MDM Services
• Enrich SFDC ‘Account’
TIP information by leveraging the
broader enterprise master
Data Refinery/Bluemix information in InfoSphere MDM
• Developer appeal
33 © 2015 IBM Corporation
Integration Patterns for Virtual MDM Implementations
CLOSING NOTES
Provide Feedback to organizers on your preferences for deep dive topics in one or more
integration areas you are interesting in
Contact me if you are dealing with issues or specific opportunities where you need help
[email protected]
Implementation
Cloud Enabled
Styles
BACKUP SLIDES
If you are upgrading, please contact your local IBM rep and keep Todd Weise ( [email protected] ) on
copy
We are continuing our partnerships with Javelin - Clover ETL’s external statements
http://www.cloveretl.com/ibm-mdm
http://www.cloveretl.com/content/cloveretl-guarantees-support-ibm-infosphere-mdm-users-and-enhances-their-data-processing-pos
http://www.redbooks.ibm.com/redbooks.nsf/RedpieceAbstracts/sg248220.html?Open
`