0% found this document useful (0 votes)

625 views80 pages

Informatica MDM Training 2

The document discusses Informatica Master Data Management (MDM) load processes. It describes configuring trust, validation rules, relationships, and lookups to define how data is consolidated from multiple source systems into MDM hubs during the load process. Trust uses confidence factors to determine which data values from different sources are most reliable when merging records. Validation rules can downgrade trust if data fails user-defined validity checks. Relationships define associations between hubs, and lookups translate source keys to MDM surrogate keys.

Uploaded by

ravinder pal singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

625 views80 pages

Informatica MDM Training 2

Uploaded by

ravinder pal singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 80

Informatica Master Data

Management (MDM)

1
Topic 4: Load Process

2
Objectives

Following are the objectives of this topic:

• Configure Trust

• Configure Validation Rules

• Configure Relationships

• Configure Lookups

• Describe the Load Process

3
Trust

• Dynamic Cell-level Survivorship

• Base Object property

• A mechanism for measuring the confidence factor associated with each cell based on
its source system, change history, and other business rules

• Defined at a column level for each contributing source system

• Ensures that the most reliable data at the cell level is consolidated based on data
characteristics

4
Trust

When two base object records

merge: 2 Base Object records to merge:

ROWID_O Name Phone

• MRM calculates the trust for each BJECT
trusted column in the two base
100 Doug McDougal Grp 1-555-901-4670
object records being merged
200 The Doug McDougal Group 201-10810
• Cell with the highest values
survive in the final merged record Calculate Trust:

ROWID_O Name Phone

BJECT

100 62
Doug McDougal Grp 56

1-555-901-4670

200 The Doug

Winners Survive:

71McDougal Group 37
201-10810

ROWID_O Name Phone

BJECT

100 The Doug McDougal Group 1-555-901-4670

5
Trust

When an update comes in from a Base object prior to update:

source:
ROWID_O Name Phone
BJECT
• MRM calculates the trust on the
incoming data and compares it to 100 The Doug McDougal Group 1-555-901-4670

the trust of the data in the base 71 56

object

• Updates are only applied to the Data from staging table:

base object for cells that have ROWID_O Name Phone
higher trust on the incoming data BJECT
100 DMcD Group 201-10810

75
 50 
Base Object cells only updated where new data has higher
trust weighting:
ROWID_O Name Phone
BJECT
100 DMcD Group 1-555-901-4670

6
Trust

• Trust is an option property for a base object column

• If trust is switched off for a column, then the most recently updated value from any
source is the survived value in the base object

• Only switch on trust for a column if:

• Two or more source systems contribute to the column
• The sources are not deemed to be equally reliable providers of values to the column

7
Trust

Factors affecting trust score:

• Source of the Data – Each trust enabled column must have a maximum and minimum
trust weighting assigned for each source system

• Decay period for Data – Each trust enabled column must have a decay period
assigned for each source system that tells how long the trust weighting takes to drop
from maximum trust to minimum trust

• Decay Type – Each trust enabled column must have a decay type assigned for each
source system that tells how the trust value decreases from maximum to minimum
during the decay period

Decay Types

Slow Initial Rapid Initial Linear

Rapid Later Slow Later

8
Trust

Trust Demo

9
Validation Rules

• Defines a condition under which a data value is not valid

• Base Object property

• If the validation condition is met, then trust weighting is downgraded

• Trust after validation downgrade is

TRUST – (TRUST * downgrade_pct/100)

• Reserve Minimum Trust can be set to avoid having trust scores below the minimum
trust value

x := TRUST – (TRUST * downgrade_pct/100)

if x < MINIMUM_TRUST then x := MINIMUM_TRUST

endif;

• Validation check can be done on any column in a base object and Downgrade can be
applied to any other columns in the base object

1
Validation Rules

Some examples of validation rules:

• Downgrade trust on Last Name if

length(last_name)<3 and last_name <> ‘NG’

• Downgrade trust on Middle Name if

middle_name is null

• Downgrade trust on Address Line 1, City, State, Zip, and Valid Address Ind if

valid_address_ind = ‘False’

1
Validation Rules

Validation Rules Demo

1
Relationships

• Relationships are association between base objects via a matching column

• Property of the Base Object

• Types of relationships:
• One to Many Relationship
• Many to Many Relationship

1
Relationship

One-to-many Relationship

• One table (the child) contains a foreign key column, which matches a unique key
column of another table (the parent)

• One-to-many relationships are always defined from the child table in the relationship
(i.e. the referencing table rather than the referenced table).

1
Relationship

Many-to-many Relationship

• A base object acts as an intersection table between another two base objects

• The intersection table has a one-to-many relationship with the other two base objects

1
Lookups

• Lookups are translation of source’s primary or foreign key value into corresponding
base object key value

• Configured on the Staging Tables

• Two types of lookups:

• Automatic lookups for loading primary keys
• User defined lookups for loading foreign keys

Category (CRM) C_Category (MRM)

CATG_NO: AB001 = ROWID_OBJECT: 123
Lookup translation

1
Lookups

Automatic Lookups

• MRM automatically handles lookups loading/updating the primary key of a Base Object

Staging Table for Customer data from Customer Base Object

CRM System (C_STG_CRM_CUST):
ROWID_OBJECT FULL_NAME
PKEY_SRC_OBJECT FULL_NAME

10810 JOHN J HANCOCK

3507 JOHN JAMES HANCOCK

Customer Cross-Reference

ROWID_OBJECT ROWID_SYSTEM PKEY_SRC_OBJECT FULL_NAME

10810 CRM_SYS 3507 JOHN J HANCOCK

10810 SALES_SYS A53UT1 JOHN HANCOCK

1
Lookups

User Defined Lookups

• For user-defined relationships, the corresponding lookups has to be manually

configured

• Lookups can be based on XREF table or an Unique Key in Base Object

Staging Table for Address data from CRM Customer Base Object
System (C_STG_CRM_ADDR):
ROWID_OBJECT FULL_NAME
PKEY_SRC_OBJECT CRM_ID

10810 JOHN J HANCOCK

ADDR100 3507

Customer Cross-Reference

ROWID_OBJECT ROWID_SYSTEM PKEY_SRC_OBJECT SUB_CATG_CODE

10810 CRM_SYS 3507 JOHN J HANCOCK

10810 SALES_SYS A53UT1 JOHN HANCOCK

1
Lookups

Shadow Foreign Key

• The foreign key value stored on the cross-reference (X-ref) is the same as the value
stored on the base object

• This facilitates certain MRM internal processes on parent merge

• However, it makes it difficult to tie child X-ref’s back to their original parent X-ref

• Shadow foreign key is an additional column added to the X-ref for every foreign key
defined on the base object

• Contains the source system’s original foreign key value

• Name of shadow foreign key column is S_FKColumnName, for example
• Foreign key column name = Customer_ROWID
• Shadow foreign key column name = S_Customer_ROWID

1
Lookups

Shadow Foreign Key on XREF

Staging Table for Address data Customer Base Object

from CRM System
(C_STG_CRM_ADDR): ROWID_OBJECT FULL_NAME

PKEY_SRC_
CUST_ID 10810 JOHN J HANCOCK
OBJECT

ADDR100 3507 Customer Cross-Reference

ROWID_ ROWID_ PKEY_SRC_
FULL_NAME
OBJECT SYSTEM OBJECT

10810 CRM_SYS 3507 JOHN J HANCOCK

Address Base Object

ROWID_
CUST_ID ADDRESS
OBJECT

24680 10810 123 Main St

Address Cross-Reference
ROWID_ ROWID_ PKEY_SRC_
CUST_ID S_CUST_ID ADDRESS
OBJECT SYSTEM OBJECT

24680 CRM_SYS ADDR100 10810 3507 123 Main St

2
Load Process

Load process is a two-step process:

• Apply Updates

• Apply Inserts

Tokenize
STRIP_ON_LOAD_IND= 0
End
Process Inserts
LOAD
job
STRIP_ON_LOAD_IND = 1

Tokenize

2
Load Process

Updates

• Load job applies updates for existing records whose

LAST_UPDATE_DATE (Staging table) > SRC_LUD (XREF table)

• The update process always updates the XREF table record

• The update process may update the Base Object depending on trust:
• For columns not flagged for trust, update happens if incoming data has new LUD
• For columns flagged for trust, load job compares trust weightings of staging table data
to trust weightings of existing data in base object to determine what can be updated

• If history flag is switched on for the Base Object, then the update process writes to the
history tables of Base Object and XREF

2
Load Process

Inserts

• Load job applies inserts for records that do not exist in the XREF table

• ROWID_OBJECT values are generated for the new records

• New records are inserted into base object and XREF with CONSOLIDATION_IND = 4

• If history flag is switched on for the Base Object, then the insert process writes to the
history tables of Base Object and XREF

2
Load Process

Rejects

• Referential Integrity is maintained among base objects in the consolidated data model

• Rejects will occur in the load process if any records violate the RI constraint
• Parent records do not exist
• Child records are loaded before the parent records
• Lookup has been defined incorrectly
• Rejected records are inserted in the reject table of Staging table

staging_table_name_REJ

2
Topic 5: Match Process

2
Objectives

Following are the objectives of this topic:

• Match & Merge Overview

• Match Rules Configuration

• Exact Match/Search Strategy

• Fuzzy Match/Search Strategy

• Match Server Architecture

2
Match & Search Strategy

Match Process

2
Match & Merge Overview

Challenges with identifying duplicate records

• Misspellings, typing, and transcription errors

• Nicknames

• Synonyms

• Abbreviations

• Foreign and Anglicized words

• Prefix and suffix abbreviations

• Concatenation or splitting of words

• Noise words and punctuation

• Casing and character set variations

2
Match & Merge Overview

• To merge or link records, MRM(Master Reference Manager) needs to know which

records are likely duplicates of each other

• Match rules tell MRM how to identify likely duplicates

• Match rules also tell MRM if two matching records are similar enough to automatically
merge/link, or if they should be reviewed by a data steward

3
Match & Merge Overview

Data Consolidation Options

• Merging (merge-style base objects)

physically combines the matched
records in the base object. Makes the
most-current best version of the truth
(BVT) available

• Linking (link-style base objects) quickly

determines the BVT without physically
combining the records. Provides much
faster overall throughput

3
Match/Search Strategy

Exact
• Does not allow for any variations in the data in the match columns
• Very simple match process, therefore fast

Fuzzy
• Allows for variations in spelling, formats, word order, nicknames, synonyms, etc.
• More complex match process, therefore slower

3
Match/Search Strategy

High level process flow for the match process

Fuzzy
Register
MATCH Fuzzy or Generate Search for Match
job Exact? Keys Candidates

Exact

Compare records to match against rest of Compare records to match against

records in base object match candidates

Populate match table with matched ids

End
MATCH
job

3
Match Path

Match Path

• A Match Path represents the base object which will provide data for matching purpose

• Traverse the hierarchy between records across multiple base objects or within a single
base object

• Foreign Key Relationships between tables are used to traverse the relationships

• Parent-to-child or child-to-parent relationships can be specified

Match Path - Check for Missing Children

• By default, MDM does an inner join between the base objects defined in the Match
Path

• The join therefore excludes rows that don’t have corresponding rows in the joined
tables

• To include those records, switch on “Check for Missing Children” – MDM will then do an
outer join instead of an inner join

3
Match Path

Match Path – Inter Table

3
Match Path

Match Path – Intra Table

3
Match Column

• A match column contains an identifying characteristic of the base object to be

consolidated

• Each base object can have multiple match columns

• Examples:
Full Name

Generation

Address

Phone

• Provider column(s) is the base object columns that provide the data for the match
column:
• Can be a single column or a concatenation of columns
• Must be a VARCHAR / CHAR column to concatenate
• Date column is also supported for matching

3
Match Column

Each match column is based on one or

Customer: Would get false
more columns positive
ROWID_ Name matches if
∙ From base object
OBJECT matching just
on Name
∙ Or from X-ref (in some cases)

200 John Smith

∙ Or from child base object (in some
cases) 250 John Smith
Include
300 John J Smith Address
attributes in the
Match to
reduce false
Address: positives

CUSTOME Address
R_ROWID

200 123 Main Street, Boston MA

250 109 Broad Street, Boulder CO

300 123 Main Street, Boston MA

3
Exact Match/Search Strategy

Steps for defining Exact Match Rules

• Select Match/Search Strategy = Exact
• Define Match Path
• Define Match Columns
• Create at least one Match Rule Set
• Create Match Rules for Match Rule Set(s)

3
Exact Match/Search Strategy

Match Columns

• A match column contains an identifying characteristic of the base object record to be

consolidated

• Exact Match Columns:

• Does not make allowance for any variations in data content
• Records will match if they have identical values in the match columns used in match
rules

4
Exact Match/Search Strategy

Match Rule & Match Rule Set

• Match Rules are grouped into Match Rule Sets

• Can define multiple rule sets
• Only one match rule set can be active at any point in time
• Match rule defines the combination of columns that constitute a match

Match Rule - Auto property

• Match rules are flagged either for auto merge/auto link or for manual merge/link

• Matches resulting from auto merge/auto link rules will result in the records being
automatically merged/linked by the system when the auto merge/auto link batch job
runs

• Matches resulting from manual merge/link rules will be queued for review by a data
steward

4
Exact Match/Search Strategy

Match Rule & Match Rule Set

• Every checked Match Column adds an “And” condition
• Every new Rule is an “Or” condition
• Net result of match is that the same 2 records will only match once, on the first match
rule that they match on

AND 2 records match if

they have the
same values in
Match Col 1 and in
Rule # Match Column 1 Match Column 2 Match Column 3 Match Col 3

1  
OR 2  
2 records match if
they have the
same values in
Match Col 2 and in
Match Col 3

4
Exact Match/Search Strategy

Match Rule – Null Matching

• By default, NULL is not regarded Data Example:

as being the same as NULL ROWID_ Customer_Name Generation
OBJECT
• NULL Matches NULL: Use this
500 Douglas McDougal Jr
flag to specify the match columns
in a match rule that should be 550 Doug McDougal
regarded as matches even if the
560 D McDougall Jr
2 values being compared are
both NULL 570 Doug McDougall

• NULL Matches non-NULL: Use In the above example the effects of Null
Matching on the Generation column are
this flag to specify the match shown
columns in a match rule that
should be regarded as matches
when one of the values being
compared is NULL and the other
is not

4
Exact Match/Search Strategy

Match Rule – Non-Equal

Matching
Data Example:
• Specifies that 2 records are a
match if they do not have the ROWID_ Custome Customer_Name CRM_
same values in the non-equal OBJECT r_Type FLAG
match column 500 ORG The Doug McDougal
Group
• Reverses whatever
would/would NOT have

550 IND Doug McDougal Y
matched without Non-equal
match 560 IND D McDougall Y
570 ORG Doug McDougal
• If using non-equal match, then
MUST switch on Validate
Matches property in Base • If non-equal match is used on the CRM_FLAG column to
prevent 2 records from the CRM system from matching
Object Advanced Properties each other, then –
• NULL=Y is a match
• NULL=NULL is a match
• Y=Y is not a match

4
Exact Match/Search Strategy

Match Rule – Segment Matching

• Allows a match rule to be limited Data Example:
to a specific subset of data ROWID_ Custome Customer_Name CRM_
OBJECT r_Type FLAG
• Different match rules can use
different segment values 500 ORG The Doug McDougal
Group
550 IND Doug McDougal Y
560 IND D McDougall Y
570 ORG Doug McDougal

• Use a Segment Match value of ‘ORG’ on Customer Type

match column to create a match rule that only applies to
Organizations.
• Use a Segment Match value of ‘IND’ on Customer Type
match column to create a match rule that only applies to
Individuals.

4
Exact Match/Search Strategy

Match Rule

4
Fuzzy Match/Search Strategy

Steps for defining Fuzzy Match Rules

• Select Match/Search Strategy = Fuzzy
• Choose a Population
• Define Match Path
• Define Match Key
• Define Match Columns
• Create at least one Match Rule Set & choose Search Level
• Create Match Rules for Match Rule Set(s)

4
Fuzzy Match/Search Strategy

Population

• Population is intended to addresses the name distribution problem

• Common family names in each population skew the data and query performance
e.g. Smith, Williams in English-speaking populations

• Each population also has a large number of uncommon names that tend to have the
most error and variability
• Match needs to account for both of these situations in the way that the keys are built,
to give optimal search performance for both

• Defines how to identify matches within a particular population and language

• Defines how to build keys and perform searches on name and address

• Supports a specific set of match purposes

4
Fuzzy Match/Search Strategy

Population

4
Fuzzy Match/Search Strategy

Match Key
• Match key is used to search for match candidates
• It is a fixed-length, compressed, and encoded value
• Built from a combination of the words and numbers in a name or address
• For one name or address, multiple SSA match keys are generated
• Match Key Properties:
• Key Type
• Key Width
• Path Component
• Match Column Contents

5
Fuzzy Match/Search Strategy

Match Key – Key Type

• The match key type describes important characteristics about a column to MDM Hub

• Should be based on the main identifying data in your base object

• For standard population, the options are:

Key Type Description

Use if the data contains organization names or both organization
Organization Name
names and individual names
Person Name Use if the data contains individual names only

Address Part1 Use if the data contains addresses

5
Fuzzy Match/Search Strategy

Match Key – Key Width

∙ Determines the degree of variance that will be supported in the key values

∙ Represents tradeoff between match precision and the space used by match key
records
Key Width Description

• Generates the most keys

• Allows for the most variance in key values i.e. supports greatest
Extended
search completeness
• Uses the most disk space
• Generates the fewest keys
Limited • Does not allow for word order variances
• Uses the least disk space

• Aims for balance between Limited and Extended i.e. balance between
Standard
disk usage/performance and search completeness

• Generates single key

Preferred
• Might result in fewer match candidates

5
Fuzzy Match/Search Strategy

Match Key – Path Component

• Contains the column that forms the basis for defining the Match Key

• Can be any table defined in the Match Path

Match Key – Match Column Contents

• The column(s) from Path Component that provide data to the Match Key

5
Fuzzy Match/Search Strategy

Full_Name Match Key

Match Key – Example

• Key Type = Organization_Name; ELIZABETH S O'BRIAN PCOJLK$-

PCWG$$OG
• Key Width = Standard; VL/IEFLM
VL/IJ/$-
• Path Component = Customer
ELIZABETH O BRIEN MIDIA*P-
• Match Column Contents = Full_Name MIP$$$DI
PC>AO$$-
PCP$$$>>
ELIZABETH O'BRIEN PCOG$$$$
VL/IEFLM
BETH O'BRIEN MMU$?/$-
PCOG$$$$
VL/IEFLM
LIZ O'BRIEN PCOG$$$$
SXOG$$$-
VL/IEFLM

5
Fuzzy Match/Search Strategy

Match Key

5
Fuzzy Match/Search Strategy

Match Column

• A match column contains an identifying characteristic of the base object record to be

consolidated

• Can be a fuzzy column or an exact column

• Fuzzy Match Column

• The column name you choose defines the type of data that the match expects that
column to contain
• Examples: Person Name, Address Part 1, Address Part 2, etc.

• Exact Match Column

• Acts as a filter in the match
• Can have additional properties when used in match rules like Null match, Non-equal
match and segment match

5
Fuzzy Match/Search Strategy

Match Column

5
Fuzzy Match/Search Strategy

Match Rule Set

• They are logical grouping of Match Rules that collectively act on a base object for
identifying duplicates

• Multiple rule sets can be defined for a base object

• Only one rule set can be active at any point in time

• Each rule set has a Search Level and can comprise of one or more Match Rules

5
Fuzzy Match/Search Strategy

Match Rule Set – Search Level

• Determines how many match candidates are returned in the search phase of match
process

Search Level Description

• Least complex and generates fewest candidates

Narrow
• Gives the best performance

Typical • The appropriate level of search level for typical data sets

• Used when the data set is small or if it is critical to identify

Exhaustive
the highest number of matching records
• Supports the highest level of complexity
Extreme • Gives the worst performance as it generates the most
candidates

5
Fuzzy Match/Search Strategy

Match Rule Set – Search Level Examples

Key Type = Organization_Name; Key Width = Standard;

Record to be Matched = “ELIZABETH S O’BRIAN”

Narrow 3 Typical 14 Exhaustive 26 Extreme 27

Start key End Key Start key End Key Start key End Key Start key End Key
PCOG$$$$ PCOG$$$/ PCWG$$$$ PCWG$$ZZ PVS$$$$$ PVS$BZZZ PVS$$$$$ PVS$BZZZ
PC$$$$$$ PC$$$$$/ PCOG$$$$ PCOJZZZZ MM/OB/$$ MM/OB/$/ MM/OAH$$ MM/OB/ZZ
PCWG$$$$ PCWG$$ZZ OVOG$$$$ OVOJZZZZ M-WG$$$$ M-WG$$ZZ M-TO$$$$ M-WJZZZZ
VL/IEF$$ VL/IEF$/ PCWG$$$$ PCWG$$ZZ PCTO$$$$ PCWJZZZZ
PC$$$$$$ PC$$$$$/ MMV>B/$$ MMV>B/$/ MMV>AH$$ MMV>B/ZZ
PVS$$$$$ PVS$$$$/ RSWG$$$$ RSWG$$ZZ RSTO$$$$ RSWJZZZZ

MM/OB/$$ MM/OB/$/ P?WG$$$$ P?WG$$ZZ P?TO$$$$ P?WJZZZZ

MMV>B/$$ MMV>B/$/ KXWG$$$$ KXWG$$ZZ KXTO$$$$ KXWJZZZZ

P?WG$$$$ P?WG$$ZZ PBWG$$$$ PBWG$$ZZ PBTO$$$$ PBWJZZZZ

PVLKB/$$ PVLKB/$/ PVLGB/$$ PVLGB/$/ PVLGAH$$ PVLGB/ZZ

S$S$B/$$ S$S$B/$/ PVKSB/$$ PVKSB/$/ PVKSAH$$ PVKSB/ZZ

TNKBJ/$$ TNKBJ/$/ PAWG$$$$ PAWG$$ZZ PATO$$$$ PAWJZZZZ

TIWG$$$$ TIWG$$ZZ RAWG$$$$ RAWG$$ZZ RATO$$$$ RAWJZZZZ

YMU$B/$$ YMU$B/$/ PVLKB/$$ PVLKB/$/ PVLKAH$$ PVLKB/ZZ

… … … …
6
Fuzzy Match/Search Strategy

Match Rule Set

6
Fuzzy Match/Search Strategy

Match Rule

• Determines what constitutes a match during match process

• Fuzzy Match Rule Properties:

• Match Purpose
• Match Level
• Accept Limit Adjustment
Match Rule – Match Purpose

• Determines the fields that will be used in the match

• Different fields are required fields for different purposes
• There are also optional fields for each purpose that can help improve the match

• Determines the importance accorded to each field

6
Fuzzy Match/Search Strategy

Match Rule – Match Level

• Determines how precise the match is i.e. how similar a candidate record is to the
queued record to be considered a match

• Supported match levels are:

• Conservative: Tight Matching
• Typical: Appropriate for most matches
• Loose: Allows more variance in the values being matched

Match Rule – Accept Limit Adjustment

• Determines the acceptability of a match for the specified match level

• The Accept Limit Adjustment allows a coarse adjustment to what is considered to be a

match for this match rule:
• A positive adjustment results in tighter matching
• A negative adjustment results in looser matching

6
Fuzzy Match/Search Strategy

Match Rule

6
Fuzzy Match/Search Strategy

Match Rule – Syntax Used in Rule Description

Symbol Description
Column_1 (Fuzzy) Indicates that Column_1 is a fuzzy match column

Column_1 (Fuzzy) (+2) Indicates that the fuzzy column, Column_1, has had its weighting in the rule manually
increased

Column_2 {‘a’} Set of segment match values for Column_2

Column_3 (Ø) Indicates that null match is switched on for Column_3.

Can be combined with non-equal match: Column_3 (≠ Ø)

Column_4 (≠) Indicates that non-equal match (anti-match) is switched on for Column_4. Can be combined
with null match: Column_4 (≠ Ø)

6
Match Server Architecture

Match server is multi-threaded

∙ Can configure how many threads MDM Hub will create for matching
∙ If not configured, 4 threads will be created regardless of the number of CPUs on the
machine

Multiple match servers can be configured

∙ Allows match jobs to be run in parallel. A single match job is not load balanced across
multiple match servers
∙ MDM Hub will assign match jobs to available match servers on a round robin basis

6
Topic 6: Merge Process

6
Objectives

Following are the objectives of this topic:

• Configure Merge Settings

• Describe Immutable Source Systems

• Describe Distinct Systems

• Describe the Un-Merge Process

6
Merge Process

Merge Process

6
Merge Process

Merge

• Consolidation process of two matched records in the Base Object

• Merge can be Auto-Merge or Manual-Merge depending on the degree of matching

Immutable Source Systems

• An immutable source means that the source system is seen as a distinct source

• All records coming from this source always have a consolidation indicator of 1

• If two immutable records must be merged, then a data steward needs to perform a
manual verification in order to allow that change. The data steward will have to choose
the key that remains

Distinct Systems

∙ Records from source marked as Distinct will not merge amongst themselves

7
Merge Process

Un-Merge Process

• By default, unmerging parent records does not unmerge associated child records

• Unmerge Child When Parent Unmerges option allows you to specify what happens if
records in the parent base object are unmerged

• Pre-Requisites for enabling this option are:

• The parent-child relationship must already be configured in the child base object
• The foreign key column in the child base object must be a match-enabled column

7
Topic 7: Batch Process

7
Objectives

Following are the objectives of this topic:

• Overview of Batch Viewer

• Executing Stored Procedures

• Job Status & Job Statistics

• Scheduling Considerations

• Overview of Batch Group

• Viewing Logs and Rejected Records

7
Batch Process

Batch Viewer

• Provides a way to execute a batch job from the Hub Console

• Shows job completion status (Success / Failure / Warning) with associated message

• Shows job statistics

• Useful for starting the run of a single job, or running jobs that don’t often need to run
(e.g. Synchronize Trust job after changing Trust settings)

• Does not provide any automation or scheduling

7
Batch Process

Batch Viewer

7
Executing Stored Procedures

Stored Procedures

• All public MRM batch processes can be executed through stored procedures

• Can easily be integrated with any job scheduling software – Tivoli, CA Unicenter etc.

• The full list of public batch processes per user-defined object can be found in
C_REPOS_TABLE_OBJECT_V

SELECT * FROM C_REPOS_TABLE_OBJECT_V WHERE PUBLIC_IND = 1

• Various Run Status upon completion of a Stored Procedure:

• 0 = Completed Successfully
• 1 = Completed with Errors/Warnings
• 3 = Failed

7
Job Status & Job Statistics

Job Status and Statistics

• Job status & statistics can be viewed in the Batch Tool or query the C_REPOS_JOB*
tables directly

7
Scheduling Considerations

Stage Jobs

• If cleanse server machine has enough CPU and memory to handle multiple cleanse
servers, then parallelize stage jobs

Load Jobs

• Easiest way to schedule Load jobs is in serial

• If large number of Loads run for a short batch window, then need to Load separate
targets in parallel and check all dependencies before each Load starts

Match/Merge Jobs

• Determine whether to run match-merge once per object per batch window, or after
every source load

• Consider whether to tokenize after load. Can switch off the STRIP_ON_LOAD indicator
so that the strip process does not run as part of the load

7
Batch Group

Batch Group

• A batch group is a collection of individual batch jobs (e.g. Stage, Load, Match, etc.) that
can be executed with a single command

• Each batch job in a group can be executed sequentially or in parallel to other jobs

• Group Levels – Jobs in a particular Group Level are executed in parallel

Viewing Logs and Rejected Records

• History logs can be viewed across all Batch Groups, based on their execution status by
clicking on the appropriate node under the “Logs By Status” node

• A batch group that contains stage jobs may encounter rejected records. These can be
viewed by selecting the log record for the stage job that contains the rejected record,
then clicking the “View Rejects” button

7
Batch Group

Batch Group

Jobs in the same

level are executed in
parallel

Individual levels are

executed in sequence

Database Management System PPT
100% (3)
Database Management System PPT
88 pages
MDM Notes
No ratings yet
MDM Notes
45 pages
Informatica MDM Interview Preparation
100% (1)
Informatica MDM Interview Preparation
35 pages
Informatica MDM Training 2
No ratings yet
Informatica MDM Training 2
80 pages
Informatica MDM Basics
90% (10)
Informatica MDM Basics
42 pages
Informatica IICS Interview Questions
100% (2)
Informatica IICS Interview Questions
33 pages
IICS Cloud & PC Scenario Real Time Interview Questions
67% (3)
IICS Cloud & PC Scenario Real Time Interview Questions
4 pages
CD I Bootcamp Consolidated Day 11650307222139
No ratings yet
CD I Bootcamp Consolidated Day 11650307222139
91 pages
Informatica MDM Questions
100% (1)
Informatica MDM Questions
3 pages
Informatica MDM Training 4
No ratings yet
Informatica MDM Training 4
36 pages
IDMC Best Practices and Standards
100% (1)
IDMC Best Practices and Standards
27 pages
Iics Training Module Details
No ratings yet
Iics Training Module Details
22 pages
Informatica Mapping Scenarios
No ratings yet
Informatica Mapping Scenarios
81 pages
Option 1 Option 2 Option 3 Option 4 Correct Answer Option
No ratings yet
Option 1 Option 2 Option 3 Option 4 Correct Answer Option
33 pages
IICS MCQs
100% (1)
IICS MCQs
7 pages
IDQ Reference
No ratings yet
IDQ Reference
31 pages
Informatica Imp
No ratings yet
Informatica Imp
141 pages
SCD Type-2 Using Dynamic Lookup
100% (2)
SCD Type-2 Using Dynamic Lookup
12 pages
Informatica, Datawarehouse, Oracle, Unix - FINAL INTERVIEW QUESTIONS (ETL - INFORMATICA)
No ratings yet
Informatica, Datawarehouse, Oracle, Unix - FINAL INTERVIEW QUESTIONS (ETL - INFORMATICA)
63 pages
Best Informatica Interview Questions
67% (3)
Best Informatica Interview Questions
38 pages
Informatica
0% (1)
Informatica
32 pages
Informatica MDM Match Tuning Guide
No ratings yet
Informatica MDM Match Tuning Guide
13 pages
Informatica Cloud Enterprise Labs
No ratings yet
Informatica Cloud Enterprise Labs
90 pages
Informatica Interview Questions (Scenario-Based) - Edureka
No ratings yet
Informatica Interview Questions (Scenario-Based) - Edureka
29 pages
Informatica MDM Intregration With PIM - Design Blueprint v1.0
100% (1)
Informatica MDM Intregration With PIM - Design Blueprint v1.0
22 pages
Informatica Interview Questions Scenario Based
80% (5)
Informatica Interview Questions Scenario Based
14 pages
Advanced Concepts in Informatica
No ratings yet
Advanced Concepts in Informatica
58 pages
Informatica Lab
100% (2)
Informatica Lab
34 pages
CE Student Guide PC - HandsOnWorkshop
No ratings yet
CE Student Guide PC - HandsOnWorkshop
328 pages
Informatica MDM Training Course Content
No ratings yet
Informatica MDM Training Course Content
5 pages
Learning Informatica PowerCenter 9.x
From Everand
Learning Informatica PowerCenter 9.x
Rahul Malewar
3/5 (4)
Informatica Data Quality PDF
No ratings yet
Informatica Data Quality PDF
44 pages
Informatica Interview Questioner Ambarish PDF
No ratings yet
Informatica Interview Questioner Ambarish PDF
211 pages
50 Frequently Asked Informatica Interview Questions With Answers
No ratings yet
50 Frequently Asked Informatica Interview Questions With Answers
8 pages
Informatica Experienced Interview Questions - Part 1
No ratings yet
Informatica Experienced Interview Questions - Part 1
5 pages
Implementing SCD 2 With MD5
100% (1)
Implementing SCD 2 With MD5
6 pages
Informatica Interview Questions and Answers
No ratings yet
Informatica Interview Questions and Answers
58 pages
1st Round Interview Questions
No ratings yet
1st Round Interview Questions
5 pages
SCD Type1
100% (1)
SCD Type1
16 pages
Informatica Transformations Ans SQL Queries
No ratings yet
Informatica Transformations Ans SQL Queries
32 pages
Tableau Finalpresentation 161211155749
100% (2)
Tableau Finalpresentation 161211155749
43 pages
Informatica Interview Questions Scenario Based PDF
No ratings yet
Informatica Interview Questions Scenario Based PDF
14 pages
MDM Training Program
No ratings yet
MDM Training Program
53 pages
Informatica Geek Interview Questions
No ratings yet
Informatica Geek Interview Questions
69 pages
Informatica Scenarios
No ratings yet
Informatica Scenarios
12 pages
Informatica Senarios
No ratings yet
Informatica Senarios
26 pages
1.01 Aggregation Using Sorted Input: Informatica Mappings
No ratings yet
1.01 Aggregation Using Sorted Input: Informatica Mappings
64 pages
Insurance Project Simple Explanation
100% (5)
Insurance Project Simple Explanation
1 page
SQL Basic
100% (1)
SQL Basic
53 pages
Informatica Interview Questioner-Ambarish
No ratings yet
Informatica Interview Questioner-Ambarish
211 pages
Informatica MDM
0% (1)
Informatica MDM
4 pages
Srikant - Informatica AXON EDC MDM Consultant - GoAhead
No ratings yet
Srikant - Informatica AXON EDC MDM Consultant - GoAhead
4 pages
INFA Scenario Based Q N A - 16 Pages
No ratings yet
INFA Scenario Based Q N A - 16 Pages
16 pages
Mastering Azure Synapse Analytics: Learn how to develop end-to-end analytics solutions with Azure Synapse Analytics (English Edition)
From Everand
Mastering Azure Synapse Analytics: Learn how to develop end-to-end analytics solutions with Azure Synapse Analytics (English Edition)
Debananda Ghosh
No ratings yet
Informatica Scenario Based Interview Questions With Answers
No ratings yet
Informatica Scenario Based Interview Questions With Answers
10 pages
MDM Questions
No ratings yet
MDM Questions
1 page
Talend Interview Questions
No ratings yet
Talend Interview Questions
6 pages
Data Profiling With Informatica Data Quality
No ratings yet
Data Profiling With Informatica Data Quality
5 pages
Etl Testing New Faqs23
No ratings yet
Etl Testing New Faqs23
3 pages
Operate Database Application
No ratings yet
Operate Database Application
26 pages
Informatica Interview Questions and Answers
No ratings yet
Informatica Interview Questions and Answers
5 pages
CauseListFile IS55D18F8M0
No ratings yet
CauseListFile IS55D18F8M0
286 pages
Tourism Database Management System
No ratings yet
Tourism Database Management System
22 pages
Practice Questions for Snowflake Snowpro Core Certification Concept Based - Latest Edition 2023
From Everand
Practice Questions for Snowflake Snowpro Core Certification Concept Based - Latest Edition 2023
Exam OG
5/5 (1)
Dbms Complete Notes
No ratings yet
Dbms Complete Notes
66 pages
DBDM Ai Dev PDF
No ratings yet
DBDM Ai Dev PDF
119 pages
Model PIM 08 PDF
100% (1)
Model PIM 08 PDF
5 pages
The Data Model Resource Book: Volume 3: Universal Patterns for Data Modeling
From Everand
The Data Model Resource Book: Volume 3: Universal Patterns for Data Modeling
Len Silverston
No ratings yet
HSST Computer Science
No ratings yet
HSST Computer Science
14 pages
Question Bank For Oracle SQL and PLSQL
No ratings yet
Question Bank For Oracle SQL and PLSQL
12 pages
Practice Practical 3
No ratings yet
Practice Practical 3
4 pages
Letter Format
No ratings yet
Letter Format
4 pages
English Core SrSec 2022-23
No ratings yet
English Core SrSec 2022-23
8 pages
Table 1 Data Dictionary For Activity - Log
No ratings yet
Table 1 Data Dictionary For Activity - Log
9 pages
Alteryx 160923184340
No ratings yet
Alteryx 160923184340
25 pages
21.1 Results For An Overview of The Interface: Twentyone
No ratings yet
21.1 Results For An Overview of The Interface: Twentyone
46 pages
IDIOMS
No ratings yet
IDIOMS
2 pages
CIS 321 - Project
No ratings yet
CIS 321 - Project
6 pages
IELTS Writing Task 2: 'Artificial Intelligence' Essay
No ratings yet
IELTS Writing Task 2: 'Artificial Intelligence' Essay
1 page
UNIT4
No ratings yet
UNIT4
16 pages
Database Management System
No ratings yet
Database Management System
60 pages
Preeti File
No ratings yet
Preeti File
7 pages
Operations Management Assignment PDF
No ratings yet
Operations Management Assignment PDF
3 pages
Operations Management Assignment PDF
No ratings yet
Operations Management Assignment PDF
3 pages
Academic Appeal Form (Stage 1)
No ratings yet
Academic Appeal Form (Stage 1)
3 pages
Title: Major Assignment: Assessment Summary
No ratings yet
Title: Major Assignment: Assessment Summary
3 pages
BHANE
No ratings yet
BHANE
1 page
Mock Interview SQL Developer
No ratings yet
Mock Interview SQL Developer
4 pages
Test Code Mytap Mysql Dan Jawaban Mahasiswa
No ratings yet
Test Code Mytap Mysql Dan Jawaban Mahasiswa
17 pages
DBMS Answers Mid Questin Bank
No ratings yet
DBMS Answers Mid Questin Bank
17 pages
SearchArnAction PDF
No ratings yet
SearchArnAction PDF
1 page
11 - SQL FOREIGN KEY Constraint
No ratings yet
11 - SQL FOREIGN KEY Constraint
9 pages
(MS-DSDG) : Dataset Diffgram Structure Specification: Open Specification Promise Community Promise
No ratings yet
(MS-DSDG) : Dataset Diffgram Structure Specification: Open Specification Promise Community Promise
56 pages
Ir - 2
No ratings yet
Ir - 2
15 pages
Master data management Complete Self-Assessment Guide
From Everand
Master data management Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
3121 Edmund Pinto Database Management Assignment 7
No ratings yet
3121 Edmund Pinto Database Management Assignment 7
16 pages
Review of Data Consistency and Integrity Constraint
No ratings yet
Review of Data Consistency and Integrity Constraint
7 pages
57.4 - IMDB Dataset - mp4
No ratings yet
57.4 - IMDB Dataset - mp4
3 pages
SQL Statements
No ratings yet
SQL Statements
11 pages
Data Models With Examples
No ratings yet
Data Models With Examples
4 pages
Table Description: Week - 1 Case Study
No ratings yet
Table Description: Week - 1 Case Study
6 pages
Question Bank-1
No ratings yet
Question Bank-1
3 pages
Database Systems Lab 3 Key Constraints
No ratings yet
Database Systems Lab 3 Key Constraints
4 pages