What is Tokenization process in Informatica MDM?

There are two types of Match Strategy-Fuzzy and Exact.
The Tokenization process runs only on Base Objects with Fuzzy match strategy. Tokenization is the process of identifying the Match pairs between the records(Fuzzy Match Key) by generating and comparing tokens. Tokenization generates 1-20 tokens for each record of Fuzzy Match Key depending on the Key width and the length of the record. A token is an 8 char system generated value that are stored in BaseObject_STRP table(column-SSA_KEY).If there is a match between the tokens of two records,then they are identified as Match Pairs.The process of matching is only executed on these match pairs.So tokenization is basically the first step in Matching Process before executing the match process.Match tokens are generated in MDM Hub Console by executing the 'Generate Match Tokens' Batch Job in the Batch viewer after the staging and loading process are done.

1)Select Match/Merge Setup option inside the Base Object and set Match/Search Strategy in 'Properties' tab as Fuzzy.

2)Define the match path in 'Paths' tab as it allows to traverse the hierarchy between the records.This allows us to use the match columns from related table or the same table.

3)Add Match Columns and Fuzzy Match Key under the 'Match Columns' tab.There can be only one Fuzzy Match Key.We can define either Fuzzy and Exact Columns types for Fuzzy Match Strategy(In Exact,only Exact match columns can be defined)

4)Add match rule sets in 'Match Rule Sets' tab. Set the required search level and Merge type(Auto or Manual)by using the icons at the right end, and save the changes.

5)Run the 'Generate Match Tokens' Batch Job in Batch Viewer.

9 comments:

UnknownJanuary 16, 2017 at 10:02 PM
Nice blog its very informative and useful blog keep sharing.
Planning to learn Informatica MDM Training
Informatica MDM Online Training
UnknownMay 31, 2017 at 6:31 AM
I appreciate you sharing this article. Really thank you. Informatica online training
UnknownMarch 15, 2018 at 4:07 AM
Hi Varsha,

That’s really cool…. I followed these instructions and it was like boom… it worked well..

Does anyone know of a guide for the HL7 Parser using Informatica PowerCenter 10.1.1 and Informatica Developer 10.1.1 HotFix1?
I imported the HL7 Parser and have manually used the transformation to convert/view the XML output; However, I am struggling on figuring out how to add an unformulated source text file, an XML target, and importing it into PowerCenter so I can automate it.

Excellent tutorials - very easy to understand with all the details. I hope you will continue to provide more such tutorials.

Many Thanks,
Mahesh
infotrellisusMay 29, 2019 at 7:09 AM
Mastech InfoTrellis - Data and Analytics Consulting Company extending premier services in Master Data Management, Big Data and Data Integration.

Visit for More : www.infotrellis.com

infotrellisusJuly 31, 2019 at 7:41 AM
Mastech InfoTrellis - Data and Analytics Consulting Company extending premier services in Master Data Management, Big Data and Data Integration.

Visit for More : https://mastechinfotrellis.com/data-management/
James ZicrovOctober 3, 2019 at 10:33 PM
I think there is a need to look for some more information about Informatica and its crucial aspects.

Informatica Read Rest API
MOUNIKASeptember 21, 2020 at 4:31 AM
Nice post.
Informatica message Queue online training
Informatica message Queue training
Informatica power center online training
Informatica power center training
Manual Testing online training
Manual Testing training
Microservices online training
Microservices training
KITS TechnologiesMay 22, 2021 at 2:27 AM
I cannot thank you enough for the blog.Thanks Again. Keep writing.
oracle sql plsql training
go langaunage training
azure training
java training
salesforce training
hadoop training
mulesoft training
linux training
mulesoft training
web methods training
UfundAugust 18, 2022 at 1:18 AM
Nice Post! Thank you so much for sharing great post with us.

UFUND

Header Ads

What is Tokenization process in Informatica MDM?

9 comments:

Most Popular

Follow Us

Popular

Recent

Online Now

Categories

Blog Archive

Random Posts

Recent in Sports

Header Ads

What is Tokenization process in Informatica MDM?

You Might Also Like

9 comments:

Most Popular

Follow Us

Popular

Recent

Online Now

Categories

Blog Archive

Random Posts

Recent in Sports