Hi,
I'm using a Dependent Unduplicate Match. I haven't found a description anywhere of how rows are chosen for the MATCH flow versus the DUPLICATE flow. How does QS choose between them, and in what way are the rows on the MATCH flow better than the ones on the DUPLICATE flow?
Thanks
Unduplicate match
The key factor in the match process is the composite weight for the record. First you set a match cutoff value.
The unduplicate match process sets all records with composite weights above the match cutoff as a group of duplicates. Within the group, the record with the highest composite weight when matched against itself is declared the master record.
IBM InfoSphere QualityStage User Guide, page 85
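To make the rule above concrete, here is a rough Python sketch of the selection logic as I understand it. It is only an illustration, not QualityStage's actual implementation: the field_agreement_weight function is a made-up stand-in for the real composite weight, the record fields and the cutoff value are invented, and I am assuming the master ends up on the MATCH flow while the other records in its group end up on the DUPLICATE flow.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]


def field_agreement_weight(a: Record, b: Record) -> float:
    """Hypothetical stand-in for a composite weight.

    Adds 1 for every field that is non-empty and equal in both records,
    and subtracts 1 otherwise. Real match weights are computed differently.
    """
    score = 0.0
    for key in a:
        if a[key] and a[key] == b.get(key):
            score += 1.0
        else:
            score -= 1.0
    return score


def unduplicate(
    block: List[Record],
    weight: Callable[[Record, Record], float],
    match_cutoff: float,
) -> Dict[str, List[Record]]:
    """Split one block of records into match, duplicate and residual lists."""
    flows: Dict[str, List[Record]] = {"match": [], "duplicate": [], "residual": []}
    remaining = list(block)
    while remaining:
        # The master is the record that scores highest against itself.
        master = max(remaining, key=lambda r: weight(r, r))
        rest = [r for r in remaining if r is not master]
        # Records that reach the cutoff against the master join its group.
        dups = [r for r in rest if weight(master, r) >= match_cutoff]
        if dups:
            flows["match"].append(master)
            flows["duplicate"].extend(dups)
        else:
            # A master with no duplicates is treated as a residual here.
            flows["residual"].append(master)
        # Grouped records are not considered again (the "dependent" idea).
        remaining = [r for r in rest if weight(master, r) < match_cutoff]
    return flows


if __name__ == "__main__":
    sample = [
        {"name": "JOHN SMITH", "city": "NYC", "zip": "10001"},
        {"name": "JOHN SMITH", "city": "NYC", "zip": ""},
        {"name": "MARY JONES", "city": "BOSTON", "zip": "02101"},
    ]
    for flow, rows in unduplicate(sample, field_agreement_weight, 1.0).items():
        print(flow, rows)
```

In this sketch the fully populated record wins as master because it agrees with itself on more fields than the record with the empty zip, which is one way to read "better" in the question: the master is the most complete and self-consistent record in its group.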
Julio Rodriguez
ETL Developer by choice
"Sure we have lots of reasons for being rude - But no excuses