Round robin

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

krish889
Participant
Posts: 2
Joined: Fri Dec 12, 2014 7:14 pm

Post by krish889 »

Hi,
When I use Round Robin partitioning on a Remove Duplicates stage, it does not eliminate all of the duplicate rows. Can you briefly explain the reason behind that?

Thank you.
KRISHNA
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

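Because you're not sending all of the duplicate keys to the same partition. Round robin deals rows out to the partitions in turn, regardless of key value, and the Remove Duplicates stage only compares rows within its own partition. Use hash partitioning on the key field(s) that drive the duplicate check.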
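
To illustrate the point, here is a minimal Python sketch that simulates the behaviour. It is not DataStage code: the function names, the sample rows, and the two-partition setup are all made up for illustration, and the per-partition dedup uses a set rather than the sorted input the real stage expects.

```python
# Simulation only: hypothetical helpers, not a DataStage API.

def round_robin_partition(rows, num_partitions):
    """Deal rows out to partitions in turn, ignoring key values."""
    partitions = [[] for _ in range(num_partitions)]
    for i, row in enumerate(rows):
        partitions[i % num_partitions].append(row)
    return partitions

def hash_partition(rows, num_partitions, key):
    """Send every row with the same key value to the same partition."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        partitions[hash(row[key]) % num_partitions].append(row)
    return partitions

def remove_duplicates_per_partition(partitions, key):
    """Dedup each partition independently, keeping the first row per key.

    Each partition only sees its own rows, so a duplicate that lands
    in a different partition is never detected.
    """
    output = []
    for part in partitions:
        seen = set()
        for row in part:
            if row[key] not in seen:
                seen.add(row[key])
                output.append(row)
    return output

# Two distinct keys, each appearing twice (sample data, assumed).
rows = [
    {"id": 1, "val": "a"},
    {"id": 1, "val": "b"},
    {"id": 2, "val": "c"},
    {"id": 2, "val": "d"},
]

rr_result = remove_duplicates_per_partition(
    round_robin_partition(rows, 2), "id")
hash_result = remove_duplicates_per_partition(
    hash_partition(rows, 2, "id"), "id")

print(len(rr_result))    # duplicates survive: each partition got one copy of each key
print(len(hash_result))  # one row per key: all copies of a key shared a partition
```

With this sample data, round robin puts one copy of each key in each partition, so nothing is dropped; hash partitioning on `id` keeps each key's copies together and leaves one row per key.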
-craig

"You can never have too many knives" -- Logan Nine Fingers
priyadarshikunal
Premium Member
Posts: 1735
Joined: Thu Mar 01, 2007 5:44 am
Location: Troy, MI

Post by priyadarshikunal »

Also, I suggest you read the Parallel Job Developer's Guide to understand the different partitioning methods, so that you can choose the correct partitioning strategy in future.
Priyadarshi Kunal

Genius may have its limitations, but stupidity is not thus handicapped. :wink: