Finding Distinct Value

Posted: Tue Jul 28, 2015 4:00 am
by ani_chutti@yahoo.co.in
Dear All, I have two sequential files, 1 and 2. I want to get the distinct values in Seq file 1 that are NOT in Seq file 2. For example:

Seq 1 : 10,20,30,40
Seq 2 : 10,30,50,60
My output should show only 20,40

Please let me know the job design. I don't need the values common to both files.

Posted: Tue Jul 28, 2015 6:00 am
by rkashyap
If the key values in Seq1 and Seq2 are in a single row, then first Pivot them to create one row per key.

Subsequently, in a Lookup stage, use Seq1 as the primary link and Seq2 as the reference lookup. The rejects will contain the values from Seq1 that are not present in Seq2.
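
To illustrate the logic outside DataStage, here is a minimal Python sketch of the same two steps: pivot each single-row file into one key per row, then keep the primary keys that fail the lookup against the reference. The function names pivot and lookup_rejects are hypothetical and are not DataStage APIs. The sketch also assumes the keys are comma-separated on one line, as in the example above.

```python
def pivot(row: str) -> list[str]:
    """Split one comma-separated row into one key per element (assumed layout)."""
    return [value.strip() for value in row.split(",") if value.strip()]


def lookup_rejects(primary: list[str], reference: list[str]) -> list[str]:
    """Return primary keys with no match in the reference, preserving order."""
    reference_keys = set(reference)  # keyed reference, built once for fast lookups
    return [key for key in primary if key not in reference_keys]


seq1 = pivot("10,20,30,40")  # primary link
seq2 = pivot("10,30,50,60")  # reference link
print(lookup_rejects(seq1, seq2))  # ['20', '40']
```

The values that exist only in Seq2 (50 and 60) never appear in the output, because only the primary link's rows are tested against the reference.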

Posted: Tue Jul 28, 2015 6:43 am
by chulett
So... a hashed file lookup since there's no 'Lookup stage' in a Server job. :wink: