Rank and Priority in a Job

Nagasudheerkumar · Post by **Nagasudheerkumar** » Sun Jun 07, 2015 4:24 pm

I have a Scenario:

Input(text file):
ID  VALUE   CODE 
01    1000   DED
01    1000   DED
01    1000   DEP
01    1000   DEP
01    1000   INN
01    1000   INN
02    2000   INN
02    2000   DEP
02    2000   INN
02    2000   DEP
03    3000   INN
03    3000   INN
04    4000   DED
04    4000   DED

Output(table):
ID    VALUE   CODE
01    1000     DEP 
02    2000     DEP
03    3000     INN
04    4000     DED

Output should be DEP, if DEP is not there then INN, last Priority should be DED.
There are 5000 records which have duplicates like this, can anybody let me know how to accomplish this logic.

ray.wurlod · Post by **ray.wurlod** » Sun Jun 07, 2015 6:16 pm

Rather than rely on us to spoon-feed you a solution, please suggest what YOU have tried, and allow us to guide you.

There are probably several solutions - I would tend to one that encoded the CODE field into a sortable value, sort by that value, then use a Remove Duplicates stage. Paying close attention, of course, to correct partitioning.