Join logic

rafidwh · Post by **rafidwh** » Sat Jan 09, 2010 12:59 pm

Hi All,
Need help in implementing the following logic

I have three files as shown in the below given format

File1

ID IND
1 Y

File2

ID IND
1 Y
1 X
1 A
1 X

File3
ID IND
1 Y
1 Z

All the above 3 files are the reference files and the input file or the main file will look like

Main File
ID IND1 IND2 IND3
1 - - -

The output should be like

ID IND1 IND2 IND3
1 Y X A

IND1 = Y since it found match in first file
IND2 = X (Since it has more occurances than A)
IND3 = A (Since it is the next most dominant in the second file)

We will pull the indicator value from third file only when we dont find any match from 1st and 2nd file.

Thanks in Advance

ray.wurlod · Post by **ray.wurlod** » Sat Jan 09, 2010 1:39 pm

Server or parallel job? You have posted in the server forum but marked the job as parallel. The answer will be different depending upon job type.

rafidwh · Post by **rafidwh** » Sun Jan 10, 2010 12:06 am

Ray,

Sorry for posting on server forum. Its a parallel job.

Thanks!

rafidwh · Post by **rafidwh** » Sun Jan 10, 2010 10:54 pm

Hi,

Any idea team.

Sainath.Srinivasan · Post by **Sainath.Srinivasan** » Mon Jan 11, 2010 4:10 am

That must be straight-forward. What have you tried so far ?

Please post your attempts so others can guide you rather than asking others to 'do it for you'.

Anyhow, one method will be
1.) Get ind1 from your reference 1
2.) Aggregate the values in your files and sort by desc id and count.
3.) Take first two rows for ind 2 and ind 3.

I will leave this logic to your imagination.

ray.wurlod · Post by **ray.wurlod** » Mon Jan 11, 2010 4:02 pm

Waiting for question to be posted in Parallel forum.

If I were to answer the question here, my answer would pertain to server jobs and waste both your time and mine.