Hash file equivalent in parallel job

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

edward_m
Charter Member
Posts: 257
Joined: Fri Jun 24, 2005 9:34 am
Location: Philadelphia,PA

Hash file equivalent in parallel job

Post by edward_m »

I am trying to design a job that loads data into an Oracle table. Before the load, I would like to check each incoming record against the table's primary key, which consists of four columns: if the record already exists, log it to a file; if not, insert it.
In a server job I would have created a hash file keyed on the primary key, then used a hash lookup and a constraint in the Transformer.
Is there similar functionality in a parallel job? I think we could use the Lookup stage, but I am not sure which constraints to use.
Could you please point me in the right direction?

Thanks in advance for your help.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

A hash file is a file used for shaping a block of hash.

DataStage has hashed files.

There is no equivalent in the parallel engine.

But it seems to me that a Lookup stage with a reject output link would serve your purpose adequately. Just be aware that the reference data set must fit into memory.
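
To make the routing concrete, here is a minimal Python sketch of the logic only. It is not DataStage code, and every name in it (`Row`, `key_of`, `split_by_lookup`, the sample keys) is hypothetical. It assumes the existing primary keys have already been read from the Oracle table into an in-memory set, which is the same "must fit into memory" constraint noted above. Rows whose four-column key is found go to the log file; rows that fail the lookup (the reject side) are the ones to insert.

```python
from typing import Iterable, List, NamedTuple, Set, Tuple

Key = Tuple[str, str, str, str]


class Row(NamedTuple):
    # Hypothetical record layout: four primary-key columns plus one data column.
    k1: str
    k2: str
    k3: str
    k4: str
    payload: str


def key_of(row: Row) -> Key:
    """Build the composite four-column key used for the lookup."""
    return (row.k1, row.k2, row.k3, row.k4)


def split_by_lookup(
    incoming: Iterable[Row], reference_keys: Set[Key]
) -> Tuple[List[Row], List[Row]]:
    """Return (matched, rejected) rows.

    matched  = key already exists in the reference set -> log to file
    rejected = lookup failed                           -> insert into table
    """
    matched: List[Row] = []
    rejected: List[Row] = []
    for row in incoming:
        if key_of(row) in reference_keys:
            matched.append(row)
        else:
            rejected.append(row)
    return matched, rejected


# Assumed sample data: keys already present in the target table.
existing_keys: Set[Key] = {("A", "1", "X", "2024"), ("B", "2", "Y", "2024")}

incoming_rows = [
    Row("A", "1", "X", "2024", "dup"),  # key exists, so it goes to the log file
    Row("C", "3", "Z", "2024", "new"),  # key not found, so it goes to the insert
]

matched, rejected = split_by_lookup(incoming_rows, existing_keys)
```

In a parallel job the reference set would come from the reference link into the Lookup stage, with the primary output link playing the part of `matched` and the reject link playing the part of `rejected`.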
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
edward_m
Charter Member
Posts: 257
Joined: Fri Jun 24, 2005 9:34 am
Location: Philadelphia,PA

Post by edward_m »

Thanks for your help.