Can anyone offer any advice on a job design to perform a n-way outer join?
I have a job to build a datamart FACT table using various other tables. It does an 8-way outer join like this:
Code: Select all
DB2 UDB -->
-> JOIN -->XFORM -->
DB2 UDB --> > JOIN --> XFORM -- >
DB2 UDB --> > JOIN -->
DB2UDB -->
etc...
The net result of the job is about 3 million keys, but there are far fewer keys coming from the source tables on the input to some of the joins.
I'm wondering if there is any way round having all the transformers because they add to the CPU cost of running the job?
The job is being run on a single node configuration, DataStage is combining the joins and transformers into a single process, but using APT_DISABLE_COMBINATION I can see that the transformers take almost as much elapsed time and CPU as the joins.