Join Stage Design Question

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
saikramesh
Participant
Posts: 1
Joined: Wed Nov 17, 2004 4:14 am

Join Stage Design Question

Post by saikramesh »

I am trying to convert an sql to a datastage job. This excercise is to look for performance improvement in datastage, if any. The existing application unloads the data from informix database using unload command.

The sql is joining on 8 tables and out of these 8 tables, 3 are having million+ rows.

I have used ODBC stage to unload the data from informix for each table. Then I have joined the data using Join Stage. The join is done as below
Atable + BTable = AB.out
AB.out + CTable = ABC.out.....
and so on.....

The partitioning is set to "Auto". Joins are taking too long. My existing sql is taking around 2 minutes to execute and the joins are taking 10 minutes. Thats huge time !!.... Ne obvious performance improvements ??

One thing I can think of is that in every Join Stage operation, partitioning and sorting is taking a lot of time. But thats the requirement of Join Stage.

Any help in this regard is greatly appreciated.

Regards,
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

PX questions should really be over in the "Enterprise Edition" forum, that's where the PX gurus hang out and play. :wink:
-craig

"You can never have too many knives" -- Logan Nine Fingers
Post Reply