Maximum Record Limit For lookup stage

Vignesh Shanmugam
Participant
Posts: 10
Joined: Mon Feb 22, 2010 1:58 am
Location: Chennai

Post by Vignesh Shanmugam »

Hi all,

I have been told by my senior DataStage developers not to use the Lookup stage if the record count exceeds 1 million.

Can anybody give an idea of the approximate number of records the Lookup stage can handle without crashing?

Please provide your input on this.

Currently I am using the Join stage for 5 million records, but I also need to handle rejects and lookup-failure conditions, which is very easy in the Lookup stage.

Thanks In Advance
DSShishya
Premium Member
Posts: 37
Joined: Tue Oct 27, 2009 9:43 pm

Re: Maximum Record Limit For lookup stage

Post by DSShishya »

Lookup stage can handle any number of records. There is no limit to that.

A general threshold of 1 million records is set just to be on the safer side for performance reasons.

It mainly depends on the resources allocated to the server machine, memory in particular.

Talk to the admin and find out how much memory is available on the server machine.
Sreenivasulu
Premium Member
Posts: 892
Joined: Thu Oct 16, 2003 5:18 am

Post by Sreenivasulu »

Instead of the Join stage you can use the Merge stage. It has reject links.

Regards
Sreeni
kumar_s
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

The limit is set by the physical memory allocated to your server. It comes down to the amount of reference data that can comfortably be held in memory without overrunning it.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

The upper bound of the memory usable by the Lookup stage is also dependent upon the platform implementation; see the threads on AIX (e.g. this thread). Quoting a figure of "1 million records" is a bit misleading: are the records 5 bytes wide or 500?
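To put rough numbers on the row-width point, here is a minimal back-of-the-envelope sketch in Python. The function name and the overhead_factor of 1.5 are assumptions made up for illustration; they are not DataStage figures, and the real footprint depends on the platform and the job configuration.

```python
def estimate_lookup_memory_bytes(row_count: int, avg_row_bytes: int,
                                 overhead_factor: float = 1.5) -> int:
    """Rough size of a reference data set held in memory.

    overhead_factor is a hypothetical allowance for index and
    bookkeeping overhead, not a documented DataStage value.
    """
    return int(row_count * avg_row_bytes * overhead_factor)


# Same record count, very different memory needs.
for width in (5, 500):
    size = estimate_lookup_memory_bytes(1_000_000, width)
    print(f"1,000,000 rows x {width} bytes/row -> about {size / 1024**2:,.1f} MiB")
```

The same "1 million records" differs by a factor of 100 in memory between the two widths, which is why a record count alone is not a useful threshold.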
nvdlrao
Participant
Posts: 8
Joined: Tue Jul 27, 2010 12:28 pm

Post by nvdlrao »

Sreenivasulu wrote:Instead of join stage you can use Merge stage. It has reject links.

Regards
Sreeni
The Join stage in DataStage will have a reject link from 8.2 onwards. The fact that the Join stage has no reject link is a mistake by the IBM Software Labs testers.
vdlrao
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

I disagree.

What does a "reject" mean in the context of a join?

When I join tables using SQL there are no rejects!
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
nvdlrao
Participant
Posts: 8
Joined: Tue Jul 27, 2010 12:28 pm

Post by nvdlrao »

ray.wurlod wrote:I disagree.

What does a "reject" mean in the context of a join?

When I join tables using SQL there are no rejects! ...
A "reject" link on the Join stage means that records which do not have a matching key go to the reject link. The fact that the Join stage has no reject link is a mistake by the IBM Software Labs testers. From 8.2 onwards DataStage will have a reject link on the Join stage.
vdlrao
asorrell
Posts: 1707
Joined: Fri Apr 04, 2003 2:00 pm
Location: Colleyville, Texas

Post by asorrell »

Actually, no - it wasn't a mistake. It was designed to provide the same exact functionality as SQL joins - none of which support a "reject" capability.

And at this point there is no such thing as 8.2; my understanding is that the next major release (currently in beta) may be called 8.5 when it ships.
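To illustrate the difference being argued about, here is a minimal Python sketch, not DataStage code. It contrasts lookup-style processing, where unmatched rows leave on a separate reject output, with a left outer join, where every row comes out and "rejects" have to be filtered downstream by testing a reference column for null. All names and sample data are hypothetical. The join uses an in-memory dict only for brevity; a real Join stage works on sorted inputs rather than an in-memory table.

```python
from typing import Any, Dict, List, Tuple

Row = Dict[str, Any]


def lookup_with_reject(source: List[Row], reference: List[Row],
                       key: str) -> Tuple[List[Row], List[Row]]:
    """Lookup-style: unmatched source rows go to a separate reject list."""
    # The whole reference set is indexed in memory, so its size matters.
    ref_index = {row[key]: row for row in reference}
    matched, rejected = [], []
    for row in source:
        ref = ref_index.get(row[key])
        if ref is None:
            rejected.append(row)
        else:
            matched.append({**row, **ref})
    return matched, rejected


def left_outer_join(source: List[Row], reference: List[Row],
                    key: str, ref_columns: List[str]) -> List[Row]:
    """Join-style: every source row is output; unmatched rows carry None."""
    ref_index = {row[key]: row for row in reference}
    empty = {col: None for col in ref_columns}
    joined = []
    for row in source:
        ref = ref_index.get(row[key])
        joined.append({**row, **(ref if ref is not None else empty)})
    return joined


orders = [{"cust_id": 1, "amount": 10},
          {"cust_id": 2, "amount": 20},
          {"cust_id": 9, "amount": 5}]
customers = [{"cust_id": 1, "name": "Asha"},
             {"cust_id": 2, "name": "Ben"}]

matched, rejected = lookup_with_reject(orders, customers, "cust_id")
joined = left_outer_join(orders, customers, "cust_id", ["name"])
# A downstream filter plays the role of a reject link after the join.
join_rejects = [row for row in joined if row["name"] is None]
```

The null test only works if the tested reference column can never legitimately be null in matched rows, so pick a non-nullable column for it.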
Andy Sorrell
Certified DataStage Consultant
IBM Analytics Champion 2009 - 2020
nvdlrao
Participant
Posts: 8
Joined: Tue Jul 27, 2010 12:28 pm

Post by nvdlrao »

asorrell wrote:Actually, no - it wasn't a mistake. It was designed to provide the same exact functionality as SQL joins - none of which support a "reject" capability.

And at this point there is no such thing as 8.2, my understanding is that the next major release (currently in beta) may be called 8.5 when it ships
I am not sure what the version after 8.1 will be called, but the Join stage is going to have a reject link from the next version onwards. And what about the Merge stage in DataStage? Was it designed to provide exactly the same functionality as SQL MERGE? Does it not support a reject link?
vdlrao
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Merge is not SQL functionality - it implements a "master and update" mechanism.
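As a rough illustration of what a "master and update" mechanism looks like, here is a minimal Python sketch. It is not the Merge stage itself: the function name, the keep_unmatched_masters flag and the sample rows are hypothetical, and it assumes keys are unique within each input.

```python
from typing import Any, Dict, List, Tuple

Row = Dict[str, Any]


def merge_master_update(master: List[Row], updates: List[Row], key: str,
                        keep_unmatched_masters: bool = True
                        ) -> Tuple[List[Row], List[Row]]:
    """Apply update rows to master rows; unmatched update rows are rejected."""
    update_index = {row[key]: row for row in updates}
    used_keys = set()
    output = []
    for m in master:
        u = update_index.get(m[key])
        if u is not None:
            used_keys.add(m[key])
            output.append({**m, **u})   # master row enriched by its update
        elif keep_unmatched_masters:
            output.append(dict(m))      # master row passed through unchanged
    # Update rows that never found a master are the "rejects".
    rejects = [u for u in updates if u[key] not in used_keys]
    return output, rejects


masters = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]
updates = [{"id": 2, "city": "X"}, {"id": 3, "city": "Y"}]
merged, rejects = merge_master_update(masters, updates, "id")
```

Here the rejects come from the update side, which is a different notion from the unmatched source rows of a lookup.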
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
priyadarshikunal
Premium Member
Posts: 1735
Joined: Thu Mar 01, 2007 5:44 am
Location: Troy, MI

Post by priyadarshikunal »

A MERGE statement does exist in Oracle, and to capture rejects it uses the WHEN NOT MATCHED THEN clause. I think it was introduced in Oracle 9i. However, I don't know whether they took the idea from ETL tools or ETL tools tried to mimic Oracle's MERGE statement.

Well, unless you know the exact timeline of its first use in SQL or in an ETL tool, it looks like a chicken-and-egg question.
Priyadarshi Kunal

Genius may have its limitations, but stupidity is not thus handicapped. :wink: