Search found 53125 matches

by ray.wurlod
Sun Apr 03, 2005 6:34 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Perfromance issue in data stage
Replies: 4
Views: 1759

I agree with the foregoing with the exception that you can use Oracle sequences. The trick is, you use them in a reference lookup when loading the file for sqlldr to process subsequently. Point an OCI, DRS or ODBC stage at Oracle, and have it execute do something like SELECT ROWNUM, SEQUENCE_NAME.NE...
by ray.wurlod
Sun Apr 03, 2005 6:24 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: how to use isnull
Replies: 4
Views: 1539

The IsNull() function can take any data type as its argument, and returns 1 (= true) if the argument is null, or 0 (= false) if the argument is not null. The only time you can get an error - which is actually a warning - is if you try to use an unassigned variable as the argument. This can happen be...
by ray.wurlod
Sun Apr 03, 2005 6:18 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: WriteHash failure
Replies: 5
Views: 1202

Simple arithmetic, though with big numbers

With 120 million records it's almost certain that 64-bit addressing would be needed - divide 2GB by 120 million and you don't get very large records (17 bytes). There is approximately 14 bytes per record storage overhead with 32-bit addressing, so that would leave only three bytes per record for dat...
by ray.wurlod
Sun Apr 03, 2005 6:13 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Order of execution
Replies: 4
Views: 1340

Almost certainly PX will be giving you "pipeline parallelism", in which a downstream stage can be processing row #1 while an upstream stage has already begun processing row #2 (even though row #1 is still somewhere in the job). You can code to prevent this, but that's defeating one of the things tha...
by ray.wurlod
Sun Apr 03, 2005 1:02 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: LOOK UPS FOR RELATIONAL OPERATORS
Replies: 1
Views: 953

You need to specify the search columns as "Key". This means that in the SQL there will be a parameter marker in a WHERE clause for each such column. For example

Code: Select all

WHERE BOOKED_DATE >= ? AND BOOKED_DATE <= ?
by ray.wurlod
Sun Apr 03, 2005 12:59 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Handling multipls files.
Replies: 15
Views: 4834

FUNCTION GetFileNames(FilePath) Command = "ls -1rt " : FilePath Ans = "" * Capture a list of file names (note option "1" in ls command, so that * output contains ONLY a list of file names, one per line). Call DSExecute("UNIX", Command, Out...
by ray.wurlod
Sun Apr 03, 2005 12:47 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Problem while executing a job
Replies: 2
Views: 1084

Check that the library search path (SHLIB_PATH, LIBPATH or LD_LIBRARY_PATH, depending on your Unix) is correctly specified in the dsenv script in the DataStage Engine directory. This script defines environment variables for all DataStage processes, including agent processes of connected clients.
by ray.wurlod
Sat Apr 02, 2005 9:05 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: NOT COLUMN OF INSERTED/UPDATED TABLE
Replies: 4
Views: 2601

Only to observe that that message is generated by DB2, not by DataStage. View the SQL that DataStage is generating. Use the tracing facilities that allow you to see the SQL that DB2 is processing. Figure out why they might be different. Ensure that the SQL you view in the DataStage job does actually...
by ray.wurlod
Sat Apr 02, 2005 8:59 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Row count
Replies: 6
Views: 1868

Having checked the "first line is column names" in the Sequential File stage output link properties (Format tab), you don't need to worry about the first line; you'll never see it in the data processed by your DataStage job. Effectively the Sequential File stage discards the first line as part of re...
by ray.wurlod
Sat Apr 02, 2005 8:55 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Handling multipls files.
Replies: 15
Views: 4834

Create an earlier Routine Activity to return the comma-separated list of filenames. When supplying the job parameter value, click on Use Job Parameter and choose the return value from the Routine Activity. You could also use an Execute Command Activity, but you'd have to create a script to turn the ...
by ray.wurlod
Fri Apr 01, 2005 4:23 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Routines issue while upgrading DS
Replies: 4
Views: 1183

The project directory is on the server machine. Log in on the server machine and cd to the project directory. The log is a text file; you can view it with any text viewer, such as more, cat, vi and so on.
by ray.wurlod
Fri Apr 01, 2005 4:18 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Row count
Replies: 6
Views: 1868

Code: Select all

@INROWNUM > 1
will work.

Note, however, if you are feeding this from a Sequential File stage with "first line is column names" set, then the first line has already been removed by the time the data flow reaches the Transformer stage.
by ray.wurlod
Fri Apr 01, 2005 4:17 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: hash file parameters
Replies: 2
Views: 702

The default settings for a hashed file will quite happily handle 1 million records totalling 1GB. Minimum modulus will allow you to pre-allocate disk space. If you set that accurately then split load is extraneous; it's the threshold at which the hashed file expands to hold more data. Merge load is ...
by ray.wurlod
Fri Apr 01, 2005 4:12 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: SQL in Routines
Replies: 7
Views: 2253

I have to split data from a TEXT field in SQL Server into strings of 72 chars breaking the string at a "Space" or "Enter". I am doing this in a Routine by passing in the TEXT filed. OK, you don't need SQL in the routine for this. FUNCTION Split72(TheText, FileName) SplitText = Fmt(TheTe...
by ray.wurlod
Fri Apr 01, 2005 4:03 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: SQL in Routines
Replies: 7
Views: 2253

There is a library of functions collectively called the BASIC SQL Client Interface that mimics the ODBC 2.0 API. If you use this you would not (probably) be limited to 72 characters per line, but would need to have an ODBC data source name configured. Why can't you use DataStage components to perfor...