Search found 53125 matches

by ray.wurlod
Fri May 30, 2008 4:04 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Configuration file best practice
Replies: 1
Views: 935

Having oversized configuration files is a waste of time and effort. All the nodes have to be started even if they process no rows. DataStage parallel jobs move data in chunks not smaller than 32KB (and often larger). Best practice, therefore, is to have a selection of configuration files available, ...
by ray.wurlod
Fri May 30, 2008 4:02 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Performance improvement for Lookup
Replies: 1
Views: 958

Data Set without a doubt. The other two require export/import or index-building operations. The operator to write/read a Data Set is copy (which tells you that it's very low cost - the virtual Data Set associated with the link is simply copied to/from the Data Set's data files on disk).
by ray.wurlod
Fri May 30, 2008 3:59 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: ETLStats - how to obtain correct link counts for PX jobs
Replies: 1
Views: 1271

Read the help for DSGetLinkInfo(). It can obtain single node count or total count. You can adapt ETLStats routines to suit a particular need. That said, I am surprised at your assertion, and await Kim's response to it.
by ray.wurlod
Fri May 30, 2008 3:57 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Determine end of input data
Replies: 14
Views: 4227

Yes you can.

Even in parallel jobs, before-job and after-job subroutines are written in DataStage BASIC and executed by the conductor process.
by ray.wurlod
Fri May 30, 2008 3:56 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Username info needs to be captured
Replies: 4
Views: 1284

Or you could essay GetEnvironment("USER") in a Transformer stage.
by ray.wurlod
Fri May 30, 2008 3:53 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: executing Orch admin
Replies: 6
Views: 2601

For test purposes you might even like to set APT_CONFIG_FILE explicitly in the routine under test.
by ray.wurlod
Fri May 30, 2008 3:49 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: splitting records into multiple files
Replies: 1
Views: 890

Doesn't Search work for you?
:roll:

Learn about the UNIX split command.
by ray.wurlod
Fri May 30, 2008 3:47 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Converting a nullable source to a non-nullable result
Replies: 17
Views: 6320

Please post the exact, complete error message and the record schema associated with the link.
by ray.wurlod
Fri May 30, 2008 3:45 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: How to know the update date and time of a source
Replies: 8
Views: 1952

Tip: create/import a table definition with "metadata supports multi-valued fields" check box checked. Then use a UniVerse stage to extract from DS_AUDIT. Use two output links, one un-normalized and the other normalized on MODS.
by ray.wurlod
Fri May 30, 2008 3:41 pm
Forum: IBM QualityStage
Topic: Using PX Data Sets Instead of QS Fixed width Flat Files
Replies: 0
Views: 1196

The piece of the manual that you have quoted says it all - you need to create a Data Set with one field, of type Raw, that contains the data record specified by the QualityStage file definition. There are several ways you might accomplish this, such as a StringToRaw() function or reading your file w...
by ray.wurlod
Fri May 30, 2008 6:57 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Source with zero rows but target with one row
Replies: 6
Views: 1936

Here is a sample of how you might go about it; I have supplied a suitable after-job subroutine. Full error handling has been omitted for clarity. SUBROUTINE NoRows(InputArg,ErrorCode) $IFNDEF JOBCONTROL.H $INCLUDE DSINCLUDE JOBCONTROL.H $ENDIF DEFFUN UtilityRunJob(Arg1,Arg2,Arg3,Arg4)...
by ray.wurlod
Fri May 30, 2008 6:41 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Regarding Checkpoing
Replies: 3
Views: 1355

Restart works by a checkpoint being created for each successfully completed activity. On restart, if checkpoints exist, DataStage skips any checkpointed activity. Sometimes, however, you need a particular activity to run even though it completed successfully on the previous run - for example an acti...
by ray.wurlod
Fri May 30, 2008 6:37 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Join Header and Tralier
Replies: 12
Views: 3678

Or, indeed, in an External Source stage.
by ray.wurlod
Fri May 30, 2008 6:35 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Determine end of input data
Replies: 14
Views: 4227

ETLStats is still applicable.

The only way to aggregate over the entire data set is to run the Aggregator stage in a single node (that is, sequentially). You can use a Sort/Merge collector to preserve any existing sorting.
by ray.wurlod
Fri May 30, 2008 6:34 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Multiple readers per node loading into ODBC
Replies: 10
Views: 12166

The ODBC Enterprise stage will establish one connection to the data source per processing node defined in the configuration file.