Search found 15603 matches

by ArndW
Tue Aug 14, 2007 3:35 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: DataStage corruption
Replies: 8
Views: 5681

eldonp - At my last site, a large AIX implementation, I noticed that several times on some of the engine that dynamic files were corrupted, upon further inspection it seems that the DATA.30 files were *poof* gone. Over the course of months this happened several times and I was never able to track do...
by ArndW
Tue Aug 14, 2007 3:25 pm
Forum: General
Topic: How to concatenate multiple mainframe files in FTP stage?
Replies: 2
Views: 1222

Just cat them together from the Windows command line; i.e. "a.txt + b.txt + c.txt"
by ArndW
Tue Aug 14, 2007 3:23 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Sybase Stage error
Replies: 12
Views: 2518

Are you stating that if you take the same query and put it into your PX sybase stage and write that to a sequential file with no other stages in the job, you get only half as many rows?
by ArndW
Tue Aug 14, 2007 3:11 pm
Forum: General
Topic: Datastage Job Hangs
Replies: 4
Views: 1685

There isn't enough information to make any sort of an analysis; it is like asking a technician at the auto shop to "sometimes my car won't start and sometimes it stalls in traffic. What is wrong?".

Replace your database output stages with peek stages to see if the hangs are caused by writes.
by ArndW
Tue Aug 14, 2007 5:27 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: How to handle the null values
Replies: 1
Views: 909

Although this is a topic that has seen a lot of posts, I'll make a stab at an answer. Somewhere in your stage you are using DMA_CODE without having handled nulls. Remember, this also includes use of constraints.
by ArndW
Tue Aug 14, 2007 5:22 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Partitioning Method in Sort Stage
Replies: 7
Views: 1667

No advantages to using all 5 at all, just performance disadvantages! You've hit the nail on the head, it is sufficient to partition on just one column.
by ArndW
Tue Aug 14, 2007 5:17 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Fatal Error: sortRec
Replies: 11
Views: 13618

Do you expect a column called "sortRec" in your input stream? What stages do you have? Have you tried to disable combination to see if you get a stage name that is causing the error?
by ArndW
Mon Aug 13, 2007 11:10 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Risks / Breakpoints for DSEE
Replies: 6
Views: 1868

The &PH& file (or directory) comes from the term "phantom" occasionally seen in DataStage; this was the name given to processes executed in the background and their standard output (at the time called como files, hence the existence of the now little-used &COMO& directory) would be s...
by ArndW
Mon Aug 13, 2007 10:55 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: renaming of files after extraction
Replies: 11
Views: 2848

You can issue the UNIX command "orchadmin dump {yourfile} > {sequential file}" to create a sequential file. But doing this from within the job that generates the dataset is useless. How about generate a fileset - you get sequential files but still have the advantage given by PX parallelism?
by ArndW
Mon Aug 13, 2007 10:52 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Partitioning Method in Sort Stage
Replies: 7
Views: 1667

Ideally you have your data hash partitioned on your 5 columns, so that each node at runtime only needs to sort data in its own stream and does not need to repartition.
by ArndW
Mon Aug 13, 2007 8:13 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Error handling aproaches
Replies: 2
Views: 770

That is too broad a question, similar in scope to me asking "Can someone tell me about what approach to take with politics".
DataStage does what you tell it to do; so you need to know what you would like to do and then the appropriate question is "How can I make DataStage do it that way".
by ArndW
Mon Aug 13, 2007 5:33 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Risks / Breakpoints for DSEE
Replies: 6
Views: 1868

Why would you want to repeatedly backup your production projects? The only things that should change regularly are the log contents. I'd only backup after applying new jobs or changes and perhaps ensure quarterly backups - but daily or weekly would be overkill and only cause overhead. On an annual b...
by ArndW
Mon Aug 13, 2007 5:21 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Errors from custom routine but only when running job
Replies: 3
Views: 1069

Yes, the line numbers are referring to the generate code. And yes, there is an offset that sometimes needs to be applied. Ever since the dawn of time (well, call it 1982 when I started using UniVerse) there has been an issue with the compiler getting its line numbers incorrect on some lines (stupidl...
by ArndW
Mon Aug 13, 2007 5:05 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: PX:Merge records
Replies: 7
Views: 2064

You mean add the record length of each row to the end of the row, or keep a total of the sum of all lengths in a stage variable?
by ArndW
Mon Aug 13, 2007 4:11 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Risks / Breakpoints for DSEE
Replies: 6
Views: 1868

The important one is to ensure that the &PH& directory gets flushed regularly. This is usually done by setting the autopurge at a project and job level, but if that fails for some reason this directory can fill up quickly on a busy system and can cause real job slowdowns. If you put in a dai...