Search found 3045 matches

by vmcburney
Wed Apr 11, 2007 5:45 am
Forum: General
Topic: ownership History of datastage
Replies: 3
Views: 1881

I interviewed the initial and chief architect in Lee Scheffler interview - The ghost of DataStage past with some history in it but not all the ownership details. Do a forum search for vmark and you might find some threads about it.
by vmcburney
Mon Apr 09, 2007 8:13 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: DataStage vs COBOL
Replies: 12
Views: 5590

Don't forget DataStage for Unix System Services on the mainframe. This is a massively parallel product that really leverages the muscle of a mainframe and does not need to generate COBOL code. Might be cheaper than the 360 and you can leverage it for off mainframe processing as well.
by vmcburney
Thu Apr 05, 2007 5:02 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Use of copy Stage In PX
Replies: 2
Views: 1105

If you want to use reference data more than once, or use different columns from the same table/file in more than one lookup, the copy stage is excellent for performance. The source data is converted into native parallel data just once and used in different places in the job.
by vmcburney
Tue Apr 03, 2007 10:46 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Using Transformer to Capture Duplicate Records
Replies: 4
Views: 4264

Good tip, much better than mine since you have to sort the data to do a remove duplicates you might as well add the key change property when you sort.
by vmcburney
Tue Apr 03, 2007 5:35 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Using the Change Capture stage
Replies: 4
Views: 1494

To merge the two files you would need to run Change Capture AND Change Apply. Otherwise you would only end up with output from the primary file and nothing from the lookup file. You might be better off with the Merge stage that runs updates from the lookup file against the primary file in one step.
by vmcburney
Tue Apr 03, 2007 5:32 pm
Forum:
Topic: Metastage scalability
Replies: 8
Views: 3421

To address the original question - you should be able to upgrade your MetaStage to version 8. The MetaStage repository gets replaced by the Metadata Server. The Business Glossary 7 becomes a better browser based version 8. Some of the MetaStage client tools are missing until IBM release the Metadata...
by vmcburney
Tue Apr 03, 2007 5:27 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Rangelookup
Replies: 8
Views: 2494

It still makes no sense. Why is there only one lookup row? Does this row have those actual values it in or is this some type of pseudo code? It looks like you only join on about 10 columns and then use the pseudo code columns (such as NOT50 and 200-300) to build Stage Variable business rule code. Bu...
by vmcburney
Tue Apr 03, 2007 12:09 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Using Transformer to Capture Duplicate Records
Replies: 4
Views: 4264

A transformer can use stage variables to remember values from the previous row. You do this by using one stage variable to compare and several to hold the comparison values. If you remember that Stage Variables are executed in the order they appear this makes it easy to compare adjacent rows of data...
by vmcburney
Tue Apr 03, 2007 12:03 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Job Warnings Turn OFF @ Project level
Replies: 8
Views: 2241

I think you can use the message handler to not only demote a warning to information but to suppress the message altogether. As Ray says this is a high risk approach. Each warning message may be telling you of a row of data being dropped from your job or an insert statement that hasn't work. You want...
by vmcburney
Mon Apr 02, 2007 5:29 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: File & data level validations
Replies: 8
Views: 2354

There are implicit metadata validation and conversions that the stages perform without any code and explicit validation and conversion that you can encode into a Modify or Transformer stage via functions. Generally if data comes from a database most of it can be left as implicit - an integer field o...
by vmcburney
Sun Apr 01, 2007 5:26 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: WHich is faster?
Replies: 9
Views: 2120

We don't have two cent coins in Australia any more Ray. Since they wiped out the 1 and 2 cent pieces you have to add your 5 cents worth.
by vmcburney
Sat Mar 31, 2007 4:07 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: WHich is faster?
Replies: 9
Views: 2120

It depends on your ratio of inserts to updates! Faster still is to split your input into two streams and send them to a bulk load for inserts and an update only for updates.
by vmcburney
Fri Mar 30, 2007 5:25 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: PX with Sequential File
Replies: 9
Views: 2496

If you don't mind support both server and parallel jobs on your site the server one will be a low fuss method - less warning messages for sequential file data. Parallel job becomes better if you have a sort requirement or you have more than a couple stages between the input and output. 2 nodes and 4...
by vmcburney
Wed Mar 28, 2007 4:38 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Exception Handler : Why use it.. and how to handle warnings
Replies: 20
Views: 6895

The sequence job exception handler is like a VB on error goto statement, it is there in case an unexpected error occurs somewhere in the sequence job code. A job failing is not really an unexpected error. It is something sequence jobs are designed to handle - you use triggers to decide what happens ...
by vmcburney
Sat Mar 24, 2007 6:11 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: For new Parallel Jobs which Version better 7.5 or Hawk?
Replies: 6
Views: 2585

Assuming that version 8 is not as stable (as is true of any version 0 of a product is) two incentives of accepting the risk and going with the new release would be the slowly changing dimension stage and the six new QualityStage stages. Both could save you a lot of dev time and make for more robust ...