Search found 3045 matches

by vmcburney
Tue Nov 28, 2006 3:58 pm
Forum: General
Topic: Bob Zurek's Blog (Infomatica Advertisements)
Replies: 2
Views: 1685

I am blogging on this report later in the week. I call it a draw, Informatica ahead on Ability to Execute, IBM ahead on Visionary, both the only vendors in the leader quadrant, daylight to third. Great to see Informatica sparring with Bob on his blog. Hopefully vendors are becoming more aware of the...
by vmcburney
Tue Nov 28, 2006 4:09 am
Forum: Data Integration
Topic: IBM Information Server release date November 30
Replies: 1
Views: 9091

They barely snuck into November. You should ask for your IoD entry fee back!
by vmcburney
Mon Nov 27, 2006 9:01 pm
Forum: General
Topic: Unix vs Sql Severs
Replies: 2
Views: 2008

What do you have running on those Windows servers? Do you have both the target database and the DataStage engine on the same machines? If so you can improve performance by moving the database onto its own server. You can also upgrade your DataStage server with more RAM and look into upgrading to Ent...
by vmcburney
Mon Nov 27, 2006 6:43 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Handling special character in XML
Replies: 2
Views: 1070

That's tricky. I wonder if those characters makes it badly formed XML. You could try preprocessing the file using XLS. Do a forum search on XLS for previous threads, I don't know a lot about it myself. You could also try stripping out these characters using an operating system script such as search ...
by vmcburney
Mon Nov 27, 2006 6:41 pm
Forum: General
Topic: Type II Dimension Implementation Using DS 7.1
Replies: 6
Views: 3152

I think in the current version of DataStage Server jobs you can find the CRC32 function on the right mouse click menu within a Transformer stage. It is now part of the standard DataStage install. Generating surrogate keys or sequence numbers is easy, just create a Stage Variable in your transformer ...
by vmcburney
Sun Nov 26, 2006 5:23 pm
Forum: General
Topic: Type II Dimension Implementation Using DS 7.1
Replies: 6
Views: 3152

They are on server edition, they would need to procrastinate and come up with a bundle of money to upgrade to enterprise edition. The tricky part is change data capture. You need to check to see if incoming rows supersede existing rows. You can either write a long statement in a transformer comparin...
by vmcburney
Thu Nov 23, 2006 5:49 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Staggered Sequential Key
Replies: 3
Views: 1134

In your DataStage Director open up the job monitor, from a right mouse click display all stage instances, what you should see in your row counts is that one instance of the Transformer processed about 50 rows and the other instance processed 66 rows. That gave Transformer instance 1 the odd numbers ...
by vmcburney
Wed Nov 22, 2006 6:20 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: New Version of Datastage
Replies: 14
Views: 3941

Parallel edition does give you C++ for transformations. Microsoft SSIS will also be allowing C programming for transformations in a future version. Comparisons of server jobs versus parallel jobs shows the C++ of parallel jobs is many times more efficient so for pure performance I don't see a reason...
by vmcburney
Mon Nov 20, 2006 11:35 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Key generator for large volume
Replies: 6
Views: 1346

I would stick with a simple meaningless counter where the seed value is the current maximum value in the target table. I wouldn't get fancy concatenating intelligent fields to derive a dumb surrogate, it would just take longer. You can use the surrogate key stage or you can also generate a key in a ...
by vmcburney
Mon Nov 20, 2006 11:33 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: ETL Comparison
Replies: 8
Views: 2307

Normally comparing jobs via export file comparison, html report comparison or reporting assistant querying can be quite difficult unless you are focusing on a particular aspect of the jobs. In your case database output stage comparisons. This narrow requirement makes it worthwhile building a custom ...
by vmcburney
Mon Nov 20, 2006 4:41 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: ETL Comparison
Replies: 8
Views: 2307

Generated SQL code does not appear in export or reports because it is generated, not stored! Your SQL statement is built from the input column list, the update type and the key fields. If you compare the column lists of output stages this gives you most of the SQL statement logic. When you see a htm...
by vmcburney
Mon Nov 20, 2006 4:35 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: New Version of Datastage
Replies: 14
Views: 3941

I had a couple blogs about it, IBM Information Server for Dummies and the old My top ten features in DataStage Hawk and Ray had a good overview of the next release in The Product Formerly Known as Hawk.
by vmcburney
Thu Nov 16, 2006 7:19 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Replace simple px jobs by server jobs DS 7.5.2
Replies: 8
Views: 2203

Change data capture or replication between the databases so you don't have to move all the data every day. Both of these are not in the standard DataStage product though they are both options for the IBM Information Server along with DataStage 8.
by vmcburney
Thu Nov 16, 2006 12:42 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Replace simple px jobs by server jobs DS 7.5.2
Replies: 8
Views: 2203

We get so many server job to parallel job questions, this might be the first conversion the other way, are you completely dropping parallel edition? There is a summary in DataStage tip: What do you mean I need to optimise small jobs? You can add a config file job parameter to your job to restrict a ...
by vmcburney
Wed Nov 15, 2006 10:05 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Surrogate key generation, update and use within a single job
Replies: 2
Views: 1354

I don't think you can use and update a lookup in the one job unless you used an inefficient sparse database lookup. There is a new function in DataStage 8 to update in memory lookups. In server edition you can read from and write to a hash file but it doesn't work the same way in parallel jobs. You ...